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Preface 


The modern development of the Kelly, or growth optimal, approach to allocating invest- 
ments began with J.L. Kelly's seminal 1956 paper, over two hundred years after Daniel 
Bernoulli's 1738 introduction to the notion of logarithmic utility. Kelly's paper was fol- 
lowed by Latané's (1959) intuitive economic analysis and theoretical advances by Breiman 
(1960, 1961). Breiman showed that the Kelly maximization of expected utility with a log- 
arithmic utility function also maximized the long run asymptotic growth of wealth while 
minimizing the expected time to reach arbitrarily large goals. Thorp (1962, 1966, 1969, 
1971) pioneered the application of the Kelly criterion to actual gambling and investment. 
Ziemba and Vickson (1975) surveyed the literature presenting key papers, introductions 
and problems up to that time; for an update, see Ziemba and Vickson (2006). Algoet 
and Cover (1988) generalized the Breiman results to wider classes of assets and arbitrary 
ergodic market processes. MacLean, Ziemba and Blazenko (1992) show applications to 
a wide variety of sports and gambling events following Hausch, Ziemba and Rubinstein's 
(1981) application to racetrack betting. 


Thorp (1960) suggested the term Fortune's Formula which later became the title of William 
Poundstone's 2005 book. This was in an abstract for a talk Thorp gave to the American 
Mathematical Society in January 1961, presenting his blackjack card counting discovery 
and his use of the Kelly approach to size bets in favorable situations. The term Kelly 
criterion appears to date from Thorp (1966) and is used in Thorp (1969) 


Over the years both theory and practice have developed prolifically. The theory has been 
extended to managing portfolios of investments, results have been obtained for a broad 
range of distributional assumptions, the simultaneous management of assets and liabilities 
has been elaborated upon, and the various properties, advantages and disadvantages have 
been clarified. 


We now have a fuller understanding of the tradeoff between risk and reward for fractional 
Kelly versus full Kelly and for Kelly subject to minimizing the underperformance of à 
benchmark or specified desired wealth path. 


The theory has also benefitted from the practical experience of gamblers, traders, hedge 


v 


xvi Preface 


fund managers and investors, especially from some of the greatest investors. 


In this volume we present a selection of many of the most important papers from the now 
vast and growing literature on the subject. While we could not publish all the important 
papers, we feel that the main results appear here. 


The volume is organized into six sections that cover the early ideas and contributions, 
classic papers and theories, relations to asset allocation including optimization with with- 
drawals, fractional Kelly wagering and its relations to benchmarks, assessing the good and 
bad properties of Kelly wagering, utility foundations and the use of Kelly type strategies 
by various investors including the greatest investors. 


We thank our authors for their contributions, those who helped us with the editing and 
production especially Sandra Schwartz and our publisher, World Scientific, for their pro- 
duction and promotion of this volume. Special thanks go to Tom Cover for many helpful 
comments on earlier versions of the introductions and to Bryan Fitzgerald for valuable 
data. 


Leonard C. MacLean 
Edward O. Thorp 
William T. Ziemba 
March 2010 
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Introduction to the Early Ideas and Contributions 


We live in an age of instant contact through email, twitter, TV, and other modes 
of communication. Back in the 1700s, communication was slower and mail would 
take weeks or months between receipt and response. The first paper in this volume 
and arguably the first on log utility is Daniel Bernoulli's article, written in 1738 in 
Basel, Switzerland where he was professor of physics and philosophy. Bernoulli, at 
age 25, studied in Basel, went to St. Petersburg and then returned to Basel. He is à 
member of the famous family of Swiss mathematicians, who were known as the first 
to apply mathematical analysis to the movement of liquid bodies. His article on log 
utility and the St. Petersburg paradox reprinted here was translated by Dr. Louise 
Sommer of the American University with assistance from Karl Menger, mathemat- 
ics professor at the Illinois Institute of Technology, and William J. Baumol, eco- 
nomics professor at Princeton University. The article was published in Econometrica 
in 1954. 

A great paper often has only one new idea well developed with no major error. 
In his paper, Bernoulli develops two new ideas. The first idea is the development 
of declining marginal utility of wealth leading to logarithmic utility. His simple 
idea is that marginal utility should be proportional to current wealth. So upon 
integration, one has log utility. Bernoulli postulated monotone utility so that utility 
was increasing in wealth. Prior to this, it was assumed that decisions were made 
on an expected value or linear utility basis. 'T'he general idea of declining marginal 
utility or what we would later call “risk aversion” or “concavity” is crucial in modern 
decision theory. 

The second idea is his contribution to the St. Petersburg paradox. This prob- 
lem actually originates from Daniel Bernoulli's cousin, Nicolas Bernoulli, profes- 
sor at the University of Basel. In 1708, he submitted five important problems to 
professor Pierre Montmort, one of which was the St. Petersburg paradox. The 
idea is to determine the expected value and what you would pay for the following 
gamble: 


A fair coin with i probability of heads is repeatedly tossed until 
heads occurs, ending the game. The investor pays c dollars and 
receives in return 2*^! with probability 27% for k = 1,2,... should 
a head occur. Thus, after each succeeding loss, assuming a head 
does not appear, the bet is doubled to 2, 4, 8, ... etc. Clearly the 
expected value is i + i + i +... or infinity with linear utility. 
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Bell and Cover (1980) argue that the St. Petersburg gamble is attractive at any 
price c, but the investor wants less of it as c — oo. The proportion of the investor's 
wealth invested in the St. Petersburg gamble is always positive but decreases with 
the cost c as c increases. The rest of the wealth is in cash. 

Bernoulli offers two solutions since he feels that this gamble is worth a lot less 
than infinity. In the first solution, he arbitrarily sets a limit to the utility of very 
large payoffs. Specifically, any amount over 10 million is assumed to be equal to 
224. Under that assumption, the expected value is 


50 (2) + (D E (24) + (3) e + (3) e jos 


— 12 4 the original 1 — 13 


If utility is yw, the expected value is 
1 1 1 1 
-V14+ -V24+ -vV44...= ~ 2.9 
3 it qu SV jum 


When utility is log, as Bernoulli proposed, the expected value is 


1 1 1 
5 logit 411082 p IBN ete — log2 — 0.69315 


'The use of a concave utility function does not eliminate the paradox. For exam- 
ple, the utility function U(x) = z/log(z-F A), where A > 2 is a constant, is strictly 
concave, strictly increasing, and infinitely differentiable yet the expected value for 
the St. Petersburg gamble is +00. 

As Menger (1934) pointed out, the log, the square root and many others, but 
not all, concave utility functions eliminate the original St. Petersburg paradox but 
it does not solve one where the payoffs grow faster than 2”. So if log is the utility 
function, one creates à new paradox by having the payoffs increase at least as fast 
as log reduces them so one still has an infinite sum for the expected utility. With 
exponentially growing payoffs one has 


1 1 
J log(e!) + 4 log(e?) +... = oo 


The super St. Petersburg paradox, in which even Elog.X = oo, is examined in 
Cover and Thomas (2006: 181, 182) where a satisfactory resolution is reached by 
looking at relative growth rates of wealth. Another solution to such paradoxes is to 
have bounded utility, for example, as Bernoulli suggested above 10 million. To solve 
the St. Petersburg paradox with exponentially growing payoffs, or any other growth 
rate, a second solution, in addition to that of bounding the utility function above, 
is simply to choose a utility function which, though unbounded, grows “sufficiently 
more slowly" than the inverse of the payoff function, e.g., like the log of the inverse 
function to the payoff function. The key is whether the valuation using a utility 
function is finite or not; if finite, the specific value does not matter since utilities 
are equivalent to within a positive linear transformation (V = aU + b, a > 0). So 
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for any utility giving a finite result, there is an equivalent one that will give you 
any specified finite value as a result. Only the behavior of U(x) as x —^ co matters 
and strict monotonicity is necessary for a paradox. For example, U(x) =x, z« A 
will not produce à paradox, but the continuous concave utility function 

Ue) - 24. z>A 
will have a paradox. Samuelson (1977) provides an extensive survey of the paradox 
(see also Menger (1967) and Aase (2001)). 

Kelly (1956) is given credit for the idea of using log utility in gambling and 
repeated investment problems, as such it is known as the Kelly criterion. Kelly’s 
analyses use Bernoulli trials. Not only does he show that log is the utility function 
which maximizes the long run growth rate, but that this utility function is myopic 
in the sense that period by period maximization based only on current capital 
is optimal. Working at Bell Labs, Kelly was strongly influenced by information 
theorist Claude Shannon. Kelly defined the long run growth rate of the investor’s 
fortune using 


W 
G — lim log —N 
N—oo 0 
where Wo is the initial wealth and Wy is the wealth after N trials in sequence. With 
Bernoulli trials, one wins = +1 with probability p and loses —1 with probability 


q — 1— p. The wealth with M wins and L = N — M loses is 
Wy — (14 f)" - f)" "ws 
where f is the fraction of wealth wagered on each of the N trials. Substituting this 


into G yields 
«(M N-M . 
G= im e log(1 + f) + ( N ) log(1 — D) 


= plog(1 + f)  glog(1— f) = ElogW 


by the strong law of large numbers. 
Maximizing G is equivalent to maximizing the expected value of the log of each 
period’s wealth. The optimal wager for this is 


f=p-q p2q>d0 
which is the expected gain per trial, or the edge. If there is no edge, the bet is zero. 
If the payoff is +B for a win and —1 for a loss, then the edge is Bp — q, the odds 
are B, and 
« Bp—q _ edge 
=~ B~ = odds 
Latané (1959) introduced log utility as an investment criterion to the finance 
world independent of Kelly’s work. Focusing, like Kelly, on simple intuitive versions 
of the expected log criteria, he suggested that it had superior long run properties. 
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Breiman (1961), following his earlier intuitive paper Breiman (1960), established 
the basic mathematical properties of the expected log criterion in a rigorous fash- 
ion. He proves three basic asymptotic results in a general discrete time setting 
with intertemporally independent assets. Suppose in each period, N, there are 
K investment opportunities with returns per unit invested Xyij,...,Xwx. Let 
A = (Ai,..., Ak) be the fraction of wealth invested in each asset. The wealth at 
the end of period N is 


K 
Wn = » hats Ww.a 


i=l 


Property 1. In each time period, two portfolio managers have the same family 
of investment opportunities with returns, X, and one uses a A* which maximizes 
Elog Wy whereas the other uses an essentially different strategy, A, so they differ 
infinitely often, that is 


E log Wy A* — Elog Wn (A) > co 
Then 


lim ————— > oo 


So the wealth exceeds that with any other strategy by more and more as the horizon 
becomes more distant. 

This generalizes the Kelly-Bernoulli trial setting to intertemporally independent 
and stationary returns. 


Property 2. The expected time to reach a preassigned goal A is, asymptotically 
least as A increases with a strategy maximizing E log Wy. 


Property 3. Ássuming a fixed opportunity set, there is a fixed fraction strategy 
that maximizes E log Wy, which is independent of N. 


Thorp (1969) discusses the general theory of optimal betting over time on fa- 
vorable games or investments. Favorable games are those with a strategy such 
that 


Prob| lim Wy > 0| =1 
N o0 


where Wy is the investor's capital after N trials. Thorp follows the footsteps of 
Kelly and Breiman, Bellman and Kalaba (1957), and Ferguson (1965), by discussing 
some favorable games such as blackjack, the side bet in Nevada-style baccarat (also 
called chemin de fer), roulette, the wheel of fortune, and the stock market. Once one 
has an edge, with positive expectation, Thorp outlines a general theory for optimal 
wagering on such games. For the stock market, Thorp discusses mispricings in 
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warrants and how to hedge them with advantage and to choose the optimal amount 
to invest using the Kelly approach. 

Markowitz arithmetic mean-variance (MV) efficiency applies to single period 
returns. For multiperiod returns we are interested in geometric or compound rates 
of return and the associated variance. Arithmetic MV-efficient portfolios are, in 
general, not geometric MV-efficient and conversely. The Kelly strategy is always 
geometric MV-efficient and has the highest growth rate of all such strategies. Any 
strategy which bets more is not geometric MV-efficient. 

Thorp (1971) focuses on the theory of logarithmic utility as applied to portfolio 
selection and contrasts it with Markowitz mean-variance portfolio theory. He shows 
that the Kelly strategy is not necessarily arithmetic mean-variance efficient. 

Hakansson (1970) presents arithmetic optimal strategies for the class of utility 
functions 


eo 
Soa’ tu(er), 0cocl 


t=1 


where c; is the consumption in period t and either the relative —cu"(c)/u'(c) or the 
absolute Arrow-Pratt risk aversion u"(c)/w'(c) is a positive constant for all c > 0. 
This includes positive power u(c) = c*, 0 < y < 1, negative power u(c) = —c ^, 
y > 0, logarithmic u(c) = logc, and exponential u(c) = —e "4, y > 0. 

Using this model, Hakansson is able to determine, in closed form, optimal con- 
sumption, investment and borrowing strategies. The investor has an initial capital 
position, which could be negative, and a known deterministic non-capital income 
stream, and an arbitrary number of possible investments that can be held long or 
shorted whose probability distributions satisfy an arbitrage free condition called no 
easy money. It means that no combination of risky assets can with probability one 
out return the constant rate of interest, that no combination of short sale invest- 
ments exists where the probability is zero that a loss will exceed the lending rate of 
interest, and that no combination of short sale investments can guarantee against 
loss. It is assumed that there are no taxes or transaction costs, and that the joint 
distribution functions of the assets are known and stationary and have stochasti- 
cally constant returns to scale. Double the investment provides double returns. The 
stationarity assumption can be generalized and the same results obtained. 

The optimal investment strategies are independent of wealth, noncapital income, 
age and impatience to consume. Optimal consumption is linear and increasing in 
current wealth and in the present value of the noncapital income stream. The opti- 
ma] asset mix depends only on the probability distribution of returns, the interest 
rate and the investor's one-period utility function of consumption. 

Necessary and sufficient conditions for capital growth are derived. With loga- 
rithmic utility of consumption, the investor will always invest the capital available 
after consumption is paid so as to maximize the expected growth rate of capital 
plus the present value of the noncapital income stream. 
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Hakansson's definition of capital growth is asymptotic 
lim Pr(W, > W1)-1 
toco 
When there is statistical independence period by period, this condition implies that 
E(WiilW4) 2 Wi 


but not the converse. 

A necessary and sufficient condition for capital growth is ar > 1 or that 
EllogW] > 0. Here « is the time discount factor and r is the rate of interest. 
A sufficient but not necessary condition for growth is that there be a non-zero in- 
vestment in at least one of the risky investments since then Pr(W,,1 > Wa) > 0 
for all t. 

A key property that aids multiperiod optimization calculations is myopia. Let 
u(w) be a utility function of final wealth. The u is myopic if induced interme- 
diate time utility functions of wealth are independent of the past and the future, 
in particular, they are independent of yields beyond the current period so they 
are positive linear transformations of u(w). Hakansson (1971) shows that log util- 
ity induces myopia for general asset return distributions. Hakansson also clarifies 
earlier results of Mossin (1968) and Leland (1968) for power utility functions; for 
these one needs independence of period by period returns to induce myopia. For 
u(w) = log w, for all time dependent markets one should, with myopia, maximize 
E (log Wy |W, W»5,..., Ww 1), the conditional mean. 

Mossin and Leland showed that with zero interest rates, myopia obtains for 
linear risk tolerance utility 


u'(w)/u"(w) = a+ bw 


When interest rates are non-zero, there is myopia if these future interest rates are 
known. The linear risk tolerance family includes exponential, log and the negative 
and positive power utility families. All of these results are under the assumption 
of stochastically constant returns to scale, perfect liquidity and divisibility of the 
assets, no transaction costs, withdrawals, capital additions, taxes, and short sales. 

Roll (1973) follows the finance literature starting with Latané (1959) rather 
than the Kelly (1956) literature. He investigates the relationship between the Kelly 
capital growth model, mean-variance and capital asset pricing (CAPM) analysis. 
Additional results on this topic are presented by Thorp (1969, 1971), who shows that 
Kelly weightings are not necessarily mean-variance efficient and Markowitz (1976), 
who argues that the log-optimal portfolio is a limiting mean-variance portfolio. 
MacLean, Ziemba and Blazenko (1992) argue that, from a growth versus security 
prospective, the Kelly portfolio is the most aggressive utility function that can 
be used and betting more leads to lower long run growth and less security. If 
estimation error is considered, the log utility is even riskier. Since, as Chopra 
and Ziemba (1993) show, with such low Arrow-Pratt absolute risk aversion (almost 
zero), errors in means are about 100 times errors in variances and co-variances in 
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certainty equivalent value. Hence, obtaining accurate mean estimates is crucial to 
successful investing. 

Roll (1973) observes the result proved by Breiman (1961) that growth-optimum 
“portfolios maximize the probability of exceeding a given level of wealth in a fixed 
time". Roll focuses on an empirical study of the implications for observed common 
stock returns of all investors selecting such a portfolio. Mean-variance type mod- 
els have dominated finance theory and practice largely because of the relationship 
with the capital asset pricing model and its theoretical justification of index funds 
which beat about 7596 of managers with less cost and effort. On the other hand, 
growth-optimal Kelly portfolios have been used not by ordinary investors but those, 
like Warren Buffett, who attempt to have superior returns. This may lead to less 
diversification, larger positions and more monthly losses, see Ziemba (2005). 

For his test, Roll does not specify specific return distributions but rather relies 
on the fact that in each period the value of 


1+ Ry 
l4 Rp Ma [Aca a — Rfi) 


expected by each individual is equal for all risky assets, where Ry is the return from 
asset j in period t, Rr; is the risk free return in period t and Aj is the investment 
is asset 2 in period t. Under typical, rather strict, financial economics assumptions 
to get to the aggregate level, it is assumed that: (a) all investors hold identical 
probability beliefs or (b) that a “representative” investor exists with the invested 
proportions X*s being equal to relative values of existing asset supplies. In either 
case, the denominator equals 1+ R5, ;, a “market return” defined as a value-weighted 
average of all individual asset returns. It is only approximately observable because 
no comprehensive value-weighted asset indices exist. 
| l+ Rye 

"T TH Rm 
will have the same mean for all securities. Then the return on stock 7 relative to the 
market will have the same mean for all securities. Such assumptions fly in the face 
of standard Kelly application which picks “the best stocks”, and we see clearly the 
difference in philosophy from academic financial economics and other investment 
approaches. Using 1962-1969 data and rather clever ways of looking at the two 
models: growth-optimium and CAPM, Roll concludes that there is: 


*...a close correspondence was demonstrated between their quali- 
tative implications. For example, both models imply that an asset’s 
expected return will equal the risk-free interest rate if the covari- 
ance between the asset’s return and the average return on all assets, 
Cov(R;, Rm), is zero. For most cases, the growth-optimum model 
also shares the Sharpe-Lintner implication that an asset’s expected 
return will exceed the risk-free rate if and only if Cov( Ry, Rm) >0. 
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There are, however, some cases of highly-skewed probability dis- 
tributions where this implication does not follow for the growth- 
optimum model. 

A close empirical correspondence between the two models was 
demonstrated for common stock returns. The procedure: (1) esti- 
mated returns and risk premia implied by the two models from time 
series; (2) calculated cross-sectional relations between estimated 
returns and risks; and (3) compared the cross sectional relations 
to the theoretical predictions of the two models. They could not 
be distinguished on an empirical basis. In every period, estimated 
corresponding coefficients of the two models were nearly equal; and 
indeed, they deviated much further from their theoretically antici- 
pated levels than they deviated from each other." 


'The investment time horizon is, as Roll points out, important here since, for 
short intervals, the models are essentially equivalent after quadratic approximations 
and for long horizons, the normality of returns kicks in. 
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EXPOSITION OF A NEW THEORY ON THE MEASUREMENT 
OF RISK! 


By DANIEL BERNOULLI 


$1. Ever sINce mathematicians first began to study the measurement of risk 
there has been general agreement on the following proposition: Expected values 
are computed by multiplying each possible gain by the number of ways in which it 
can occur, and then dividing the sum of these products by the total number of possible 
cases where, in this theory, the consideration of cases which are all of the same 
probability is insisted upon. If this rule be accepted, what remains to be done 
within the framework of this theory amounts to the enumeration of all alterna- 
tives, their breakdown into equi-probable cases and, finally, their insertion into 
corresponding classifications. 

$2. Proper examination of the numerous demonstrations of this proposition 
that have come forth indicates that they all rest upon one hypothesis: since 
there is no reason to assume that of two persons encountering identical risks,’ either 


1 Translated from Latin into English by Dr. Louise Sommer, The American University, 
Washington, D. C., from “Specimen Theoriae Novae de Mensura Sortis,’’ Commentarii 
Academiae Scientiarum Imperialis Petropolitanae, Tomus V [Papers of the Imperial Academy 
of Sciences in Petersburg, Vol. V], 1738, pp. 175-192. Professor Karl Menger, Illinois Insti- 
tute of Technology has written footnotes 4, 9, 10, and 16. 

Epitor’s Nots: In view of the frequency with which Bernoulli's famous paper has been 
referred to in recent economic discussion, it has been thought appropriate to make it more 
generally available by publishing this English version. In her translation Professor Sommer 
has sought, in so far as possible, to retain the eighteenth century spirit of the original. The 
mathematical notation and much of the punctuation are reproduced without change. 
References to some of the recent literature concerned with Bernoulli’s theory are given at 
the end of the article. 

TRANSLATOR’S NOTE: I highly appreciate the help of Karl Menger, Professor of Mathe- 
matics, Illinois Institute of Technology, a distinguished authority on the Bernoulli problem, 
who has read this translation and given me expert advice. I am also grateful to Mr. William 
J. Baumol, Professor of Economics, Princeton University, for his valuable assistance in 
interpreting Bernoulli's paper in the light of modern econometries. I wish to thank also 
Mr. John H. Klingenfeld, Economist, U. S. Department of Labor, for his cooperation in the 
English rendition of this paper. The translation is based solely upon the original Latin text. 

BIOGRAPHICAL NOTE: Daniel Bernoulli, a member of the famous Swiss family of distin- 

- guished mathematicians, was born in Groningen, January 29, 1700 and died in Basle, March 
17, 1782. He studied mathematics and medical sciences at the University of Basle. In 1725 
he accepted an invitation to the newly established academy in Petersburg, but returned to 
Basle in 1733 where he was appointed professor of physics and philosophy. Bernoulli was a 
member of the academies of Paris, Berlin, and Petersburg and the Royal Academy in 
London. He was the first to apply mathematical analysis to the problem of the movement 
of liquid bodies. 

(On Bernoulli see: Handwörterbuch der Naturwissenschaften, second edition, 1931, pp. 
800-801; ‘‘Die Basler Mathematiker Daniel Bernoulli und Leonhard Euler. Hundert Jahre 
nach ihrem Tode gefeiert von der Naturforschenden Gesellschaft,” Basle, 1884 (Annex to 
part VII of the proceedings of this Society); and Correspondance mathématique . . ., edited 
by Paul Heinrich Fuss, 1843 containing letters written by Daniel Bernoulli to Leonhard 
Euler, Nicolaus Fuss, and C. Goldbach.) 

? i.e., risky propositions (gambles). [Translator] 
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Should. expect to have his desires more closely fulfilled, the risks anticipated by each 
must be deemed equal in value. No characteristic of the persons themselves ought 
to be taken into consideration; only those matters should be weighed carefully 
that pertain to the terms of the risk. The relevant finding might then be made 
by the highest judges established by public authority. But really there is here 
no need for judgment but of deliberation, i.e., rules would be set up whereby 
anyone could estimate his prospects from any risky undertaking in light of one’s 
specific financial circumstances. 

§3. To make this clear it is perhaps advisable to consider the following exam- 
ple: Somehow a very poor fellow obtains a lottery ticket that will yield with 
equal probability either nothing or twenty thousand ducats. Will this man 
evaluate his chance of winning at ten thousand ducats? Would he not be ill- 
advised to sell this lottery ticket for nine thousand ducats? To me it seems that 
the answer is in the negative. On the other hand I am inclined to believe that a 
rich man would be ill-advised to refuse to buy the lottery ticket for nine thou- 
sand ducats. If I am not wrong then it seems clear that all men cannot use the 
same rule to evaluate the gamble. The rule established in §1 must, therefore, 
be discarded. But anyone who considers the problem with perspicacity and in- 
terest will ascertain that the concept of value which we have used in this rule 
may be defined in a way which renders the entire procedure universally accept- 
able without reservation. To do this the determination of the value of an item 
must not be based on its price, but rather on the utility it yields. The price of 
the item is dependent only on the thing itself and is equal for everyone; the 
utility, however, is dependent on the particular circumstances of the person 
making the estimate. Thus there is no doubt that a gain of one thousand ducats 
is more significant to a pauper than to a rich man though both gain the same 
amount. 

$4. The discussion has now been developed to a point where anyone may 
proceed with the investigation by the mere paraphrasing of one and the same 
principle. However, since the hypothesis is entirely new, it may nevertheless 
require some elucidation. I have, therefore, decided to explain by example what 
I have explored. Meanwhile, let us use this as a fundamental rule: Zf the utility 
of each possible profit expectation is multiplied by the number of ways in which it 
can occur, and we then divide the sum of these products by the total number of possible 
cases, a mean utility? [moral expectation] will be obtained, and the profit which 
corresponds to this utility will equal the value of the risk in question. 

§5. Thus it becomes evident that no valid measurement of the value of a risk 
can be obtained without consideration being given to its utility, that is to say, 
the utility of whatever gain accrues to the individual or, conversely, how much 
profit is required to yield a given utility. However it hardly seems plausible to 
make any precise generalizations since the utility of an item may change with 
circumstances. Thus, though a poor man generally obtains more utility than 
does a rich man from an equal gain, it is nevertheless conceivable, for example, 


? Free translation of Bernoulli’s *emolumentum medium," literally: “mean utility." 
[Translator] 
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that a rich prisoner who possesses two thousand ducats but needs two thousand 
ducats more to repurchase his freedom, will place a higher value on a gain of 
two thousand ducats than does another man who has less money than he. 
Though innumerable examples of this kind may be constructed, they repre- 
sent exceedingly rare exceptions. We shall, therefore, do better to consider 
what usually happens, and in order to perceive the problem more correctly 
we shall assume that there is an imperceptibly small growth in the individ- 
ual’s wealth which proceeds continuously by infinitesimal increments. Now 
it is highly probable that any increase in wealth, no matter how insignificant, 
will always result in an increase in utility which ts inversely proportionate to the 
quantity of goods already possessed. To explain this hypothesis it is necessary to 
define what is meant by the quantity of goods. By this expression I mean to con- 
note food, clothing, all things which add to the conveniences of life, and even 
to luxury—anything that can contribute to the adequate satisfaction of any 
sort of want. There is then nobody who can be said to possess nothing at all in 
this sense unless he starves to death. For the great majority the most valuable 
portion of their possessions so defined will consist in their productive capacity, 
this term being taken to include even the beggar’s talent: a man who is able to 
acquire ten ducats yearly by begging will scarcely be willing to accept a sum of 
fifty ducats on condition that he henceforth refrain from begging or otherwise 
trying to earn money. For he would have to live on this amount, and after he 
had spent it his existence must also come to an end. I doubt whether even those 
who do not possess a farthing and are burdened with financial obligations would 
be willing to free themselves of their debts or even to accept a still greater gift 
on such a condition. But if the beggar were to refuse such a contract unless 
immediately paid no less than one hundred ducats and the man pressed by credi- 
tors similarly demanded one thousand ducats, we might say that the former is 
possessed of wealth worth one hundred, and the latter of one thousand ducats, 
though in common parlance the former owns nothing and the latter less than 
nothing. 

§6. Having stated this definition, I return to the statement made in the pre- 
vious paragraph which maintained that, in the absence of the unusual, the utility 
resulting from any small increase in wealth will be inversely proportionate to the 
quantity of goods previously possessed. Considering the nature of man, it seems to 
me that the foregoing hypothesis is apt to be valid for many people to whom this 
sort of comparison can be applied. Only a few do not spend their entire yearly 
incomes. But, if among these, one has a fortune worth a hundred thousand ducats 
and another a fortune worth the same number of semi-ducats and if the former 
receives from it a yearly income of five thousand ducats while the latter obtains 
the same number of semi-ducats it is quite clear that to the former a ducat has 
exactly the same significance as a semi-ducat to the latter, and that, therefore, 
the gain of one ducat will have to the former no higher value than the gain of a 
semi-ducat to the latter. Accordingly, if each makes a gain of one ducat the 
latter receives twice as much utility from it, having been enriched by two semi- 
ducats. This argument applies to many other cases which, therefore, need not 
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be discussed separately. The proposition is all the more valid for the majority 
of men who possess no fortune apart from their working capacity which is their 
only source of livelihood. True, there are men to whom one ducat means more 
than many ducats do to others who are less rich but more generous than they. 
But since we shall now concern ourselves only with one individual (in different 
States of affluence) distinctions of this sort do not concern us. The man who is 
emotionally less affected by a gain will support & loss with greater patience. 
Since, however, in special cases things can conceivably occur otherwise, I shall 
first deal with the most general case and then develop our special hypothesis in 
order thereby to satisfy everyone. 


Q 


87. Therefore, le& AB represent the quantity of goods initially possessed. 
Then after extending AB, a curve BGLS must be constructed, whose ordinates 
CG, DH, EL, FM, etc., designate utilities corresponding to the abscissas BC, 
BD, BE, BF, ete., designating gains in wealth. Further, let m, n, p, q, ete., be 
the numbers which indicate the number of ways in which gains in wealth BC, 
BD, BE, BF [misprinted in the original as CF], ete., can occur. Then (in accord 
with $4) the moral expectation of the risky proposition referred to is given by: 


Qm. CG + n.DH + p.EL + g.FM +: 


Po m-Hnc4pctgctoe 


Now, if we erect AQ perpendicular to AE, and on it measure off AN = PO, the 
straight line NO — AB represents the gain which may properly be expected, or 
the value of the risky proposition in question. If we wish, further, to know how 
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large a stake the individual should be willing to venture on this risky proposi- 
tion, our curve must be extended in the opposite direction in such a way that 
the abscissa Bp now represents a loss and the ordinate po represents the cor- 
responding decline in utility. Since in a fair game the disutility to be suffered by 
losing must be equal to the utility to be derived by winning, we must assume 
that An = AN, or po = PO. Thus Bp will indicate the stake more than which 
persons who consider their own pecuniary status should not venture. 


COROLLARY I 


§8. Until now scientists have usually rested their hypothesis on the assump- 
tion that all gains must be evaluated exclusively in terms of themselves, i.e., 
on the basis of their intrinsic qualities, and that these gains will always produce 
a utility directly proportionate to the gain. On this hypothesis the curve BS 
becomes a straight line. Now if we again have: 


_ m.CG + n.DH + p.EL + q.FM +- 


PO mntptqtoe , 


and if, on both sides, the respective factors are introduced it follows that: 


BP = m.BC + n.BD + p.BE + g.BF + -+ 
mtenbtptat: , 


which is in conformity with the usually accepted rule. 


COROLLARY II 


§9. If AB were infinitely great, even in proportion to BF, the greatest possible 
gain, the arc BM may be considered very like an infinitesimally small straight 
line. Again in this case the usual rule [for the evaluation of risky propositions] 
is applicable, and may continue to be considered approximately valid in games 
of insignificant moment. 

§10. Having dealt with the problem in the most general way we turn now to 
the aforementioned particular hypothesis, which, indeed, deserves prior atten- 
tion to all others. First of all the nature of curve sBS must be investigated under 
the conditions postulated in §7. Since on our hypothesis we must consider in- 
finitesimally small gains, we shall take gains BC and BD to be nearly equal, so 
that their difference CD becomes infinitesimally small. If we draw Gr parallel 
to BR, then rH will represent the infinitesimally small gain in utility to a man 
whose fortune is AC and who obtains the small gain, CD. This utility, however, 
should be related not only to the tiny gain CD, to which it is, other things being 
equal, proportionate, but also to AC, the fortune previously owned to which it 

‘is inversely proportionate. We therefore set: AC = x, CD = dz, CG = y, rH = 


dy and AB = a; and if b designates some constant we obtain dy = nae ory = 
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x 
b log a The curve sBS is therefore a logarithmic curve, the subtangent* of which 


is everywhere b and whose asymptote is Qq. 

$11. If we now compare this result with what has been said in paragraph 7, 
it will appear that: PO = blog AP/AB, CG = blog AC/AB, DH = blog AD/AB 
and so on; but since we have 


PO = m.CG + n.DH + p. EL + q.FM +- 
mM+ntprat:: 


it follows that 


AP AC AD AE AF 
blog — = — T nad £5 d.e): 
log B (mb tog $6 + nb log 42 + pb log 4E + qb tog 4 + ) 


(m+n+p+aqt-:) 
and therefore 
AP = (AC". AD" AE’. AF". e) tatott 


and if we subtract AB from this, the remaining magnitude, BP, will represent 
the value of the risky proposition in question. 

$12. Thus the preceding paragraph suggests the following rule: Any gain must 
be added to the fortune previously possessed, then this sum must be raised to the power 
given by the number of possible ways in which the gain may be obtained; these terms 
should then be multiplied together. Then of this product a root must be extracted the 
degree of which is given by the number of all possible cases, and finally the value 
of the initial possessions must be subtracted therefrom; what then remains indicates 
the value of the risky proposition in question. This principle is essential for the 
measurement of the value of risky propositions in various cases. I would elab- 
orate it into a complete theory as has been done with the traditional analysis, 
were it not that, despite its usefulness and originality, previous obligations do 
not permit me to undertake this task. I shall therefore, at this time, mention only 
the more significant points among those which have at first glance occurred to me. 


4 The tangent to the curve y = blog z at the point (x , log =) is the line y — b log 7 
Q « a 
M — To). This tangent intersects the Y-axis (x = 0) at the point with the ordinate 


. E^ 
b log 2. b. The point of contact of the tangent with the curve hag the ordinate b log = . 
Q 


So also does the projection of this point on the Y-axis. The segment between the two points 
on the Y-axis that have been mentioned has the length b. That segment is the projection 
of the segment on the tangent between its intersection with the Y-axis and the point of 
contact. The length of this projection (which is b) is what Bernoulli here calls the ''sub- 
tangent.” Today, by the subtangent of the curve y = f(z) at the point (zo , f(zo)) is meant 
the length of the segment on the X-axis (and not the Y-axis) between its intersection with 
the tangent and the projection of the point of contact. This length is f(zo)/f'(zo). In the 


case of the logarithmic curve it equals zo log 2» _ Karl Menger. 
Q 
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§13. First, it appears that in many games, even those that are absolutely fair, 
both of the players may expect to suffer a loss; indeed this is Nature’s admoni- 
tion to avoid the dice altogether. ... This follows from the concavity of curve 
sBS to BR. For in making the stake, Bp, equal to the expected gain, BP, it is 
clear that the disutility po which results from a loss will always exceed the ex- 
pected gain in utility, PO. Although this result will be quite clear to the mathe- 
matician, I shall nevertheless explain it by example, so that it will be clear to 
everyone. Let us assume that of two players, both possessing one hundred ducats, 
each puts up half this sum as a stake in a game that offers the same probabilities 
to both players. Under this assumption each will then have fifty ducats plus 
the expectation of winning yet one hundred ducats more. However, the sum 
of the values of these two items amounts, by the rule of §12, to only 
(50°.150')! or 4/50.150, i.e., less than eighty-seven ducats, so that, though 
the game be played under perfectly equal conditions for both, either will suffer 
an expected loss of more than thirteen ducats. We must strongly emphasize 
this truth, although it be self evident: the imprudence of a gambler will be the 
greater the larger the part of his fortune which he exposes to a game of chance. 
For this purpose we shall modify the previous example by assuming that one of 
the gamblers, before putting up his fifty ducat stake possessed two hundred 
ducats. This gambler suffers an expected loss of 200 — +/150.250, which is 
not much greater than six ducats. 

$14. Since, therefore, everyone who bets any part of his fortune, however 
small, on a mathematically fair game of chance acts irrationally, it may be of 
interest to inquire how great an advantage the gambler must enjoy over his 
opponent in order to avoid any expected loss. Let us again consider a game which 
is as simple as possible, defined by two equiprobable outcomes one of which is 
favorable and the other unfavorable. Let us take a to be the gain to be won in 
case of a favorable outcome, and z to be the stake which is lost in the unfavorable 
case. If the initial quantity of goods possessed is a we have AB = a; BP = a; 
PO = blog ta (see §10), and since (by §7) po = PO it follows by the nature 


Q 


of a logarithmic curve that Bp = T 
Q 


a Since however Bp represents the stake 
aa 

a 
gain. It also follows from this that a man who risks his entire fortune acts like 
a simpleton, however great may be the possible gain. No one will have difficulty 
in being persuaded of this if he has carefully examined our definitions given 
above. Moreover, this result sheds light on a statement which is universally 
accepted in practice: it may be reasonable for some individuals to invest in a 
doubtful enterprise and yet be unreasonable for others to do so. 

§15. The procedure customarily employed by merchants in the insurance of 
commodities transported by sea seems to merit special attention. This may again 
be explained by an example. Suppose Caius,’ a Petersburg merchant, has pur- 


x, we havez = a magnitude which is always smaller than a, the expected 


5 Caius is a Roman name, used here in the sense of our ‘Mr. Jones." Caius is the older 
form; in the later Roman period it was spelled ‘“‘Gaius.’’ [Translator] 
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chased commodities in Amsterdam which he could sell for ten thousand rubles 
if he had them in Petersburg. He therefore orders them to be shipped there by 
sea, but is in doubt whether or not to insure them. He is well aware of the fact 
that at this time of year of one hundred ships which sail from Amsterdam to 
Petersburg, five are usually lost. However, there is no insurance available below 
the price of eight hundred rubles a cargo, an amount which he considers out- 
rageously high. The question is, therefore, how much wealth must Caius possess 
apart from the goods under consideration in order that it be sensible for him to 
abstain from insuring them? If z represents his fortune, then this together with 
the value of the expectation of the safe arrival of his goods is given by 
We F 10000) ®t = Wie + 10000)"z in case he abstains. With insurance 
he will have a certain fortune of z + 9200. Equating these two magnitudes we 
get: (x + 10000)" = (x + 9200)" or, approximately, z = 5043. If, therefore, 
Caius, apart from the expectation of receiving his commodities, possesses an 
amount greater than 5043 rubles he will be right in not buying insurance. If, 
on the contrary, his wealth is less than this amount he should insure his cargo. 
And if the question be asked “What minimum fortune should be possessed by 
the man who offers to provide this insurance in order for him to be rational in 
doing so?" We must answer thus: let y be his fortune, then 


V (y + 800)". (y — 9200) =y 


or approximately, y = 14243, a figure which is obtained from the foregoing 
without additional ealeulation. À man less wealthy than this would be foolish 
to provide the surety, but it makes sense for a wealthier man to do so. From 
this it is clear that the introduction of this sort of insurance has been so useful 
since it offers advantages to all persons concerned. Similarly, had Caius been 
able to obtain the insurance for six hundred rubles he would have been unwise 
to refuse 1t if he possessed less than 20478 rubles, but he would have acted much 
too eautiously had he insured his commodities at this rate when his fortune was 
greater than this amount. On the other hand a man would act unadvisedly if 
he were to offer to sponsor this insurance for six hundred rubles when he himself 
possesses less than 29878 rubles. However, he would be well advised to do so if 
he possessed more than that amount. But no one, however rich, would be manag- 
ing his affairs properly if he individually undertook the insurance for less than 
five hundred rubles. 

$16. Another rule which may prove useful can be derived from our theory. 
This is the rule that it is advisable to divide goods which are exposed to some 
danger into several portions rather than to risk them all together. Again I shall 
explain this more precisely by an example. Sempronius owns goods at home 
worth a total of 4000 ducats and in addition possesses 8000 ducats worth of 
commodities in foreign countries from where they can only be transported by sea. 
However, our daily experience teaches us that of ten ships one perishes. Under 
these conditions I maintain that if Sempronius trusted all his 8000 ducats of 
goods to one ship his expectation of the commodities is worth 6751 ducats. That 
is 


4/120009.4000! — 4000. 
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If, however, he were to trust equal portions of these commodities to two ships 
the value of his expectation would be 


1/120008! .800018.4000 — 4000, i.e., 7033 ducats. 


In this way the value of Sempronius' prospects of success will grow more favor- 
able the smaller the proportion committed to each ship. However, his expectation 
will never rise in value above 7200 ducats. This counsel will be equally service- 
able for those who invest their fortunes in foreign bills of exchange and other 
hazardous enterprises. 

$17. I am forced to omit many novel remarks though these would clearly not 
be unserviceable. And, though a person who is fairly judicious by natural instinct 
might have realized and spontaneously applied much of what I have here ex- 
plained, hardly anyone believed it possible to define these problems with the 
precision we have employed in our examples. Since all our propositions harmonize 
perfectly with experience it would be wrong to neglect them as abstractions rest- 
ing upon precarious hypotheses. This is further confirmed by the following ex- 
ample which inspired these thoughts, and whose history is as follows: My most 
honorable cousin the celebrated Nicolas Bernoulli, Professor utriusque iuris? at 
the University of Basle, once submitted five problems to the highly distinguished’ 
mathematician Montmort.’ These problems are reproduced in the work L'analyse 
sur les jeux de hazard de M. de Montmort, p. 402. The last of these problems runs 
as follows: Peter tosses a coin and continues to do so until it should land “heads” 
when it comes to the ground. He agrees to give Paul one ducat if he gets “heads” on 
the very first throw, two ducats if he gets 1t on the second, four if on the third, eight 
af on the fourth, and so on, so that with each additional throw the number of ducats 
he must pay is doubled. Suppose we seek to determine the value of Paul’s expectation. 
My aforementioned cousin discussed this problem in a letter to me asking for 
my opinion. Although the standard calculation shows’ that the value of Paul’s 
expectation is infinitely great, it has, he said, to be admitted that any fairly 
reasonable man would sell his chance, with great pleasure, for twenty ducats. 
The accepted method of calculation does, indeed, value Paul’s prospects at 
infinity though no one would be willing to purchase it at a moderately high price. 


6 Faculties of law of continental European universities bestow up to the present time the 
title of a Doctor utriusque juris, which means Doctor of both systems of laws, the Roman 
and the canon law. [Translator] 

7 Cl., i.e., Vir Clarissimus, a title of respect. [Translator] 

§ Montmort, Pierre Rémond, de (1678-1719). The work referred to here is the then famous 
“Essai d’analyse sur les jeux de hazard,” Paris, 1708. Appended to the second edition, 
published in 1713, is Montmort’s correspondence with Jean and Nicolas Bernoulli referring 
to the problems of chance and probabilities. [Translator]. 

* The probability of heads turning up on the 1st throw is 1/2. Since in this case Paul 
receives one ducat, this probability contributes 1/2-1 = 1/2 ducats to his expectation. The 
probability of heads turning up on the 2nd throw is 1/4. Since in this case Paul receives 2 
ducats, this possibility contributes 1/4-2 = 1/2 to his expectation. Similarly, for every 
integer n, the possibility of heads turning up on the n-th throw contributes 1/2*-2°-! = 1/2 
ducats to his expectation. Paul's total expectation is therefore 1/2 + 1/2 +++ +1/2 +, 
and that is infinite —Karl Menger. 


20 D. Bernoulli 


32 DANIEL BERNOULLI 


If, however, we apply our new rule to this problem we may see the solution and 
thus unravel the knot. The solution of the problem by our principles is as follows. 

$18. The number of cases to be considered here is infinite: in one half of the 
cases the game will end at the first throw, in one quarter of the cases it will 
conclude at the second, in an eighth part of the cases with the third, in & six- 
teenth part with the fourth, and so on.” If we designate the number of cases 
through infinity by N it is clear that there are 14N cases in which Paul gains 
one ducat, 14N cases in which he gains two ducats, 1¢N in which he gains four, 
MN in which he gains eight, and so on, ad infinitum. Let us represent Paul's 
fortune by o; the proposition in question will then be worth 


Vla + D72. la + 2). (a F 4)NIS (a + S)NIS --. — a 
= V(a FI). N/(a 2). (a + 4). V (a 8) — a 


$19. From this formula which evaluates Paul's prospective gain it follows 
that this value will increase with the size of Paul's fortune and will never attain 
&n infinite value unless Paul's wealth simultaneously becomes infinite. In addi- 
tion we obtain the following corollaries. If Paul owned nothing at all the value 
of his expectation would be 


V1. A2. N/A. NB s 


which amounts to two ducats, precisely. If he owned ten ducats his opportunity 
would be worth approximately three ducats; it would be worth approximately 
four if his wealth were one hundred, and six if he possessed one thousand. From 
this we can easily see what a tremendous fortune a man must own for it to 
make sense for him to purchase Paul's opportunity for twenty ducats. The 
amount which the buyer ought to pay for this proposition differs somewhat 
from the amount it would be worth to him were it already in his possession. 
Since, however, this difference is exceedingly small if a (Paul's fortune) is great, 


10 Since the number of cases is infinite, it is impossible to speak about one half of the 
cases, one quarter of the cases, etc., and the letter N in Bernoulli's argument is meaning- 
less. However, Paul's expectation on the basis of Bernoulli’s hypothesis concerning evalua- 
tion can be found by the same method by which, in footnote 9, Paul's classical expectation 
was determined. If Paul's fortune is « ducats, then, according to Bernoulli, he attributes 


n=l 
a+? . If the probability of this gain is 1/2", his 
a 


to a gain of 277! ducats the value b log 


n-i 
expectation is b/2" log eror . Paul's expectation resulting from the game is therefore 
[^4 


1 2 b 2771 
b log Tt a+ pos peg tt y... 
a a 2^ 


2 a 
= b log [(a + 1)!2(a + 2)!^-...- (a + 2»71)!2"....] — b log a. 


a + D 


+ Flog 


What addition D to Paul's fortune has the same value for him? Clearly, b log must 


a 


equal the above sum. Therefore 
D = (a $1) (a + 2)14....- (a + QI) U2". — o. 
—Karl Menger. 
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we can take them to be equal. If we designate the purchase price by z its value 
can be determined by means of the equation 


Wati-s).Wat2—-2).Wat4—2).Vat8—2- =a 
and if a is a large number this equation will be approximately satisfied by 


a= VatlWat2.VWat4.Vat8- — a. 


After having read this paper to the Society” I sent a copy to the aforementioned 
Mr. Nicolas Bernoulli, to obtain his opinion of my proposed solution to the difficulty 
he had indicated. In a letter to me written in 1732 he declared that he was in no way 
dissatisfied with my proposition on the evaluation of risky propositions when applied 
to the case of a man who is to evaluate his own prospects. However, he thinks that 
the case is different if a third person, somewhat in the position of a judge, is to 
evaluate the prospects of any participant in a game in accord with equity and jus- 
tice. I myself have discussed this problem in §2. Then this distinguished scholar 
informed me that the celebrated mathematician, Cramer,” had developed a theory 
on the same subject several years before I produced my paper. Indeed I have found 
his theory so similar to mine that it seems miraculous that we independently reached 
such close agreement on this sort of subject. Therefore it seems worth quoting the 
words with which the celebrated Cramer himself first described his theory in his 
letter of 1728 to my cousin. His words are as follows:* 

“Perhaps I am mistaken, but I believe that I have solved the extraordinary” 
problem which you submitted to M. de Montmort, in your letter of September 9," 
1718, (problem 5, page 402). For the sake of simplicity I shall assume that A” 
tosses a coin into the air and B commits himself to give A 1 ducat if, at the” 
first throw, the coin falls with its cross upward; 2 if it falls thus only at the” 
second throw, 4 if at the third throw, 8 if at the fourth throw, etc. The paradox" 
consists in the infinite sum which calculation yields as the equivalent which” 
A must pay to B. This seems absurd since no reasonable man would be willing" 
to pay 20 ducats as equivalent. You ask for an explanation of the discrepancy” 
between the mathematical calculation and the vulgar evaluation. I believe" 
that it results from the fact that, in their theory, mathematicians evaluate” 
money in proportion to its quantity while, in practice, people with common" 
sense evaluate money in proportion to the utility they can obtain from it. The" 
mathematical expectation is rendered infinite by the enormous amount which" 
I ean win if the coin does not fall with its cross upward until rather late, perhaps" 
at the hundredth or thousandth throw. Now, as a matter of fact, if I reason" 
as a sensible man, this sum is worth no more to me, causes me no more pleasure" 


1! Bernoulli's paper had been submitted to the Imperial Academy of Sciences in Peters- 
burg. [Translator] 

12 Cramer, Gabriel, famous mathematician, born in Geneva, Switzerland (1704-1752). 
[Translator] 

13 The following passage of the original text is in French. [Translator] 
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“and influences me no more to accept the game than does a sum amounting 
"only to ten or twenty million ducats. Let us suppose, therefore, that any 
“amount above 10 millions, or (for the sake of simplicity) above 2% = 166777216 
“ducats be deemed by him equal in value to 2“ ducats or, better yet, that I 
"can never win more than that amount, no matter how long it takes before the 
“coin falls with its cross upward. In this case, my expectation is 14.1 + 14.2 + 
Ug A eee -- 42.2" + 4.2" + is.27 D... 2 K+ l4 4 ol& Lo. 
“(24 times) :-- + 14 + 14 + 3% + +- = 12 + 1 = 18. Thus, my moral ex- 
“pectation is reduced in value to 13 ducats and the equivalent to be paid for 
“it is similarly reduced—a result which seems much more reasonable than does 
“rendering it infinite." 

Thus far!* the exposition is somewhat vague and subject to counter argument. 
If it, indeed, be true that the amount 2” appears to us to be no greater than P 
no attention whatsoever should be paid to the amount that may be won after the 
twenty-fourth throw, since just before making the twenty-fifth throw I am certain to 
end up with no less than 2 — 1, an amount that, according to this theory, may be 
considered equivalent to 2". Therefore it may be said correctly that my expectation 
is only worth twelve ducats, not thirteen. However, in view of the coincidence between 
the basic principle developed by the aforementioned author and my own, the fore- 
going is clearly not intended to be taken to invalidate that principle. I refer to the 
ptoposition that reasonable men should evaluate money in accord with the utility 
they derive therefrom. I state this to avoid leading anyone to judge that entire theory 
adversely. And this is exactly what Cl. C. Cramer states, expressing in the following 
manner precisely what we would ourselves conclude. He continues thus: 

“The equivalent can turn out to be smaller yet if we adopt some alternative 
“hypothesis on the moral value of wealth. For that which I have just assumed 
‘dg not entirely valid since, while it is true that 100 millions yield more satis- 
“faction than do 10 millions, they do not give ten times as much. If, for example, 
“we suppose the moral value of goods to be directly proportionate to the square 
“root of their mathematical quantities, e.g., that the satisfaction provided by 
“40000000 is double that provided by 10000000, my psychic expectation 
“becomes 


"VI + MVE + MVE MoVE + mRL UE 


“However this magnitude is not the equivalent we seek, for this equivalent 
“need not be equal to my moral expectation but should rather be of such a 
“magnitude that the pain caused by its loss is equal to the moral expectation 
“of the pleasure I hope to derive from my gain. Therefore, the equivalent must, 


14 Fyom here on the text is again translated from Latin. [Translator] 

16 This remark of Bernoulli’s is obscure. Under the conditions of the game a gain of 
234 — 1 ducats is impossible. —Karl Menger. 

18 To be translated as ''the distinguished Gabriel.” [Translator] 

Y Text continues in French. [Translator] 
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. 1 2 1 si» 
on our hypothesis, amount to (; m vs) - (s —4 vs) = 29---, which 
is consequently less than 3, truly a trifling amount, but nevertheless, I believe,” 
closer than is 18 to the vulgar evaluation." 
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A New Interpretation of Information Rate 
reproduced with permission of AT&T 


By J. L. KELLY, JR. 


If the input symbols to a communication channel represent the outcomes of a 
chance event on which bets are available at odds consistent with their probabilities 
(i.e., "fair" odds), a gambler can use the knowledge given him by the received 
symbols to cause his money to grow exponentially. The maximum exponential 
rate of growth of the gambler’s capital is equal to the rate of transmission of 
information over the channel. This result is generalized to include the case of 
arbitrary odds. 

Thus we find a situation in which the transmission rate is significant even 
though no coding is contemplated. Previously this quantity was given significance 
only by a theorem of Shannon’s which asserted that, with suitable encoding, 
binary digits could be transmitted over the channel at this rate with an arbitrarily 
small probability of error. 


INTRODUCTION 


Shannon defines the rate of transmission over a noisy communication channel 
in terms of various probabilities.! This definition is given significance by a 
theorem which asserts that binary digits may be encoded and transmitted over 
the channel at this rate with arbitrarily small probability of error. Many workers 
in the field of communication theory have felt a desire to attach significance to 
the rate of transmission in cases where no coding was contemplated. Some 
have even proceeded on the assumption that such a significance did, in fact, 
exist. For example, in systems where no coding was desirable or even possible 
(such as radar), detectors have been designed by the criterion of maximum 
transmission rate or, what is the same thing, minimum equivocation. Without 
further analysis such a procedure is unjustified. 

The problem then remains of attaching a value measure to a communication 


1C.E. Shannon, A Mathematical Theory of Communication, B.S.T.J., 27, pp. 379-423, 
623-656, Oct., 1948. 
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system in which errors are being made at a non-negligible rate, i.e., where 
optimum coding is not being used. In its most general formulation this problem 
seems to have but one solution. A cost function must be defined on pairs of 
symbols which tells how bad it is to receive a certain symbol when a specified 
signal is transmitted. Furthermore, this cost function must be such that its 
expected value has significance, i.e., à system must be preferable to another if 
its average cost is less. The utility theory of Von Neumann? shows us one way 
to obtain such a cost function. Generally this cost function would depend on 
things external to the system and not on the probabilities which describe the 
system, so that its average value could not be identified with the rate as defined 
by Shannon. 

The cost function approach is, of course, not limited to studies of commu- 
nication systems, but can actually be used to analyze nearly any branch of 
human endeavor. The author believes that it is too general to shed any light 
on the specific problems of communication theory. The distinguishing feature 
of a communication system is that the ultimate receiver (thought of here as a 
person) is in a position to profit from any knowledge of the input symbols or 
even from a better estimate of their probabilities. A cost function, if it is sup- 
posed to apply to a communication system, must somehow reflect this feature. 
The point here is that an arbitrary combination of a statistical transducer (i.e., 
a channel) and a cost function does not necessarily constitute a communication 
system. In fact (not knowing the exact definition of a communication system on 
which the above statements are tacitly based) the author would not know how 
to test such an arbitrary combination to see if it were a communication system. 

What can be done, however, is to take some real-life situation which seems 
to possess the essential features of a communication problem, and to analyze 
it without the introduction of an arbitrary cost function. The situation which 
will be chosen here is one in which a gambler uses knowledge of the received 
symbols of à communication channel in order to make profitable bets on the 
transmitted symbols. 


THE GAMBLER WITH A PRIVATE WIRE 


Let us consider a communication channel which is used to transmit the 
results of a chance situation before those results become common knowledge, so 
that a gambler may still place bets at the original odds. Consider first the case 
of a noiseless binary channel, which might be 


2Von Neumann and Morgenstein, Theory of Games and Economic Behavior, Princeton 
Univ. Press, 2nd Edition, 1947. 
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used, for example, to transmit the results of a series of baseball games between 
two equally matched teams. The gambler could obtain even money bets even 
though he already knew the result of each game. The amount of money he 
could make would depend only on how much he chose to bet. How much would 
he bet? Probably all he had since he would win with certainty. In this case 
his capital would grow exponentially and after N bets he would have 2” times 
his original bankroll. This exponential growth of capital is not uncommon in 
economics. In fact, if the binary digits in the above channel were arriving at the 
rate of one per week, the sequence of bets would have the value of an investment 
paying 100 per cent interest per week compounded weekly. We will make use 
of a quantity G called the exponential rate of growth of the gambler's capital, 
where 


. 1 Vn 
C = SN 08 ys 
where Vy is the gambler’s capital after N bets, Vo is his starting capital, and 
the logarithm is to the base two. In the above example G = 1. 

Consider the case now of a noisy binary channel, where each transmitted 
symbol has probability, p, of error and q of correct transmission. Now the 
gambler could still bet his entire capital each time, and, in fact, this would 
maximize the expected value of his capital, (Vy), which in this case would be 
given by 


(Vn) = (23) Vo 


This would be little comfort, however, since when N was large he would probably 
be broke and, in fact, would be broke with probability one if he continued 
indefinitely. Let us, instead, assume that he bets a fraction, £, of his capital 
each time. Then 


Vy = (14+ QV (1— 07 Vs 


where W and L are the number of wins and losses in the N bets. Then 


G 


W L 
i —- = 1-2 
Jim N log(1 + £) + N log( ) 


qlog(1 + £) + plog(1— £) with probability one 


It 


Let us maximize G with respect to 2. The maximum value with respect to 
the Y; of a quantity of the form Z = Y, Xj log Yi, subject to the constraint 
> Yi = Y, is obtained by putting 


28 J. L. Kelly, Jr. 


920 THE BELL SYSTEM TECHNICAL JOURNAL, JULY 1956 


where X = ».Xj;. This may be shown directly from the convexity of the 
logarithm. 
Thus we put 


(1+ @) = 2q 
(1 — £) = 2p 


and 


Gmaz =1+plogp +q logg 
=R 


which is the rate of transmission as defined by Shannon. 

One might still argue that the gambler should bet all his money (make £ = 1) 
in order to maximize his expected win after N times. It is surely true that if the 
game were to be stopped after N bets the answer to this question would depend 
on the relative values (to the gambler) of being broke or possessing a fortune. If 
we compare the fates of two gamblers, however, playing a nonterminating game, 
the one which uses the value £ found above will, with probability one, eventually 
get ahead and stay ahead of one using any other £. At any rate, we will assume 
that the gambler will always bet so as to maximize G. 


THE GENERAL CASE 


Let us now consider the case in which the channel has several input symbols, 
not necessarily equally likely, which represent the outcome of chance events. We 
wil] use the following notation: 


p(s) the probability that the transmitted symbol is the s'th one. 


p(r/s) the conditional probability that the received symbol is the r'th on the 
hypothesis that the transmitted symbol is the s'th one. 


p(s,r) the joint probability of the s'th transmitted and r'th received symbol. 
q(r) received symbol probability. 


q(s/r) conditional probability of transmitted symbol on hypothesis of received 
symbol. 


the odds paid on the occurrence of the s’th transmitted symbol, i.e., a, 
is the number of dollars returned for a one-dollar bet (including that one 
dollar). 


a(s/r) the fraction of the gambler's capital that he decides to bet on the oc- 
currence of the s'th transmitted symbol after observing the r'th received 
symbol. 


a 


E 
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Only the case of independent transmitted symbols and noise will be consid- 
ered. We will consider first the case of “fair” odds, i.e., 


Os = Fs) 


In any sort of parimutuel betting there is a tendency for the odds to be fair 
(ignoring the “track take”). To see this first note that if there is no “track take” 


1 
a} 


since all the money collected is paid out to the winner. Next note that if 


for some s a bettor could insure a profit by making repeated bets on the sí^ 
outcome. The extra betting which would result would lower a,. The same feed- 
back mechanism probably takes place in more complicated betting situations, 
such as stock market speculation. 

There is no loss in generality in assuming that 


> a(s/r) =1 


i.e., the gambler bets his total capital regardless of the received symbol. Since 
1 
Xie 
Os 
he can effectively hold back money by placing canceling bets. Now 


Vy = [[lots/2)os) 7 Vo 


T,8 


where W,, is the number of times that the transmitted symbol is s and the 
received symbol is r. 


Va 
log — = Werl 
8y 2 sr log asa(s/r) 
1. Vw (1) 
G= Jim N log Vo = 2 ps7) log a,a(s/r) 


with probability one. Since 
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here 


a(s/r) 
p(s) 


= > p(s,7) log a(s/r) + H(X) 


TS 


G > p(s,r) log 


where H(X ) is the source rate as defined by Shannon. The first term is maxi- 
mized by putting 


_ p(sr) _ p(s,r) _ 
a(s/r) = 3, plk,r) — q(r) -= q(s/r) 


Then Gmaz = H(X) — H(X/Y), which is the rate of transmission defined by 
Shannon. 


WHEN THE ODDS ARE NOT FAIR 


Consider the case where there is no track take, i.e., 
1 
Ein 
Os 
but where a, is not necessarily 
1 


p(s) 


It is still permissible to set >), a(s/r) = 1 since the gambler can effectively hold 
back any amount of money by betting it in proportion to the 1/o,. Equation 
(1) now can be written 


G = X p(s,r) loga(s/r) + > p(s) log as. 
T8 8 
G is still maximized by placing a(s/r) = g(s/r) and 
Gmoz -H(X/Y) + > p(s) log Os 


H(a) - H(X/Y) 


li 


where 
H(a) = J pls) logas 


Several interesting facts emerge here i 
(a) In this case G is maximized as before by putting a(s/r) = q(s/r). That 
is, the gambler ignores the posted odds in placing his bets! 
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(b) Since the minimum value of H (o) subject to 


XL 
8 Os 
obtains when 
_ 1 
p(s) 

and H(X) = H(o), any deviation from fair odds helps the gambler. 

(c) Since the gambler’s exponential gain would be H(a) — H(X) if he had 
no inside information, we can interpret R = H(X) — H(X/Y) as the increase 


of Gmaz due to the communication channel. When there is no channel, i.e., 
H(X/Y) = H(X), Gmaz is minimized (at zero) by setting 


Qs = — 
Ds 
This gives further meaning to the concept "fair odds." 


WHEN THERE IS A "TRACK TAKE" 


In the case there is a “track take” the situation is more complicated. It can 
no longer be assumed that 5, a(s/r) = 1. The gambler cannot make canceling 
bets since he loses a percentage to the track. Let b, = 1 — $^, a(s/r), i.e., the 
fraction not bet when the received symbol is the r*^ one. Then the quantity to 
be maximized is 

G = V p(s,r)loglb, + a,a(s/r)), (2) 


r8 


subject to the constraints 
br + » a(s/r) — 1. 
6 


In maximizing (2) it is sufficient to maximize the terms involving a particular 


value of r and to do this separately for each value of r since both in (2) and in 
the associated constraints, terms involving different r's are independent. That 
is, we must maximize terms of the type 


G, = q(r) Y ^ a(s/7)log[b- + asa(s/7)) 


subject to the constraint 


bp + Y a(s/r) =1 
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Actually, each of these terms is the same form as that of the gambler's 
exponential gain where there is no channel 


G= »»50 log[b + o/;a(s)]. (3) 


8 


We will maximize (3) and interpret the results either as a typical term in the 
general problem or as the total exponential gain in the case of no communication 
channel. Let us designate by A the set of indices, s, for which a(s) > 0, and by 
A' the set for which a(s) = 0. Now at the desired maximum 


0G p(s)o, 
—— = —————— = À 
8a(s) b+ a{s)a, loge =k for se 
0G _ p(s) _ 
Ob — 2 b+als)as loge = k 
8G D(s)a, < ! 
= Uc < À 
Ba(s) b loge S k for se 


where k is a constant. The equations yield 


1- 
k=loge, b= 7-2 


fom 
a(s) = p(s) — 2 for seX 


& 


where p = Y, p(s), o = >>) (1/o;), and the inequalities yield 


1- 
p(s)o, Sb = ice . for se 


We will see that the conditions 


o «l1 
1- 

p(so, > i- : for se 
l~ 

pso, < e for seA' 


completely determine A. - 
If we permute indices so that 


p(s)a, 2 p(s + 1)o541 
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then A must consist of all s S t where t is a positive integer or zero. Consider 
how the fraction 


varies with t£, where 


t t 
p = 9 pls), a= oo Fo =1 
1 1 08 


Now if p(1)o; < 1, F; increases with t until c; 2 1. In this case t = 0 satisfies 
the desired conditions and A is empty. If p(1)a, > 1 F; decreases with t until 
p(t + 1)o541 < Fi or o, 2 1. If the former occurs, i.e., p(t + lary; < Fi, then 
Fi41 > F; and the fraction increases until o; 2 1. In any case the desired value 
of t is the one which gives PF; its minimum positive value, or if there is more 
than one such value of t, the smallest. The maximizing process may be summed 


up as follows: 
(a) Permute indices so that p(s)o, 2 p(s + 1)og41 


(b) Set b equal to the minimum positive value of 


1-p : 2E 
LT—— where p, = s), 04 = — 
Io, Pt dvs) t ars 


(c) Set a(s) = p(s) — b/o, or zero, whichever is larger. (The a(s) will sum to 
1—b.) 


The desired maximum G will then be 


t 
1- 
Gmaz = Xs) log p(s)as + (1 — pz) log l-o 
where t is the smallest index which gives 


l-m 
l-e: 


its minimum positive value. 

It should be noted that if p(s)a, < 1 for all s no bets are placed, but if 
the largest p(s)a, > 1 some bets might be made for which p(s)a, < 1, i.e., the 
expected gain is negative. This violates the criterion of the classical gambler 
who never bets on such an event. | 


CONCLUSION 


The gambler introduced here follows an essentially different criterion from 
the classical gambler. At every bet he maximizes the expected value of the 
logarithm of his capital. The reason has nothing to do with 
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the value function which he attached to his money, but merely with the fact 
that it is the logarithm which is additive in repeated bets and to which the 
law of large numbers applies. Suppose the situation were different; for example, 
suppose the gambler's wife allowed him to bet one dollar each week but not to 
reinvest his winnings. He should then maximize his expectation (expected value 
of capital) on each bet. He would bet all his available capital (one dollar) on 
the event yielding the highest expectation. With probability one he would get 
ahead of anyone dividing his money differently. 

It should be noted that we have only shown that our gambler's capital will 
surpass, with probability one, that of any gambler apportioning his money dif- 
ferently from ours but still in a fixed way for each received symbol, independent 
of time or past events. Theorems remain to be proved showing in what sense, if 
any, our strategy is superior to others involving a(s/r) which are not constant. 

Although the model adopted here is drawn from the real-life situation of 
gambling it is possible that it could apply to certain other economic situations. 
The essential requirements for the validity of the theory are the possibility of 
reinvestment of profits and the ability to control or vary the amount of money 
invested or bet in different categories. The "channel" of the theory might cor- 
respond to a real communication channel or simply to the totality of inside 
information available to the investor. 

Let us summarize briefly the results of this paper. If a gambler places bets on 
the input symbol to a communication channel and bets his money in the same 
proportion each time a particular symbol is received, his capital will grow (or 
shrink) exponentially. If the odds are consistent with the probabilities of occur- 
rence of the transmitted symbols (i.e., equal to their reciprocals), the maximum 
value of this exponential rate of growth will be equal to the rate of transmission 
of information. If the odds are not fair, i.e., not consistent with the transmit- 
ted symbol probabilities but consistent with some other set of probabilities, the 
maximum exponential rate of growth will be larger than it would have been 
with no channel by an amount equal to the rate of transmission of information. 
In case there is a “track take” similar results are obtained, but the formulae 
involved are more complex and have less direct information theoretic interpre- 


tations. 
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CRITERIA FOR CHOICE AMONG RISKY VENTURES 


HENRY ALLEN LATANE 
Chapel Hill, North Carolina 


THE SUBGOAL 


IS paper is concerned with the 
problem of how to make rational 
choices among strategies in situa- 

tions involving uncertainty. Such choices 
can be expressed through payout matrices 
stated in terms of some measure of value 
to be maximized. These matrices show 
the probabilities of al! relevant future oc- 
currences and the payouts resulting from 
the combined effects of each possible 
strategy, on the one hand, and each 
relevant future occurrence, on the other.’ 
All this information is needed to choose 
the proper strategy rationally. It would 


! Payout matrices are shown in Tables 1, 2, and 
3. The payouts represent the possible final outcomes 
of choices among strategies. The matrices have 
single-valued payouts and probabilities. Many, if 
not all, decision problems can be reduced to such 
form. Consider first the probabilities. A subjective 
probability distribution of an imperfectly known 
underlying probability can be reduced to a subjec- 
tive probability of the event itself. For example, sup- 
pose a gambler believes that there is a 0.5 probabil- 
ity that a coin is biased so that it always comes up 
tails and a 0.5 probability that it is unbiased. He 
has a subjective probability of 0.25 for heads and 
0.75 for tails, and these probabilities would be used 
in his payout matrix as long as his probability be- 
liefs remain unchanged. Consider next the payouts. 
In much discussion of decision theory the payouts 
are taken as given, with only the probabilities sub- 
ject to uncertainty. However, in real life the sizes 
of the payouts often are as subject to uncertainty as 
are the probabilities. But, even when the payouts 
are uncertain a matrix filled in with single values 
can be constructed. If we have probability distribu- 
tions of payouts for all specified occurrences (such as 
heads and tails in the toss of a coin), a payout matrix 
can be constructed listing each possible payout and 
the subjective probability of its occurrence. These 
large matrices often can be reduced to simple two- 
valued distributions of payouts without much loss of 
information. 


be impossible for a decision-maker to 
choose rationally among strategies if he 
disregarded either the probability of the 
relevant future occurrences or any of the 
possible payouts. 

The problem of rational decision- 
making can be broken down into three 
steps: (1) deciding upon an objective and 
criteria for choosing among strategies; 
(2) filling out a payout matrix; and (3) 
choosing among available strategies on 
the basis of this matrix and the criteria. 
In real life the second step—deciding 
upon the size of the payout matrix, meas- 
ured by the number of columns represent- 
ing relevant future occurrences and rows 
representing available strategies, and 
filling in the matrix with reasonable esti- 
mates of payouts and probabilities—is 
by far the most difficult part of the 
decision-maker’s job. This paper has 
little to say about these problems. It 
deals largely with the first step: the prob- 
lem of setting up criteria for choosing 
among strategies on the basis of a filled- 
in payout matrix. 

A hierarchy of goals and guides for 
reaching these goals is involved in ra- 
tional choices among strategies. This 
hierarchy consists of (1) a goal; (2) a 
subgoal; and (3) a criterion for choosing 
among strategies to reach the subgoal, 
that is, a measure that must be maxi- 
mized to attain the subgoal. The goal in 
rational decision-making is the maxi- 
mization of some measure of value. Each 
decision is made for the sake of the 
difference it will make in terms of this 
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objective. The decision-maker is con- 
fronted with a payout matrix expressed 
in terms of either a subjective utility 
measure such as utiles or an objective 
measure such as money or bushels of 
wheat. He wishes to choose the strategy 
that will give him the maximum payout. 
This is his goal. When some one strategy 
gives a higher payout than any other 
strategy in all relevant future occur- 
rences, the goal itself enables the de- 
cision-maker to choose among strategies. 
He merely chooses the dominant strategy. 

When there is no strategy superior to 
all tbe rest in all possible future occur- 
rences, the decision-maker needs some 
other guide for making decisions, since 
the goal itself does not enable him to 
make his choice. This guide is here called 
the "subgoal'" The need for a subgoal 
exists because the outcome of specific 
strategies is subject to probabilistic un- 
certainty. In utility theory the payout 
matrix is expressed in terms of some 
measure of subjective utility, say, utiles. 
Choice of the strategy that will give the 
maximum payout in utiles is the goal, 
and choice of the strategy with the maxi- 
mum expected utility? is taken as the 
subgoal. Given a completely filled-in 
matrix, this subgoal can surely be 
reached. Whether or not the goal is 
reached depends on future occurrences, 
but, in any event, the subgoal of maxi- 
mization of the expected value of the pay- 
outs expressed in utiles is logically re- 
lated to the goal of maximization of the 
forthcoming payout also expressed in 
utiles. 

In this paper a second subgoal is pro- 
posed for use when the choice is repe- 
titious and has cumulative effects and 


? The expected utility of a strategy is computed 
by multiplying all possible payouts expressed in 
utiles by their respective probabilities and then 
summing the products. 
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when the goal is maximization of wealth 
at the end of a large number of choices. 
Under these conditions the choice of the 
strategy that has a greater probability 
(P') of leading to as much or more 
wealth than any other significantly dif- 
ferent strategy at the end of a large 
number of choices also is a logical sub- 
goal. The P' subgoal is not as general as 
the maximum expected utility subgoal. 
For example, it would not apply to 
unique choices. When a man is faced with 
a once-in-a-lifetime choice of risking his 
whole fortune and his life on a venture 
that will produce great rewards if suc- 
cessful, it does not help him to know that 
he is almost certain to be ruined if he 
takes such a risk often enough. The P' 
subgoal is not logically related to the 
goal in this case.? Such a man, however, 
could set up a payout matrix expressed 
in utiles and decide which course of 
action maximized his expected utility. 
Here the maximum expected utility sub- 
goal is logically related to the goal even 
though the P’ subgoal is not. The P’ sub- 
goal is less general but would seem to be 
more operational than the expected util- 
ity subgoal because of the difficulty of 
constructing a payout matrix expressed 
in terms of utiles, especially if the de- 
cision involves a firm or group of people. 

The P’ subgoal would seem to be par- 
ticularly applicable to many business 

3 For certain utility functions and for certain re- 
peated gambles, no amount of repetition justifies the 
rule that the gamble which is almost sure to bring 
the greatest wealth is the preferable one. For ex- 
ample, the P’ subgoal is not appropriate for a 
decision-maker for whom the possibility of great 
gain, however small and diminishing, is more im- 
portant than maximization of the probability of as 
much or more wealth than can be obtained by any 
other strategy in the long run. Such a decision- 
maker may adopt a course of action that is almost 
certain to result in less wealth in the long run. 
Whether or not his utility function is compatible 


with the specified goal of maximum long-run wealth 
is not at issue here. 


Criteria for Choice among Risky Ventures 
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decisions such as those involved in port- 
folio management. Wealth-holders have 
the option of holding their wealth in 
many different combinations of stocks, 
bonds, and cash. The allocation of 
wealth among these types of assets in- 
volves a series of choices extending over 
time. The fact that these choices are 
repetitive in nature with cumulative ef- 
fects may be used as the key factor in 
defining a goal, a subgoal, and a cri- 
terion for choosing among portfolios. 
The problem of choice among port- 
folios may be stated in terms of the pay- 
out matrix in Table 1. In this table p; 


TABLE 1 
PAYOUT MATRIX FOR VARYING PORTFOLIOS 


PogTFOLIO RELEVANT FUTURE OCCURRENCES 


1,...,9,...,28 
Loe eee ee Quy... 01 e Gk 
doses Gn, .... Vij. ey Qi 
[lll pn, PU o Gee 
Probability of occur- 
rence. 1... 2.0.04. Pr, oy Êj., De 


represents the probability of the jth oc- 
currence, with £p; = 1, and ai; repre- 
sents the return from the ith portfolio 
with z = 1,..., £ if the 7th occurrence 
takes place, withy = 1,..., & A return 
is the payout, including return of princi- 
pal, per dollar of portfolio value per in- 
vestment period (here called "'year"). 
Returns cannot be negative, so that 
a; Z 0. The portfolio manager is faced 
with such a payout matrix for n years 
and wants to choose in a rational manner 
one portfolio from all available port- 
folios in each of the # years.‘ 

The goal of portfolio management is 
taken to be to select a portfolio so as to 
niaximize wealth at the end of a period 
of years, assuming reinvestment of all re- 
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turns.? Let W? be the final value of $1.00 
placed in portfolio 7 if returns are rein- 
vested n times. Then the goal of port- 
folio management is taken to be to 
select the optimum portfolio so that 
Win = Wi, with i= 1,...,f. This 
goal cannot be used as a basis for choice 
among portfolios, since which portfolio 
will have the maximum W^ depends on 
future occurrences. 

The subgoal proposed here is the 
choice of the portfolio that has a greater 
probability (P’) of being as valuable or 
more valuable than any other signifi- 
cantly different portfolio at the end of n 
years, n being large. It is shown below 


4 The idea of maximizing wealth at the end of a 
large number of separate decisions based on the same 
payout matrix may appear unrealistic, but portfolio 
managers are continually being faced with choices 
having cumulative effects and involving approxi- 
mately the same payouts and probabilities time after 
time. For example, year after year a portfolio man- 
ager may have probability beliefs such as: “I look 
for conditions in the next ten years to be very similar 
to those prevailing in 1926 through 1935. Bonds will 
yield about 4 per cent per annum during the whole 
period. Some day we are going to have a boom and 
a bust in the stock market, but I do not know which 
is going to come first," Choosing one portfolio to 
hold in each of the » years is not the same as choos- 
ing one portfolio at the beginning of the period to 
hold throughout the years. For example, if the 
probability beliefs at the beginning of one year are 
such that the maximum P’ allocation of the port- 
folio is 40 per cent in bonds and 60 per cent in stock 
and if these beliefs remain the same at the beginning 
of the next year, then the maximum P’ allocation 
again will be 40 per cent in bonds and 60 per cent in 
stock at the beginning of the second year. If the rela- 
tive prices of stocks and bonds have changed be- 
tween the two dates, it will be necessary for thc 
portfolio manager to make some sales and purchases 
in his portfolio to bring it into line with the desired 
proportions even if these proportions themselves 
have not changed. 

5 Few wealth-holders reinvest all returns, so the 
problem of maximizing wealth assuming no with- 
drawals is somethat unrealistic. However, this re- 
striction can be modified, If withdrawals per unit of 
time are a fixed proportion of wealth (considered as 
interest, for example), they will not affect proper 
maximizing action. Whatever would maximize 
wealth, assuming no withdrawals, would maximize 
wealth, assuming proportionate withdrawals. 
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that the portfolio having a probability 
distribution of returns with the highest 
geometric mean, G, also has the greatest 
P'. 

The central fact of this paper is a 
simple one: If the value of an asset, say, 
portfolio 7, priced initially at $1.00 is be- 
lieved to change after a year to, alterna- 
tively, aa or Qiz ..., OY Gy, with re- 
spective probabilities pi, po,..., br 
and if the proceeds are reinvested n 
times, then the final value of the invest- 
ment, W7, ‘‘converges in probability" to 
G? = at * + af". The prob- 
ability that the absolute difference be- 
tween W7 and G? is smaller than any 
preassigned positive number will ap- 
proach 1 as # increases indefinitely. In 
other words, the final return from $1.00 
invested in portfolio 7, assuming rein- 
vestment of all annual returns for n years 
will converge in probability on G?, the 
nth power of the geometric mean of the 
probability distribution of annual re- 
turns from that portfolio. This relation- 
ship is intuitively obvious, since the aa 
return will “tend” to occur np times, the 
ai; return will tend to occur np; times, 
and so forth, if n is large. It can be 
proved rigorously by use of the law of 
large numbers applied to the logarithms 
of the annual returns. 

Let n; be the number of occurrences of 
the jth relevant future occurrence, with 
En; on, with j= 1,..., k, then n,/ 
n im P; and mj/mlog a; tim Ps log ai; as 
4 o. But log W; = ZEnj/n log aij 
with 7 = 1,..., & and log G; = Zy log 
aij, S0 log W: im log G: and Wi im Gi as 
n— c Jt follows from this that, if 
G; > Gj, then the probability, P', that 
Wi > Wj at the end of n years ap- 
proaches 1 as n increases indefinitely. 
The portfolio with the highest G is al- 
most certain to be more valuable than 
any other significantly different port- 
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folio in the long run. l'or this reason G is 
accepted here as a rational criterion for 
choice among portfolios. 


SUBGOALS AND SUBJECTIVE UTILITY 


Rational choice among strategles is 
the ancient problem of the gambler who 
has the option to choose among bets. 
Classical writers on probability theory 
recommended that problems of this kind 
be solved by first computing the ex- 
pected winnings (possibly negative) for 
each available bet and then choosing the 
bet with the highest mathematical ex- 
pectation of winning. Since there was no 
reason to assume that, of two persons 
encountering identical risks, either 
should expect to have his desires more 
closely fulfilled, the classical writers 
thought that no characteristic of the 
risk-takers themselves ought to be taken. 
into consideration; only those matters 
should be weighed carefully that pertain 
to the terms of the risk." In 1738 Daniel 
Bernoulli in four short paragraphs dem- 
onstrated that the use of the mathe- 
matical expectation of winnings did not 
always apply and proposed instead that 
gamblers should evaluate bets on the 
basis of the mathematical expectation of 
the utilities of winnings.* 

In terms of subgoals as defined in this 
study, Bernoulli showed that use of the 

é The asymptotic quality of G is used in informa- 
tion theory as developed by Dr. Claude Shannon 
and was applied to a gambling situation by John 
Kelly in “A New Interpretation of Information 
Rate,” Bell System Technical Journal, August, 1956, 
pp. 917-26. See also R. Bellman and R. Kalaba, 
‘Dynamic Programming and Statistical Communi- 
cation Theory," Proceedings of the National Academy 
of Science, XLIII (1957), 749-51. I had no knowl- 
edge of this work when I first proposed the P’ sub- 


goal at a Cowles Foundation Seminar in February, 
1956. 


*See Daniel Bernoulli, Exposition of a New 
Theory on the Measurement of Risk," trans. Louise 
Sommer, Economelrica, XXII (January, 1954), 23. 


8 Ibid., p. 24. 
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expected-value subgoal did not always 
lead to choices that seemed rational to 
him and proposed instead the use of the 
expected-utility subgoal. He used the fol- 
lowing example: 

Somehow a very poor fellow obtains a lottery 
ticket that will yield with equal probability 
either nothing or twenty thousand ducats. Will 
this man evaluate his chances of winuing at teu 
thousand ducats? Would he not be ill-advised to 
sell this lottery ticket for nine thousand 
ducats? To me it seems that the answer is in the 
negative. On the other hand I am inclined to 
believe that a rich man would be ill-advised to 
refuse to buy the lottery ticket for nine thou- 
sand ducats. If I am not wrong then it seems 
clear that al] men cannot use the same rule to 
evaluate the gamble.? 


Bernoulli's example is somewhat aside 
from the daily business of living, but, 
when stripped of its gambling wrappings 
and expressed in terms of payouts and 
returns, it is seen to represent a major 
segment of economic decision-making. 
The hypothetical market price of the 
ticket, which has an equal probability of 
paying 20,000 ducats or nothing, is 9,000 
ducats. Both the poor man and the rich 
man have the option either to hold the 
lottery ticket or to hold 9,000 ducats. 
Possible payouts range from 2.22 per 
ducat risked to 0. Payouts with ranges 
such as this—indeed, much greater 
ranges—are ordinary economic occur- 
rences. The magnitude of the choice 
faced by the rich man is well within the 
range of ordinary business decisions, and 
the "poor man" today is continually 
faced with implicit or explicit decisions as 
serious as that faced by Bernoulli’s 
lottery-ticket owner. He must decide 
whether to move to a new job, buy a new 
home, sign a second mortgage. He is con- 
tinually offered the opportunity to un- 
dertake such risky ventures as purchas- 
ing his own truck, opening a restaurant, 


* Ibid. 
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buying some uranium stock, some oil 
stock, or some investment shares. Some 
of these options may be highly ad- 
vantageous, and he must choose some 
one course of action in each case. The ef- 
fects of these choices are cumulative; 
that is, the decision-maker never comes 
back to exactly the same position he 
occupied before making his choice. The 
major difference between Bernoulli’s 
problem and other choices among courses 
of action is that the ticket-holder's 
choice is clearly defined, while the other 
opportunities are usually ignored, or the 
choices are muddled. 


TABLE 2 
PAYOUT MATRIX OF GAINS AND LOSSES 


FUTURE OCCURRENCE 


Ticket Ticket Crr- 
STRATEGY Wins Loses TERION A 
a) Poor man: 
Hold ticket... 20 0 10 
Sell ticket.... 9 9 9 
b) Rich man: 
Buy ticket... 11 - 9 1 
Not buy ticket 0 0 0 
Probability of oc- 
currence........ 0.5 0.5 


Thus Bernoulli’s example is represent- 
ative of a wide class of choices. The 
decision-maker is being faced continually 
with such choices, and the outcome of 
each decision affects his entire future. In 
the following discussion this example is 
stated in payout matrices constructed to 
illustrate choices based on (a) classical 
mathematical expectation (the expected 
value subgoal); (b) Bernoulli's subjective 
utility (the expected-utility subgoal); 
and (c) the maximum chance (7) sub- 
goal. 

Table 2 shows the classical approach 
to choosing among risky ventures. The 
payout matrix, expressed in terms of 
thousands of ducats, shows the probabil- 
ity of the lottery ticket paying off or not 
and the net payout to the poor man and 
to the rich man for each of two courses of 
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action. The classical writers would calcu- 
late the mathematical expectation, 4, of 
the net payouts and choose that strategy 
which maximizes A. In this case they 
would recommend that the rich man buy 
the ticket for 9,000 ducats and that the 
poor man refuse to sell it at this price. 
The mathematical expectation (that 
is, the arithmetic mean) of the probabil- 
ity distribution of payouts is, indeed, a 
good criterion when there are large 
numbers of independent trials. Even 
decision-makers who make repeated 
choices with cumulative effects (for ex- 
ample, the operators of roulette wheels 
and insurance companies) are rightly 
interested in this average when each risk 
is small in relation to total wealth. There 
is little or no conflict under these condi- 
tions between the use of the arithmetic 
mean as a criterion and the use of the 
geometric mean of the probability dis- 
tribution of payouts per dollar of wealth 
as a criterion. When a decision-maker 
can surely bet the same small amount on 
a large number of independent trials, he 
can maximize the expected value of his 
gain, and also the likelihood of having 
more gain than can be obtained by any 
other plan, by choosing that set of bets 
which gives him the greatest mathe- 
matically expected payout. For example, 
if Bernoulli's poor man had found 10,000 
tickets involving 10,000 independent 
drawings, each with a payout equally 
likely to be 2 ducats or nothing, he clear- 
ly would be unwise to sell his block of 
tickets for 9,000 ducats. His winnings on 
10,000 different trials would be almost 


1? When a gambler who has the choice of betting 
or not betting bets all his wealth on the toss of a 
fair coin with a payout of $3.00 per $1.00 bet if 
heads occur and nothing per $1.00 bet if tails occur, 
he is maximizing the expected value of the payout 
but not G. When he can bet only 1 per cent of his 
wealth, however, he will maximize both 4 and G by 
betting. 
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certainly very close to 10,000 ducats, the 
mathematical expectation of the value of 
the set of tickets, and the advice of the 
classical writers would be sound. 

Bernoulli used the lottery-ticket ex- 
ample to show that the expected values of 
the payouts are not good guides in mak- 
ing choices involving large risks. He pro- 
posed instead that the expected value of 
the utilities of the payouts be used as a 
criterion. He would fill in the payout 
matrix in Table 2 not with the money 
value of the gains and losses but with 
their utilities and then would use the 
mathematical expectation of these utili- 
ties as his criterion. 

Whether or not particular payout 
matrices expressed in terms of subjective 
utility are realistic is not a problem here. 
But Bernoulli's procedure is very much 
at issue. He defines the “mean utility" 
of a course of action as the mathematical 
expectation of the probability distribu- 
tion of the possible utilities from that 
course of action. He then states, with no 
discussion, that this mean utility can be 
used as a basis for valuing risks, that is, 
as a basis for choosing among courses of 
action. In other words, he explains why 
he expresses his profits (or losses) in 
terms of subjective utility, but he does 
not give any justification for maximizing 
the mathematical expectation of these 
utilities. Bernoulli’s use of subjective 
utility has had wide recognition, and his 
use of mathematical expectation also has 
been widely adopted with little or no 
discussion." 


n Mathematical expectation now is used as a 
basis for defining utility. The present emphasis on 
the axiomatic approach to utility is largely derived 
from John von Neumann and Oskar Morgenstern, 
Theory of Games and Economic Behavior (rev. ed.; 
Princeton, N.J.: Princeton University Press, 1935). 
They say: “We have practically defined numerical 
utility as being that thing for which the calculus of 
mathematical expectations is legitimate" (p. 28). 
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Bernoulli's problem can also be solved 
by the use of the maximum-chance (P’) 
subgoal. Table 3 shows the payout 
matrix of returns (that is, payouts, in- 
cluding return of principal, per dollar of 
wealth) for the poor man, who is as- 
sumed to have a wealth of 1,000 ducats 
aside from his lottery ticket, and for the 
rich man, who is assumed to have a 
wealth of 100,000 ducats. The arithmetic 
mean, A, of the probability distribution 
of returns is higher for the poor man 
when he holds the ticket and for the rich 
man when he buys the ticket. The geo- 


TABLE 3 
PAYOUT MATRIX OF RETURNS 


Future OCCURRENCE 


Ticket Ticket CRITERION 
STRATEGY Wins Loses A G 
a) Poor man: 
Hold ticket. 2.1 0.1 1.1 0.46 
Sell ticket.. 1.0 1.0 1.0 1.0 
b) Rich man: 
Buy ticket. 1.11 0.91 1.01 1.005 
Not buy 
ticket.... 1.00 1.00 1.00 1.00 
Probability of oc- 
currence...... 0.5 0.5 


metric mean, G, of returns for the poor 
man is higher when the ticket is sold, 
however, and G for the rich man is higher 
when he buys the ticket. 

Over a long enough period of time 
many economic choices involving returns 
of the same order of magnitude repeat 
themselves. Bernoulli’s poor man may 
never find another lottery ticket, but he 
probably will have many options among 
courses of action with as wide, or wider, 
a range of returns. It is assumed here 
that both the rich man and the poor man 
will have many opportunities to risk the 
same proportions of their respective 
fortunes on approximately the same 
terms and that both men prefer more 
wealth to less wealth, everything else 
being equal. If these assumptions are 
valid, the maximization of P’, the prob- 
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ability of having more wealth at the end 
of a long series of such choices than can 
be obtained by any other specified 
course of action, is a rational subgoal, 
and G is a rational criterion. The use of 
the maximum-chance subgoal results in 
courses of action for both the rich man 
and the poor man which seemed rational 
to Bernoulli. 

The decision-maker who is interested 
in maximizing his wealth at the end of a 
long series of choices should ask himself 
how he would come out in the long run if 
he made the same choice on the same 
terms over and over again. It is not 
necessary for him to ask himself what his 
individual subjective utility of winning 
is. This is not to say that other goals, 
rather than the goal of maximum wealth 
at the end of a long series of choices, are 
irrational. Indeed, the use of subgoals 
based on the goal of maximum wealth 
often may be irrational. For example, the 
man who desperately needs $10.00 to 
escape a jail sentence and who has only 
$1.00 may well be justified in taking a 
gamble to get his money, even though 
this gamble would not stand the maxi- 
mum-chance subgoal test. Even under 
these conditions, however, it would be 
useful for the man to know that he 
should not often act in such a manner, if 
he wants to build up his fortune so as to 
avoid similar predicaments in the future. 

In his paper Bernoulli uses the ex- 
pected utilities of the payouts as his 
criterion. He then reaches the conclusion 
that the utility resulting from any small 
increase in wealth usually is inversely 
proportional to the quantity of goods 
previously possessed. Under these con- 
ditions the utilities of the returns vary as 
their logarithms, and the geometric 
mean, G, of the probability distributions 


V This is generally credited with being the first 
use of a utility function. 
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of returns can be used as a criterion in- 
stead of expected utility. The arithmetic 
mean of the logarithms (utilities) of re- 
turns is maximized when G is maxi- 
mized." 

Bernoulli gives a number of applica- 
tions of his formula to gambling and to 
insurance. In each instance he is able to 
give a specific answer. He says that 
everyone who bets any part of his fortune 
on a mathematically fair game of chance 
is acting irrationally, and he then de- 
termines what odds a gambler with a 
specified fortune must obtain to break 
even in the long run. Most of his prob- 
lems still are interesting in their own 
right, and many have a bearing on proper 
portfolio management. For instance, he 
demonstrates with numerical examples 
the advantages of diversification among 
equally risky ventures and between 
risky and safe assets. 

Bernoulli’s approach to the valuation 
of risky ventures is not contradictory to 
the maximum-chance (P') approach. 
Not only do the two approaches lead to 
the same conclusion when they both can 
be applied but they tend to support each 
other, Wealth-holders may be divided 
into two groups. The first group contains 
those to whom each risk is a unique event 
either because they do not expect it to 
recur or because they keep its effects en- 
tirely separate from the results of other 
risks. For example, the man who each 
year sets aside a small sum to bet on the 
races during his vacation, with the in- 
tention of “living it up" if he wins and 
writing it off to experience if he loses, 
presumably is not actuated by long-run 
profit-maximizing motives. The effects of 

13 As pointed out to me by Professor L. J. Savage 
(in correspondence), not only is the maximization of 
G the rule for maximum expected utility in connec- 
tion with Bernoulli's function but (insofar as certain 


approximations are permissible) this same rule is ap- 
proximately valid for all utility functions. 
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each risk are kept separate. Analysis 
based on maximum chance has nothing 
to offer this first class of wealth-holders. 
The choice between profit and safety or 
expected return and variance is a matter 
of subjective utility. Bernoulli's assump- 
tion that the satisfaction derived from a 
small gain tends to vary in inverse pro- 
portion to the initial wealth may or may 
not be a shrewd guess. 

The second class of wealth-holders in- 
cludes those who expect to be faced re- 
peatedly with risks of the same general 
type and magnitude. This group in- 
cludes those making most business and 
portfolio decisions and hence is of great 
importance. It includes, specifically, all 
those who want to maximize the value of 
their portfolio at the end of n years, 
assuming reinvestment of all returns. 
Here there is a definite rule for choosing 
between risk and return, the P' subgoal, 
based on maximum-chance principles. 
This class may be subdivided further 
into (a) those who undertake only one 
risky venture at a time and (b) those who 
are able to diversify their risky ventures. 
Because so many economic phenomena, 
including yields on stocks, tend to fluctu- 
ate together over time, diversification 
among risky ventures cannot go as far 
toward eliminating risk as otherwise 
would be the case. Final choice among 
efficient portfolios for both groups, (a) 
and (b), is based on maximization of G, 
not because this maximizes subjective 
utility, but because it maximizes P’. 

Bernoulli states that the wealth- 
holder should ask himself whether the 
added satisfaction associated with the 
expected gain justifies undertaking the 
risky venture. He bases an exact rule of 
behavior on his assumption as to how the 
added satisfaction varies with the size of 
the potential gain or loss in relation to 
the size of the portfolio. The rule may or 
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may not be empirically useful, but it is 
grounded on rather shaky evidence about 
the exact shape of the utility function. 
According to maximum-chance analysis, 
the wealth-holder or portfolio manager 
should ask himself how he can maximize 
his chances of getting as good or better 
return than can be obtained with any 
other specified plan, assuming that he 
risks the same proportion of his portfolio 
on the same terms over and over again. It 
turns out that the formula which enables 
the portfolio manager to answer the 
maximum-chance question is the same as 
that developed by Bernoulli on grounds 
of subjective utility. 

In conclusion Bernoulli says: 

Though a person who is fairly judicious by 
natural instinct might have realized and spon- 
taneously applied much of what I have here 
explained, hardly anyone believed it possible to 
define these problems with the precision we 
have employed in our examples. Since all of our 
propositions harmonize perfectly with experi- 
ence it would be wrong to neglect them as 


abstractions resting upon precarious hy- 
potheses.! t 


Professor Stigler, in a review article, 
gives considerable space to Bernoulli's 
hypothesis about the slope of the wealth- 
holder's utility function, even though the 
major emphasis of the article is on utility 
not affected by probability. He mentions 
that Laplace and Marshall, among 
others, have accepted the law as a 
realistic guide. He also points out the 
similarity of Bernoulli’s law to the 
Weber-Fechner psychological hypothesis 
that the just noticeable increment to any 
stimulus is proportional to the stimulus. 
Stigler says: “Bernoulli was right in 
seeking the explanation’ in utility and 

14 Op. cil., p. 31. 

16 George J. Stigler, “The Development of Util- 


ity Theory," Journal of Political Economy, LVIII 
(1950), 373-77. 
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he was wrong only in making a special 
assumption with respect to the slope of 
the utility curve for which there was no 
evidence and which he submitted to no 
tests. "7 

More recently Savage in a section on 
“Historical and Critical Comments on 
Utility" had this to say: 

Bernouli went further than the law of 
diminishing marginal utility and suggested that 
the slope of utility as a function of wealth 
might, at least as a rule of thumb, be supposed, 
not only to decrease with, but to be inversely 
proportional to, the cash value of wealth. To 
this day, no other function has been suggested 
as à better prototype for Everyman's utility 
function. .. . Though it might be a reasonable 
approximation to a person's utility in a moder- 
ate range of wealth, it cannot be taken seriously 
over extreme ranges? 


INDIVIDUAL RISK PREFERENCE 


As indicated in the previous section, 
Bernoulli took the following steps to de- 
velop his utility function and to justify 
diversification among risky ventures and 
between risk assets and safe assets. (1) 
He showed—subject to the implicit as- 
sumption about subgoals previously dis- 
cussed—that the value of a risky venture 
to the individual wealth-holder is not the 
arithmetic mean of the probability dis- 
tribution of returns (the mathematical 
expectation of returns) but may be taken 
to be the arithmetic mean of the prob- 
ability distribution of the utilities of the 
returns. (2) He stated that, in the ab- 
sence of the unusual, the gain in utility 


18 Bernoulli is explaining the reason for the 
limited value of the game involved in the St. Peters- 
burg paradox. This game is a type of risky venture 
with an infinitely large mathematically expected 
value but with an extremely small probability of 
winning. 


17 Stigler, op. cit., p. 375. 


18 Leonard J. Savage, The Foundations of Sta- 
tistics (New York: John Wiley & Sons, 1954), p. 94. 
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resulting from any small increase in 
wealth may be assumed to be inversely 
proportional to the quantity of goods 
previously possessed. (3) He developed a 
formula for calculating the utility of a 
risk asset to the individual wealth-holder 
using as a criterion the utility function 
developed in step 2. According to Ber- 
noulli, the subjective utility of the 
wealth-holder's assets, including the 
risky venture, is measured by the geo- 
metric mean, C, of the probability dis- 
tribution of payouts from such assets. (4) 
Using this formula, he was able to calcu- 
late exactly the utility of the wealth- 
holder's assets, including the risky ven- 
ture, and to show that diversification 
among risky ventures increases the 
utility .!? 

Bernoulli's step 2 may be a reasonable 
assumption about utility,"? but it is sub- 
ject to so many qualifications and excep- 
tions (it does not explain gambling, for 
example) that it has not been accepted 
as a suitable basis for erecting the super- 
structure of steps 3 and 4. The valuation 
of risky ventures has been left to indi- 
vidual risk preference without any cri- 
terion for deciding what this preference 
is likely to be. For example, Makower 
and Marschak present a hypothetical 
table in which an asset's marginal con- 
tribution is determined by adding to- 
gether its contribution to ‘‘lucrativity” 
and safety measured in “‘lucrativity 


19 Bernoulli, op. cit., pp. 24, 25, 28, 30. 


2 Cf. Alfred Marshall, Principles of Economics 
(8th ed.; New York: Macmillan Co., 1950), p. 135. 
Marshall says: "In accordance with a suggestion 
made by Daniel Bernoulli, we may regard the satis- 
faction which a person derives from his income as 
commencing when he has enough to support life, 
and afterwards as increasing by equal amounts with 
every equal successive percentage that is added to 
his income; and vice versa for loss of income." 

See also Savage's comment quoted previously. 
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units" determined by the safety prefer- 
ence rate for a single individual.? These 
individual safety preference rates, in 
turn, are a matter of taste and must be 
accepted as given. Friedman and Savage 
build on Bernoulli's step 1 but modify 
step 2 by developing a doubly inflected 
curve comparing utility with income.” 

Markowitz begins his analysis of port- 
folio selection by pointing out that “the 
portfolio with the maximum expected re- 
turn is not necessarily the one with the 
minimum variance. There is a rate at 
which the investor can gain expected re- 
turn by taking on variance, or reduce 
variance by giving up expected return."?? 
He assumes that the investor considers, 
or should consider, expected return a de- 
sirable thing and variance of return an 
undesirable thing, and he defines an ef- 
ficient portfolio as a portfolio with 
minimum variance for a given expected 
return or more and a maximum expected 
return for a given variance or less. He 
develops a method for selecting efficient 
portfolios from the set of all possible 
portfolios but does not give any basis for 
choice among the efficient portfolios ex- 
cept the individual's safety preference 
rate. 


THE NEED FOR AN OBJECTIVE 
CRITERION 


The difficulty of evaluating subjective 
risk preference and the need of an objec- 
tive criterion are well indicated in the 


21 Helen Makower and Jacob Marschak, “Assets, 
Prices and Marketing Theory," Economica, V 
(1938), 261-88. Reprinted in American Economic 
Association, Readings im Price Theory (Chicago: 
Richard D. Irwin, Inc., 1952), pp. 301-2. 

2 Milton Friedman and L. J. Savage, “The Util- 
ity Analysis of Choices Involving Risk," Journal of 
Political Economy, LVI (1948), 279-304. 

2 Harry Markowitz, “Portfolio Selection,” Jour- 
nal of Finance, VII (March, 1952), 79. 
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following quotation from a recent journal 
article dealing with selection of an opti- 
mum combination of crops for a farmer: 


The introduction of risk into an economic 
model of a firm and consequently into a linear 
programming model of a firm has been accom- 
plished by describing risky outcomes as prob- 
ability distributions and choosing from among 
alternate possible distributions by the expected 
utility hypothesis. 

Two basic weaknesses have appeared in 
applying this method of incorporating risk. One 
difficulty arises in choosing a value for the 
constant a, which in this case is some sort of 
risk aversion indicator, and is, to some degree, 
governed by the personal characteristics of the 
entrepreneur. A large value for a indicates that 
the entrepreneur places a great weight on the 
variance as a deciding factor and is consequent- 
ly highly averse to risk, and vice versa. The 
estimation of such a constant to be used in a 
model is thus quite important; the wrong 
choice will invalidate any results obtained. The 
derivation of this constant is a delicate task 
beyond the scope of this paper.?* 


A major advantage of the criterion for 
choice among risky ventures developed 
in this paper is that it avoids the 
necessity for direct subjective determi- 
nation of such factors as Marschak’s 
“lucrativity units" or Freund’s ‘risk 
aversion indicator." As Roy remarks, “A 
man who seeks advice about his actions 
will not be grateful for the suggestion 
that he maximize expected utility.” 
The criteria for choice between risk 
and safety in portfolio management can 
be illustrated by assuming that a gam- 
bler has the choice of holding his money 
in cash or of betting on a gambling de- 
vice which, with equal probability, will 
return R—s on loss occasions and R+s 
on gain occasions with an expected re- 
^! Rudolph J. Freund, “The Introduction of Risk 


into a Programming Model,” Econometrica, XXIV 
(July, 1956), 253-63. 


2 A.D. Roy, “Safety First and the Holding of 
Assets," Econometrica XX (1952), 433. 
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turn of R per dollar played. The gam- 
bler’s portfolio at any time consists of 
the proportion of his wealth held in cash 
plus the proportion bet on the gambling 
device. When the gambler bets none of 
his wealth, the expected return from his 
portfolio is 1, and the standard deviation 
of returns is 0. As the proportion bet in- 
creases, both the expected portfolio re- 
turn and the standard deviation of re- 
turns increase. When he bets all his 
wealth, the expected portfolio return is 
R, and the expected standard deviation 
of returns is s. As long as R is greater 
than 1, and R— s is less than 1, all pos- 
sible combinations of the two assets in 
this range are efficient portfolios in that 
any one of the combinations gives the 
maximum possible expected return for 
some standard deviation or variance and 
the minimum standard deviation or vari- 
ance for some expected return. Neither 
Marschak nor Friedman and Savage nor 
Markowitz would be able to help the 
gambler in choosing among these efficient 
portfolios beyond telling him that he 
should gamble heavily if he has a high 
preference for risk and should be very 
conservative in his betting if he has a 
high risk-aversion factor. In this paper 
an attempt is made to give the gambler 
(and wealth-holders, in general) an ob- 
jective criterion for making this choice. 

The wealth-holder who adopts the 
maximum-chance (P’) subgoal can reach 
this subgoal by using the geometric 
mean, G, of the probability distribution of 
returns as his criterion and choose the 
strategy that has the probability distri- 
bution of returns with the highest G. 
Bernoulli also has shown that choice of 
that risky venture with the highest Gis a 
rational choice (1) if maximization of the 
mathematical expectation of the utilities 
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of the payouts is a rational subgoal and 
(2) if the utility of a small gain or loss 
varies inversely with the amount of 
wealth already possessed. 

Most economists recognize that the 
mathematical expectation and the vari- 
ance of the probability distribution of 
returns and the chance of ruin are im- 
portant to the wealth-holder—but they 
leave it to indrvidual risk preference to 
balance one factor against the others. 
Since G depends on both the mathe- 
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matical expectation and the variance of 
the probability distribution of returns, 
when G is maximized, there is no chance 
of ruin if the wealth-holder's probability 
beliefs are correct. Consequently, maxi- 
mization of G falls within the generally 
accepted range of rational behavior. This 
is not to say that G is the only rational 
criterion for choice among strategies; it 
is to say, however, that it 1s a useful 
criterion in dealing with a broad range of 
problems. 
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OPTIMAL GAMBLING SYSTEMS FOR 
FAVORABLE GAMES 


L. BREIMAN 
UNIVERSITY OF CALIFORNIA, LOS ANGELES 


1. Introduction 


Assume that we are hardened and unscrupulous types with an infinitely 
wealthy friend. We induce him to match any bet we wish to make on the event 
that a coin biased in our favor will turn up heads. That is, at every toss we have 
probability p > 1/2 of doubling the amount of our bet. If we are clever, as well 
as unscrupulous, we soon begin to worry about how much of our available for- 
tune to bet at every toss. Betting everything we have on heads on every toss 
will lead to almost certain bankruptcy. On the other hand, if we bet a small, 
but fixed, fraction (we assume throughout that money is infinitely divisible) of 
our available fortune at every toss, then the law of large numbers informs us 
that our fortune converges almost surely to plus infinity. What to do? 

More generally, let X be a random variable taking values in the set 

= {1, --+,s} such that P{X = i} = p; and let there be a class C of subsets 
A; of I, where @ = {A,,---,A;}, with U; A; = I, together with positive 
numbers (0i, +*+ , 0,). We play this game by betting amounts fi, --- , 8, on the 
events (X C Aj) and if the event (X = i} is realized, we receive back the 
amount 27;c4; 8,0; where the sum is over all 7 such that ? € A;. We may assume 
that our entire fortune is distributed at every play over the betting sets ©, 
because the possibility of holding part of our fortune in reserve 1s realized by 
taking A1, say, such that A, = J, ando; = 1. Let S, be the fortune after n plays; 
we say that the game is favorable if there is a gambling strategy such that almost 
surely S, — o. We give in the next section a simple necessary and sufficient 
condition for a game to be favorable. 

How much to bet on the various alternatives in a sequence of independent 
repetitions of & favorable game depends, of course, on what our goal utility is. 
There are two criterions, among the many possibilities, that seem pre-eminently 
reasonable. One is the minimal time requirement, that is, we fix an amount z 
we wish to win and inquire after that gambling strategy which will minimize the 
expected number of trials needed to win or exceed x. The other is a magnitude 
condition; we fix at n the number of trials we are going to play and examine the 
size of our fortune after the n plays. 


This research was supported in part by the Office of Naval Research under Contract 
Nonr-222(53). 
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In this work, we are especially interested in the asymptotic point of view. 
We show that in the long run, from either of the two above criterions, there is 
one strategy A* which is optimal. This strategy is found as that system of betting 
(essentially unique) which maximizes E(log S,). The reason for this result is 
heuristically clear. Under reasonable betting systems S, increases exponentially 
and maximizing E(log S,) maximizes the rate of growth. 

In the second section we investigate the nature of A*. It is a conservative 
poliey which consists in betting fixed fractions of the available fortune on the 
various A;. For example, in the coin-tossing game A* is: bet a fraction p — g of 
our fortune on heads at every game. It is also, in general, a policy of diversifica- 
tion involving the placing of bets on many of the A; rather than the single one 
with the largest expected return. : 

The minimal expected time property is covered. in the third section. We show, ~ 
by an examination of the excess in Wald’s formula, thatathe desired fortune zi 
becomes infinite, that the expected time under A* to amass x becomes less than 
that under any other strategy. 

Section four is involved with the magnitude problem. The content here is that 
A* magnitudewise, does as well as any other strategy, and that if one picks a 
poliey which in the long run does not become close to A*, then we are asymptot- 
ically infinitely worse off. 

Finally, in section five, we discuss the finite (nonasymptotic) case for the 
coin-tossing game. We have been unsuccessful in our efforts to find & strategy 
which minimizes the expected time for z fixed, but we state a conjecture which 
expresses a moderate faith in the simplicity of things. It is not difficult, however, 
to find a strategy which maximizes P{S, 2 x} for fixed n, x and we state the 
results with only a scant indication of proof, and then launch into a comparison 
with the strategy A* for large n. 

The conclusion of these investigations is that the strategy A* seems by all 
reasonable standards to be asymptotically best, and that, in the finite case, it is 
suboptimal in the sense of providing a uniformly good approximation to the 
optimal results. 

Since completing this work we have been allowed to examine the most sig- 
nificant manuscript of L. Dubins and L. J. Savage [1], which will soon be pub- 
lished. Although gambling has been associated with probability since its birth, 
only quite recently has the question of gambling systems optimal with respect 
to some goal utility been investigated carefully. To the beautiful and deep results 
of Dubins and Savage, upon which work was commenced in 1956, must be given 
priority as the first to formulate systematically and solve the problems of optimal 
gambling strategies. We strongly recommend their work to every student of 
probability theory. 

Although our original impetus came from a different source, and although 
their manuscript is almost wholly concerned with unfavorable and fair games, 
there are a few small areas of overlap which I should like to point out and 
acknowledge priority. Dubins and Savage did, of course, formulate the concept 
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of a favorable game. For these games they considered the class of “fractionalizing 
strategies," which consist in betting a fixed fraction of one’s fortune at every 
play, and noticed the interesting phenomenon that there was a critical fraction 
such that if one bets a fixed fraction less than this critical value, then S, — © a.s. 
and if one bets a fixed fraction greater than this critical value, then S, — 0 a.s. 
In addition, our proposition 3 is an almost exact duplication of one of their 
theorems. In their work, also, will be found the solution to maximizing P (S, 2 x} 
for an unfavorable game, and it is interesting to observe here the abrupt dis- 
continuity in strategies as the game changes from unfavorable to favorable. 

My original curiosity concerning favorable games dates from a paper of 
J. L. Kelly, Jr. (2] in which there is an intriguing interpretation of information 
theory from a gambling point of view. Finally, some of the last section, in prob- 
lem and solution, is closely related to the theory of dynamie programming as 
originated by R. Bellman [3]. 


2. The nature of A* 


We introduce some notation. Let the outcome of the kth game be X, and 
R, = (Xn, 757, Xi). Take the initial fortune So to be unity, and S, the fortune 
after n games. To specify a strategy A we specify for every n, the fractions 
[PD e AMP] = Asa of our available fortune after the nth game, Sa, that 
we will bet on alternative Àj, --- , A, in the (n + 1)st game. Hence 


(2.1) Y AP? = 1, 
jal 


Note that À may depend on R,. Denote A = (XA, X, -+ -). Define the random 


nH 


variables V, by 


(2.2) V, = L APP 0j, X, = 1 
f$: i€AÀ 
so that Sayı = V44S,. Let W, = log Vn, so we have 
(2.3) log Sa = Wna + °- HW 
To define A*, consider the set of vectors X = (M, --- , A,) with r nonnegative 


components such that M + --- +A, = 1 and define a function W(X) on this 
space F by 


(2.4) W(X) = X p: log e Mo) 


The function W(X) achieves its maximum on § and we denote W = maxxes W(). 
PRoprosrioN 1. Let XO, XO be in F such that W = W(X) = WA), then 
for all i, we have Lica o; = Yea; N? 0, 
Pnoor. Let a, 8 be ositive numbers suc] At at a-4-8 —1. Then if 
A = ad) + B®), we have WÀ) € W. But by théjconcavity of log 
(2.5) W(X) 2 aW(X9) + BWA) 


with equality if and only if the conclusion of the proposition holds. 
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Now let À* be such that W = W(X*) and define A* as (A*, A*, ---). Although 
À* may not be unique, the random variables Wt, W3, --- arising from A* are by 


proposition 1 uniquely defined, and form a sequence of independent, identically 
distributed random variables. 

Questions of uniqueness and description of \* are complicated in the general 
case. But some insight into the type of strategy we get using A* 1s afforded by 

PROPOSITION 2. Let the sels Ai, +++ , Ay be disjoint, then no matter what the 
odds 0; are, X* is given by M = P(X C Aj). 

The proof is a simple computation and is omitted. 

From now on we restrict attention to favorable games and give the following 
criterion. 

PROpOsITION 3. A game is favorable if and only if W > 0. 

PRoor. We have 


(2.6) log St = È Wt. 


If W = EWtis positive, then the strong law of large numbers yields Si — © a.s. 
Conversely, if there is a strategy A such that S, — œ a.s. we use the result of 
section 4, which says that for any strategy A, lim, S,/S% exists a.s. finite. Hence 
Sa  * a.s. and therefore W = 0. Suppose W = 0, then the law of the iterated 
logarithm comes to our rescue and provides a contradiction to Si œ. 


3. The asymptotic time minimization problem 


For any strategy A and any number z > 1, define the random variable T (x) by 
(3.1) T(x) = (smallest n such that S, 2 2}, 


and T*(x) the corresponding random variable using the strategy A*. That is, 
T (x) is the number of plays needed under A to amass or exceed the fortune z. 
This section is concerned with the proof of the following theorem. 

THEOREM 1. If the random variables Wt, W$, --- are nonlattice] then for any 
strategy 


(3.2) lim [ET (x) — ET*(z)] = yÈ (W — EW.) 


and there is a constant a, independent of A and x such that 
(3.3) ET*(z) — ET (z) S a. 


Notice that the right side of (3.2) is always nonnegative and is zero only if A 
is equivalent to A* in the sense that for every n, we have W, = Wi. The reason 
for the restriction that W; be nonlattice is fairly apparent. But as this restriction 
is on log Vi rather than on V3 itself, the common games with rational values of 
the odds o; and probabilities p; usually will be nonlattice. For instance, a little 
number-theoretic juggling proves that in the coin-tossing case the countable set 
of values of p for which W% is lattice consists only of irrationals. 
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The proof of the above theorem is long and will be carried out in a sequence 
of propositions. The heart is an asymptotie estimate of the excess in Wald's 
identity [4]. 

Proposirion 4. Let Xj, Xz, +++ be a sequence of identically distributed, inde- 
pendent nonlattice random variables with 0 < EX, < o. Let Y, = Xit ^: $ X. 
For any real numbers x, £, with £ > 0, let F(t) = P{first Y, Z x is <z +g. 
T hen there is a continuous distribution G(£) such that for every value of £, 


(3.4) lim FE) = G(9. 


Proor. The above statement is contained in known results concerning the 
renewal theorem. If X, > 0 a.s. and has the distribution function F, it is known 


(see, for example, [5]) that lim; F.(é) = (1/EX) h * t1 — F(0] dt. If Xi is 


not positive, we use a device due to Blackwell [6]. Define the integer-valued 
random variables nı < n» < +++ by n, = {first n such that Xy + --- + X, > 0}, 
m = {first n such that Xn,41 + +--+ + X, > 0}, and so forth. Then the random 
variables Xi = Xi + --> + Xa, Xà = Xii ccc + Xm, + areindependent, 
identically distributed, positive, and EX{ < œ (see [6]). Letting Yn = Xi + -> 
+ Xi, note that P {first Y, z ris < xz + ¢} = P(first Yi z x is «z-4- §, 
which completes the proof. 

We find it useful to transform this problem by defining for any strategy A, 
a random variable N(y), 


(3.5) N(y) = (smallest n such that W, + --- + Wi 2 v) 
with N*(y) the analogous thing for A*. To prove (3.2) we need to prove 


(3.6) lim [EN (y) — EN*(y)] = 5 x (W — EW,), 


and we use a result very elose to Wald's identity. 
Proposition 5. For any strategy A such that Sn — © a.s. and any y 


GD EN) = g EX — EQTARI-]] + E [E w] 


Proor, The above identity is derived in a very similar fashion to Doob's 
derivation [6] of Wald’s identity. The difficult point is an integrability condition 
and we get around this by using, instead of the strategy A, a modification A; 
which consists in using A for the first J plays and then switching to A*. The 
condition S, — © a.s. implies that none of the W, may take on the value — œ 
and that N(y) is well defined. Let N;(y) be the random variable analogous to 
N(y) under A; and WP’ to W;. Define a sequence of random variables Za by 


(3.8) Zn = x [WP — E(W£^|R,..2)]. 


This sequence is a martingale with EZ, = 0. By Wald’s identity, EN;(y) < © 
and it is seen that the conditions of the optional sampling theorem ([7], theorem 
2.2-C3) are validated with the conclusion that EZy, = 0. Therefore 
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Ny 

(3.9) WEN, = E | > "| 
kel 


= E43. [W - EGER 9] E | 5 we | 


min(N,J) 


=EL E WER} E| x we] 


'The second term on the right satisfies 


Ny 
(3.10) y<E LX wel 5 y a, 


where a = max; (log oj). Hence, if EN = ©, then lim; £N; = œ, so that 
EE’ [W — E(W4|R4..))]) = © and (3.7) is degenerately true. Now assume 
that EN < œ, and let J — œ. The first term on the right in (3.9) converges to 
E($7T [W — E(W,|R,,)] monotonically. The random variables EY Wf” 
converge a.s. to $71 Wp and are bounded below and above by y and y + a so 
that the expectations converge. It remains to show that lim; EN; = EN. Since 


(3.11) EN; = five, NP + usn Ny; dP, 


we need to show that the extreme right term converges to zero. Let 


J n4-J 
(3.12) U,- 2 Wr, N(U;) = {first n such that X WP zy — Uz} 
HA 


so that 


(3.13) mo N,dP = JP(N > J) + Tus N(U;) dP. 


Since EN < o, we have lim; JP{N > J} = 0. We write the second term as 


E{E[N(Uy)|Us]|N > J} P(N > J}. By Wald's identity, 


(3.14) BIN(UD|U) s Hts. 


On the other hand, since the most we can win at any play is a, the inequality 


(3.15) N= eats +J 


holds on the set {N > J}. Putting together the pieces, 


Q 


(3.16) Í N(U;) dP syf (N — D dP + S P(N » J). 
{X >J} W {N>J] W 


The right side converges to zero and the proposition is proven. 
If we subtract from (3.7) the analogous result for A* we get 


(3.17) EN(y) — EN*(y) 


- 5 EX [W — EPIS 23 + we [x W - = 


ws | . 
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This last result establishes inequality (3.3) of the theorem. As we let y > œ, 
then N(y) — œ a.s. and we see that 


N E 
(3.18) lim g b» [W — EQVA[E 4) ) = È} (W - EW»). 


By proposition 4, the distribution F% of $01" Wt — y converges, as y — %, to 
some continuous distribution F* and we finish by proving that the distribution 
F, of ŁY W, — y also converges to F*. 

Proposition 6. Let Yn, en be two sequences of random variables such that 
Y, 9, Y, + & — œ as. If Z is any random variable, if ¢ = supnzi |e|, and 


af we define 


(3.19) HE) = P{frsttY, 2 Z24+yis <Z+y4+ 8, 
(3.20) DIE) = P{first Yn tea 2Z+yis<Z+yt 6, 
then for any u > 0, 

(3.21) 


Fayu(E — 2u) — P(e 2 u} € DIE) < D,E) € Hyl t 2u) + Pie 2 uj. 

Proor. 
(3.22) 

D,E) € P(first Y, tazZ+tyis<Zt+ytiéie<us + Pez u} 
SPffrstY,>Z+ty-uis<Zt+ytétue<us +Ple 2 uj} 
€ H, (£t 2u) + P{e 2 uj. 

(3.23) 

D, = P {first Y¥,te2Z+yis<Ztyt&e<u} 

>Pf{firss¥,2Z+ytuis<Z+ytét-—ue<u} 
2 Hy( — 2u) — Pie 2 uj. 

Proposition 7. Let Xi, Xs, --+ be a sequence of independent identically dis- 
tributed nonlattice random variables, O < EX, < c, with Y, = Xi + c + Xa. 
If Z is any random variable independent of Xi, Xs, +++ , G the limiting distribu- 
tion of proposition 4, and 
(3.24) F,z(E = P(first Y, Z Z2+yis «Zo yc 8, 
then lim, F,z(£) = G(£). 

Pnoor. 

(3.25) Fyz(E) = E[P {first Y, 2 Zd- yis < Z-F y t lz] 
= E(F,z(], 


where F,(£) = P {first Yn = yis < y + £}. But lim, Fy,z(£) = G(é a.s. which, 
together with the boundedness of F,,2(£), establishes the result. 
We start putting things together with 
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Proposition 8. Let $71 Wy — Dt Wi converge a.s. to an everywhere finite limit. 
If the Wi are nonlattice, if F,(£) is the distribution function for $C W, — y, then 
lim, Fy(é) = F*(é). 

Proor. Fix m, let 


m-—1 n n 
(83.00 Zm=— E We mn = D W- EWE = SUP |ém nl, 


and by assumption e, — 0 a.s. Now 


(3.27) F(t) = P {first 2 Wrz yis <y +ë 


= Pss (È W + ena) 2 Zm + yis < Zm +y + g}. 
If 7 
(328) y(t) = P{first È Wł 2 Zn +y is < Za +y + 8, 
then by proposition 6, for any u > 0, 
(3.29) . Hy4(£ — 2u) — P {em 2 u) S F) S Hy(£ + 2u) + P(e, 2 u). 
Letting y — « and applying proposition 7, 
(3.30) 
F*(£ — 2u) — P {em Z u} S lim FẸ) € lim P.) S F*(£ + 2u) + P {en 


v 


Iv 


Taking first m — œ and then u — 0 we get 
(3.81) lim F,() = lim F,() = F*(). 
y y 


To finish the proof, we invoke theorems 2 and 3 of section 4. The content we use 
is that if $7? Wi — $53 Wt does not converge a.s. to an everywhere finite limit, 
then XT [W — E(W,|Ri-1)] = +% on a set of positive probability. Therefore, 
if the conditions of propositions 5 and 8 are not validated, then by (3.17) both 
sides of (3.2) are infinite. Thus the theorem is proved. 


do 
l 3. Asymptotic magnitude problem 


The main results of this section can be stated roughly as: asymptotically, S$ 
is as large as the S, provided by any strategy A, and if A is not asymptotically 
close to A*, then S% is infinitely larger than S4. The results are valid whether or : 
not the games are favorable. 

THEOREM 2. Let A be any strategy leading to the fortune S, after n plays. Then. 
lim, S,/$ exists a.s. and E(lim, 84/82) S 1. 

For the statement of theorem 3 we need 

DEFINITION. A is a nonterminating strategy if there are no values of X, such 
that Sica, N” o; = 0, for any n. 
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THEOREM 3. Jf A ts a nonterminating strategy, then almost surely 


(4.1) Y [W — EWR] = 9 e lim Š = 0, 
1 n n 


Proors. We present the theorems together as their proofs are similar and 
hinge on the martingale theorems. For every n 
(4.9) z(S 


-p/i Sri, 

Sta) = B (R) et 

If we prove that E(V.,/Vz|R.) € 1 a.s., then S,/S* is a decreasing semi- 
martingale with lim, S,/S% existing a.s. and 


S, S 
(4.3) Elm SEG = 1 
By the definition of A*, for every « > 0, 
(4.4) E {log (1 — e) V% + «V,] — log Vz| E) S 0. 
Manipulating gives 

1 € Va 1 1 
(4.5) : E [og (1 + in. v) Ra z : log 7 —. 
By Fatou's lemma, as e — 0 
Va _ _ f1 Va 

(4.6) E A Ra) -E [tim (: los 14 DIL | F) E- | 

1 

< lim z log 1—.7 1. 


Theorem 3 resembles a martingale theorem given by Doob ([6], pp. 323-324), 
but integrability conditions get in our way and force some deviousness. Fix a 
number M > 0 and take A to be the event {W — E(W,|R, 3) 2 M i.o.). If 
p = min; p; then E(Wz — W,|R,.;) z M implies P{W} — Wn = M|R.a) = p. 
By the conditional version of the Borel-Cantelli lemma ([7], p. 324), the 
set on which »7T P{W} — Wna = M|R,1} = © and the set (Wi — Wn = Mio.) 
are a.s. the same. Therefore, a.s. on A, we have Wz — W, 2 M io. and 
log (82/84) = 3:1 (Wi — W+) cannot converge. We conclude that both sides of 
(4.1) diverge a.s. on A. 

Starting with a strategy A, define an amended strategy Am by: if W — 
E(W,|R, 4) < M, use A on the nth play, otherwise use A* on the nth play. The 
random variables 


(4.7) U, = log s- È [W - EWP Ra) 


form a martingale sequence with 


(4.8) Un — Una = Wi — Wi? — [W — EQVS^?|R. )]. 
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For Au, we have E(Wz — Wz?|R,.4) < M, leading to the inequalities, 
(1.9) sup (Ws = WP)s^5  UQ- Un s 

P 


On the other side, if 
(4.10) a = min log (È No;), 8 = max log 0j, 
j 1G A; j 


then U, — Un) 2 a — B — M. These bounds allow the use of a known martin- 
gale theorem ([7], pp. 319-320) to conclude that lim, Un exists a.s. whenever 
one of hm U, « «, lim U, > — is satisfied. This implies the statement 


" 
(4.11) lim im 
However, on the complement of the set A the convergence or divergence of the 
above expressions involves the convergence or divergence of the corresponding 
quantities in (4.1) which proves the theorem. 

COROLLARY 1. If for some strategy A, we have } Y [W — E(W,|Ra i JJ 29 
with probability y > 0, then for every € > 0, there is a strategy A such that with 
probability at least y — e, lim S n/Sn = Oand except for a set of probability at most e, 
lim S,/$, S 1. 

Proor. Let E be the set on which lim S,/S% = 0, with P(E) = y. For any 
e > 0, for N sufficiently large, there is a set Hy, measurable with respect to the 
field generated by Ry such that P(Ey AE) < e, where A denotes the symmetric 
set difference. Define A as follows: if n < N, use A, if Ra, with n = N, is such 
that the first N outcomes (Xi, +> , Xy) is not in Hy, use A, otherwise use A*. 
On Ew, we have Dor [W — BWR.) « o, hence lim $,/8: > 0 so that 
lim S,/$, = 0 on Ex N E. Further, P(Ey N Ez 2 P{E} — e = y — e. On the 
complement of Ey, we have S, = 8,, leading to lim S,/S, < 1, except for a set 
with probability at most e. . 


<o 2 [W — BOW |Rya)] < o. 


5. Problems with finite goals in coin tossing 


In this section we consider first the problem: fix an integer n > 0, and two 
numbers y > z > 0, find a strategy which maximizes P{S, 2 y|So = x}. In 
this situation, then, only n plays of the game are allowed and we wish to maxi- 
mize the probability of exceeding a certain return. We will also be interested in 
what happens as n, y become large. By changing the unit of money, note that 


(5.1) Sup PS, 2 y|So = x} == sup Ps. z 1|So = zh 


where the supremum is over all strategies. Thus, the problem reduces to the 
unit interval, and we may evidently translate back to the general case if we find 
an optimum strategy in the reduced case. Define, for £ 2 0, n 2 1, 
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sup P{S, 2 1|So = 8, E< 1, 

5.2 n = 
(5.2) ex 5 = 
and 
(5.3) ee oif] 

' Pots) = 1, €21. 
In addition ¢,(€) satisfies 
(5.4) on(t) = sup E[P(S, 2 1|$, Sc} |S = E 


< sup Es a(81)|So = £] 
sup [pé«a(£ + 2) + qe» (E — 2)]. 


0szSt 


IA | 


To find &«(£) and an optimal strategy, we define functions és(£) by 
(5.5) lE) = ew(£), é.(E) = owe, [pfn (E + 2) + gós a(£ — 2)] 


having the property $.(£) € ¢,(é), for all n, £. If we can find a strategy A such 
that under A we have ¢a(t) = P{S, 2 1|So = £}, then, evidently, A is optimum, 
and én = $,. But, if for every n 2 1, and £ there is a z,(£), with O S z,(£) S &, 
such that 


(5.6) $«(£) = Dé. alt + zn(€)] + gén—ilé = zn(€)], 
then we assert that the optimum strategy is A defined as: if there are m plays 
left and we have fortune £, bet the amount z4(£). Because, suppose that under A, 
forn = 0,1, ---,m we have $al) = P(S, Z 1|So = £}, then 


(5.7) P(S,4 2 1|$ = & = E[P (S. Z 118, So}|So = E] 
= CACAN = £j = mpl). 


Hence, we need only solve recursively the functional equation (5.5) and then 
look for solutions of (5.6) in order to find an optimal strategy. We will not go 
through the complicated but straightforward computation of $.(£). It can be 


described by dividing the unit interval into 2" equal intervals I, ++- , Izn such 

that I, = [k/2", (k + 1)/27]. In tossing a coin with P(H) = p, rank the prob- 

abilities of the 2” outcomes of n tosses in descending order P, 2 Pj 2 -e 2 Pa, 

that is, P, = p", p». = q”. Then, as shown in figure 1, . 

(5.8) $.(£) = P» P; . E c I, 
j 


Note that if p > 1/2, then lim, $,(£) = 1, with £ > 0; and in the limiting case 
p = 1/2, then lim, $,(£) = £ with £ S 1, in agreement with the Dubins-Savage 


result [2]... 
There are many different optimum strategies, and we describe the one which 
seems simplest. Divide the unit interval into n + 1 subintervals I$", -.- , I, 


n 


such that the length of I$? is a(r) where the , 


) are binomial coefficients. On 
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p (€) 


FIGURE 1 


Graph of ¢n(t) for the case n = 3. 


each 7f? as base, erect a 45?—45? isosceles triangle. Then the graph of zn41(€) is 
formed by the sides of these triangles, as shown in figure 2. Roughly, this 
strategy calls for a preliminary “jockeying for position," with the preferred posi- 
tions with m plays remaining being the midpoints of the intervals Zf”. Notice 
that the endpoints of the intervals (If?) form the midpoints of the intervals 
U£-). So that if with n plays remaining we are at a midpoint of (I1?) , then 
at all remaining plays we will be at midpoints of the appropriate system of 
intervals. Very interestingly, this strategy is independent of the values of p so 


Z, (£) 
! 


4 


— 


alot 


FIGURE 2 


Graph of zn4.(&) for the case n = 3. 
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long as p > 1/2. The strategy A* in this case is: bet a fraction p — q of our 
fortune at every play. Let $3(£) = P{Si = 1|So = &. In light of the above 
remark, the following result is not without gratification. 

THEOREM 4. lima supe [¢n(é) — AE] = 0. 

Proor. The proof is somewhat tedious, using the central limit theorem and 
tail estimates. However, some interesting properties of ¢,(£) will be discovered 
along the way. Let P{k|1/2} be the probability of k or fewer tails in tossing a 
fair coin n times, P {k|p} the probability of k or fewer tails in n tosses of a coin 
with P{H} =p. If £ = P(k|1/2) + 27^, note that $«(E—) = P{klp}. Let 
c = V pq, by the central limit theorem, if fnn = P{gn + te V/n|1/2) + 27", 
then 


t 
(5.9) lim 4s (s) = J [. en #2 da, 
uniformly in ¢. Thus, if we establish that 
. 1 t 
(5.10) lim i5) = = Í. 67? dz 
uniformly for ¢ in any bounded interval, then by the monotonicity of $«(£), éx(£), 


the theorem will follow. 
By definition, 


(5.11) ex(£) 


P(QWi + +--+ + Wi z oWo 
log & = P{Wi+~---+ Wie —logg, 
where the Wi are independent, and identically distributed with probabilities 


P{W = log 2p} = p and P{W = log 2g} = q. Again using the central limit 
theorem, the problem reduces to showing that 


: I g En t + nEWi 
5.12 lim : =t 
(5.12) a Vno(WY) 
uniformly in any bounded interval. By a theorem on tail estimates [8], if 
Xi, Xo, +++ are independent random variables with P(X, = 1} = 1/2 and 
P{X, = 0} = 1/2, then 
(5.13) log P(Xi +e t Xn 2 na} = n6(a) + n(n, a) log n, 


where u(n, a) is bounded for all n, with 1/2 + è a £1 -— 5, and l(a) = 
—alog (2a) — (1 — a)log[2(1 — a)]. Now 


(5.14) log n = log [P{X + --- +X, 2 np— lo V/n) + 277] 
so that the appropriate a = p — te/ V/n with 


(5.15) ala) = &(p) — LIE +0 G) 


' Since 6(p) > —log 2, we may ignore the 277 term and estimate 
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(5.16) log £n: = 20(p) — ton log 1 + O(log n). 
But (p) = —EWT, and the left-hand expression in (5.12). becomes 
io log P 
(5.17) 1,0 (525). 
o(WY Vn 


Now the short computation resulting, o(Wt) = c log (p/q), completes the proof 
of the theorem. 
There is one final problem we wish to discuss. Fix £, with 0 < & < 1, and let 


(5.18) T(t) = E(first n with S, = 1\So = 8, 


find the strategy which provides a minimum value of 7T(£). We have not been 
able to solve this problem, but we hopefully conjecture that an optimal strategy 
is: there is a number &, with O < & < 1, such that if our fortune is less than £, 
we use A*, and if our fortune is greater than or equal to &, we bet to 1, that is, 
we bet an amount such that, upon winning, our fortune would be unity. 
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OPTIMAL GAMBLING SYSTEMS FOR FAVORABLE GAMES? 
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E. O. Thorp 
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INTRODUCTION 


In the last decade it was found that the player may have the advantage in some 
games of chance. We shall see that blackjack, the side bet in Nevada-style Baccarat, 
roulette, and the wheel of fortune all may offer the player positive expectation. The 
stock market has many of the features of these games of chance [5]. It offers special 
situations with expected returns ranging above an annual rate of 259; [23]. 

Once the particular theory of a game has been used to identify favorable situations, 
we have the problem of how best to apportion our resources. Paralleling the discoveries 
of favorable situations in particular games, the outlines of a general mathematical 
theory for exploiting these opportunities has developed [2, 3, 10, 13]. 

We first describe the favorable games mentioned above, those being the ones with 
which the author is most familiar. Then we discuss the general mathematical theory, 
as it has developed thus far, and its application to these games. Detailed knowledge 
of particular games is not needed to follow the exposition. Each discussion of a 
favorable game in Part I motivates a concluding probabilistic summary of that game. 
These summaries suffice for the discussion in Part II so that a reader who has no 
interest in a particular game may skip directly to the summary. 

References are provided for those who wish to explore particular games in detail. 
For the present, a favorable game means one in which there is a strategy such that 
P (lim S, = oo) > 0 where S, is the player's capital after n trials. 


PART I. FAVORABLE GAMES 
1, BLACKJACK 


Blackjack, or twenty-one, is a card game played throughout the world. The casinos 
in Nevada currently realize an annual net profit of roughly eighty million dollars 
from the game. Taxing a price/earnings ratio of 15 as typical for present day common 
stocks, the Nevada blackjack operation might be compared to a $ 1.2 billion corpo- 
ration. 

To begin the game a dealer randomly shuffles n decks of cards and players place 
their bets. (The value of n does not materially affect our discussion. It generally is 
1, 2, or 4, and we shall use 1 throughout.) There are a maximum and a minimum 
allowed bet. 

The minimum insures a positive probability of eventual ruin for the player who 
continues to bet. The maximum protects the casino from large adverse fluctuations 
and in particular prevents the game from being beaten by a martingale (e.g. doubling 
up), especially one starting with a massive bet. In fact, without a maximum, a casino 


1 The research for this paper was supported in part by the Air Force Office of Scientific Research 
through Grant AF-AFOSR 1113-66. 

The paper is intended in large part to be an exposition for the general mathematical reader with some 
probability background, rather than for the expert. 
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with finite resources can in general be ruined a.s. (almost surely) by a player with 
infinite resources. The player simply bets enough at each trial so that the casino is 
ruined if it loses that trial. A practical way for the player to have infinite resources 
would be for the casino to extend unlimited credit for the finite time it might be needed. 

The players' hands are dealt after they have placed their bets. Each player then uses 
skill in his choice of a strategy for improving his hand. Finally, the dealer plays out 
his hand according to a fixed strategy which does not allow skill, and bets are settled. 
In the case where play begins from one complete randomly shuffled deck, an approxi- 
mate best strategy (i.e. one giving greatest expected return) was first given in 1956 [1]. 

Though the rules of blackjack vary slightly, the player following [1] typically has 
the tiny edge of + .10 77. (The pessimistic figure of — .62 % cited in [1] was erroneous 
and may have discouraged the authors from further analysis.) These mathematical 
results were in sharp contrast to the earlier and very different intuitive strategies 
generally recommended by card experts, and the associated player disadvantage of 
two or three per cent. We call the best strategy against a complete deck the basic 
strategy. Determined in 1965, it is almost identical with the strategy in [1] and it gives 
the player an edge of + 0.13% [22]. 

If the game were always dealt from a complete shuffled deck, we would have 
repeated independent trials. But for compelling practical reasons, the deck is not 
generally reshuffled after each round of play. Thus as successive rounds are played 
from a given deck, we have sampling without replacement and dependent trials. It is 
necessary to show the players most or all of the cards used on a given round of play 
before they place their bets for the next round. They can then use this knowledge of 
which cards have been played both to sharpen their strategy, and to more precisely 
estimate their edge. (The strategies for various card counting procedures, and their 
expectations, were determined directly from probability theory with the aid of com- 
puters. The results were reverified by independent Monte Carlo calculations.) 

For a given card counting procedure and associated strategy, there is a probability 
distribution F, describing the player's expectation on the next hand, provided c cards 
have been counted. As c increases, F, spreads out. (This is a theorem, whose proof 
resembles that for the similar theorem in Baccarat, mentioned in [24], page 316). 
This spread in F, can be exploited by placing large bets when the expectation is posi- 
tive and small bets when it is negative. Part II indicates how best to do this. 

If the basic strategy is always used, E( F.) = + 0.135, just as from a complete deck. 
But if an improved strategy, based on the card count, is used, E(F.) increases as c 
increases, approaching values of one to two per cent or more. 

Ties, in which no money is won or lost, may be discounted. They occur about one 
tenth of the time. Most, but not all, of the other outcomes result in the player either 
winning or losing an amount equal to his original bet. 

The conditional means E(F, | F..,); k = 1, 2, ..., c, of the successive F, are non- 
decreasing. The F, are dependent; in particular when a deck “goes good”, it tends 
to stay good. 


Probabilistic summary 

To a good first approximation, Blackjack is a coin toss where the probability p of 
success is selected independently on each trial from a known distribution F (which 
is a suitably weighted average of the F,) and announced before each trial. 

A more accurate model considers that the p's are dependent in short consecutive 
groups, corresponding to successive rounds of play from the same deck. Another 
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more accurate observation is that insurance, naturals, doubling down, and pair 
splitting, each win or lose an amount different from the amount initially bet. We do 
not consider this more accurate model in part II because the improvement in results 
is slight and the increase in complexity is considerable. 


2. BACCARAT 


The terms Baccarat and Chemin de Fer are used, sometimes interchangeably, to 
refer to several closely related variants of what is essentially one card game. The game 
is currently popular in England and France, where it is sometimes played for un- 
limited stakes. It is also played in Nevada. The game-theoretic aspects of Baccarat 
have been discussed in [11, 14]. The Nevada game is analyzed in [24] which includes 
results of extensive computer calculations. 

The studies of Baccarat show that the available bets generally offer an expectation 
on the order of —1 %. The use of mixed strategies, to the very limited extent that this 
is possible in some variants of the game, has but slight effect on the expectation. 
Despite the resemblances between Baccarat and Blackjack, the favorable situations 
detected by perfect card counting methods are not sufficient to make the game 
favorable. Thus Baccarat is not in general a favorable game. 

The game as played in Nevada sometimes permits certain side bets. The minimum 
on the side bets was observed to be $ 5 to $ 20 and the maximum was $ 200. The bets 
either won nine times the amount bet or lost the amount bet. The game was played 
with eight well shuffled decks dealt from a dealing box, or shoe. Using the card 
counting techniques described in [24], the side bets were favorable about 20% of the 
time. When they were favorable, the expectations ranged as high as + 100%. The 
expectation initially was about — 5% and as the number c of cards seen increased, 
the distribution F, of expectations spread out ([24], page 316) as in Blackjack. In 
practice the betting methods discussed in part II, in which the bet increased with the 
expectation, doubled initial capital in twenty hours. 

Unlike the Blackjack player, the Baccarat side bettor has no strategic decisions to 
make so E(F,) does not vary as c changes. When the expectation of the side bet falls 
below a certain value, it is best to make a "waiting" bet on one of the main bets. 
There are either two or four side bets, similar and dependent. How to apportion funds 
on the side bets is complicated by the fact that there are several of them. These com- 
plexities are treated in [24]. 


Probabilistic summary 


When only one side bet is available, the pay-off for a one unit side bet is either + 9 
or — |. If p is the probability of success, we may suppose that p is selected indepen- 
dently from a known distribution F and announced before each trial. When several 
side bets are available, the situation is more complex. It illustrates the general setting 
of [3], page 65. 

As in Blackjack, a more accurate model considers that the p's are dependent in 
consecutive groups, corresponding to successive rounds of play from the same en- 
semble of (eight) decks. It also considers the effect of waiting bets. 

The situation here is more complex than in Blackjack. First, it is important to 
exploit any opportunities of making simultaneous bets on two or more favorable side 
bet situations. Second, the pay-off is never one to one. 
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3. ROULETTE 


Roulette has long been the prototype of unbeatable gambling games. It is normally 
regarded as a repeated independent trials process which generates at each trial 
precisely one from a set of random numbers. In Monte Carlo these numbers are 0, 
1, 2, ..., 36. Players may wager on particular conventional subsets of random num- 
bers (e.g. the first dozen, even, {27}, etc.), winning if the number which comes up is 
a number of the chosen subset. A player may wager on several subsets simultaneously, 
and each bet is settled without reference to the others. The expectation of each bet is 
negative (in Nevada generally — 5.26%, except for one worse bet, and in Monte 
Carlo — 1.35%.) Thus it has been long known that the classical laws of large numbers 
insure that the player will with probability one fall behind and stay behind, tending 
to lose in the long run at a rate close to the expectation of his bets. 

Despite this, Henri Poincaré and Karl Pearson each examined roulette. Poincaré 
([20], pages 69—70, pages 76—77; [21], pages 201—203; [9], pages 61-62) supposes that 
the uncertainty in initial conditions (e.g. the angular position and velocity of the ball 
and of the rotor at a given time) leads to a continuous probability density f in the 
ball’s final position. He shows by an argument involving continuity only that if f has 
sufficient spread, then the finitely many final ball positions are to very high approxi- 
mation equally likely. 

Karl Pearson statistically analyzed certain published roulette data and found very 
significant patterns. In particular Pearson says, "If Monte Carlo roulette had gone 
on since the beginning of geological time on this earth, we should not have expected 
such an occurrence as this fortnight's play to have occurred once on the supposition 
that the game is one of chance." And again, “To sum up, then: Monte Carlo roulette 

is... the most prodigious miracle of the nineteenth century." I’ve been told that 
it was later learned that the roulette data was supplied for a newspaper by journalists 
hired to sit at the wheel and record outcomes. The journalists instead simply made up 
numbers and submitted them. It was their personal bias that Pearson detected as 
statistically significant. 

It brings to mind David Hume's essay Of Miracles: “No testimony is sufficient to 
establish a miracle, unless the testimony be of such a kind that its falsehood would be 
more miraculous than the fact that it endeavors to establish. . . . it is nothing strange 
... that men should lie in all ages." 

Poincaré assumed a mechanically perfect roulette wheel. However, wheels some- 
times have considerable bias due to mechanical imperfections. Some observed in- 
stances and their exploitation are discussed in detail in [25]. 

In Blackjack and Baccarat, we used the following fundamental principle: The 
payoff random variables, hence the favorability of a game to an optimal player, depend 
on the information set used to determine the optimal strategy. For instance, if used 
cards are ignored in Blackjack, then we simply have Bernoulli trials with p = 
+ 0.13%. However, as more card counting information is employed, the distribution 
of p spreads out (has more structure), its expected value increases, and it can be more 
effectively exploited. The roulette system we now describe illustrates the use of an 
enlarged information set. 

Play at roulette begins when the croupier launches the ball on a circular track which 
inclines towards the center so the ball will fall into the center when it slows down 
sufficiently. The center contains a rotor with a circle of congruent numbered pockets 
rotating in the opposite direction to the ball. The ball eventually slows and falls from 
its track on the stator, spiralling into the moving rotor and eventually coming to rest 
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in a numbered pocket, the “winning number". Bets may generally be placed until the 
ball leaves its track. This is crucial for what follows. 

A collaborator and I tried to use the mechanical perfection of the wheel - the very 
perfection needed to eliminate the bias method — to gain positive expectation. Our 
basic idea was to determine an initial position and velocity for the ball and rotor. We 
then hoped to predict the final position of the ball in much the same way that a 
planet's later position around the sun is predicted from initial conditions, hence the 
nickname "the Newtonian method". 

The Newtonian method occurred to me in 1957, and by 1961 the work described 
here had been completed. Although the wheel of fortune device was mentioned in 
LIFE Magazine, March 27, 1964, pp. 80-91, we pointedly did not mention the roulette 
work there. However, we do so in [22], page 181-182. The Newtonian method is also 
mentioned in the significant book by R. A. Epstein, The Theory of Gambling and 
Statistical Logic, Academic Press, pp. 135-136, (1967). 

The stator has metal deflectors placed to scatter the ball when it spirals down and 
the pockets are separated by vertical dividers (“frets”) which also introduce scattering. 
These scatterings were measured and found to be far from sufficient to frustrate the 
Newtonian approach. However, there were additional sources of randomness which 
did frustrate this approach. (We never satisfactorily identified these causes and can 
only speculate — perhaps the causes included minute imperfections in track or ball or 
high sensitivity of the coefficient of friction to dirt or atmospheric humidity.) 

We were led to a variation we called the quantum method. If a roulette wheel is 
tilted slightly the ball will not fall from a sector of the track on the “high” side. The 
effect is strong with a tilt of just 0.2°, which creates a forbidden zone of a quarter to a 
third of the wheel. The non-linear differential equation governing the ball's motion 
on the track is the equation for a pendulum which at first swings completely around 
its pivot, but is gradually slowed by air resistance. (It is illuminating to sketch the 
orbits of the equation, as indicated in [4], page 402, problem 3.) The experimental 
orbits of angle versus time could be plotted easily in the laboratory by taking a movie 
of the system in motion, along with a large electric clock whose hand swept out one 
revolution per second! 

The existence of a forbidden zone partially quantizes the angle at which the ball 
can exit, and hence quantizes the final angular position of the ball on the rotor. The 
physics involved suggests that the quantization is in fact very sharp: Suppose the ball 
is going to exit beyond the low point of the tilted wheel. Then it must have been 
moving faster than a ball exiting at the low point, so it reaches its destination sooner. 
But it has also gone farther, and the two effects tend to cancel. They in fact cancel 
very well. A similar argument shows that balls which exit before the low point have 
been slower, hence later, offsetting the fact they have not gone as far. Observation 
verifies the conclusions of this heuristic argument. 

The sharp quantization of ball final position, as a function of initial conditions, 
makes remarkably accurate prediction possible. 

Using algorithms, it was possible by eye judgements alone to estimate the ball's 
final position three or four revolutions before exit (perhaps five to seven seconds 
before exit, which was ample time in which to bet) well enough to have a + 15% 
expectation on each of the five most favored numbers. A cigarette pack sized tran- 
sistorized computer which we designed and built was able to predict up to eight revo- 
lutions in advance. The expectation in tests was + 44%. 

One third of the Nevada roulette wheels which we observed had the desired tilt of 
at least 0.2°. The input to the computer consisted of four push-button hits: two when 
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the 0 of the rotor crossed a fiducial mark during successive revolutions and two when 
the ball crossed a fiducial mark on successive revolutions. The decay constants of ball 
and rotor, approximately constant over the class of wheels observed, had been deter- 
mined earlier by simple observations. 

The ultimate weakness of the system was that the house could foil it by forbidding 
bets after the ball had been launched. 


Probabilistic summary 


Roulette on a slightly tilted wheel is repeated independent trials. At each trial the 
player may wager on one or more subsets of the finitely many elementary outcomes. 
A wager on a subset wins if and only if it contains the elementary outcome that occurs. 
There are subsets with expectations of 44%. Our procedure in practice was to bet on 
one of eight neighborhoods of five numbers. Thus the payoff for a bet of .2 units on 
each of five numbers was either —1 or + 6.2. The expectation of + 44% corresponds 
to a probability of success of .2. We remark that our knowledge of p increases with 
the sample size. 


4, THE WHEEL OF FORTUNE 


The wheel of fortune, featured in many Nevada casinos, is a six foot vertical wheel 
with horizontal equally spaced pegs in its rim. As the wheel spins, a rubber flapper 
strikes successive pegs, slowing the wheel. There are generally 48 to 54 spaces between 
the pegs, numbered with 1s, 2s, 5s, 10s, 20s, and two distinct 40s. A player betting a 
unit on one of these outcomes is paid that number of units if his outcome occurs. The 
wheel behaves to good approximation as though a constant increment of energy is 
lost each time a peg passes the flapper. Thus 0, the total angle of rotation, is propor- 
tional to the energy E, which equals Jw”, where J is the moment of inertia and c is 
the angular velocity of the wheel. 

In practice, a transistor timing device of match box size (a "spinoff" from the 
roulette technology) produced a faint click a chosen time after a push-button was hit. 
The button was hit when a specified 40 passed the flapper. The timer was set so the 
click was approximately when the second 40 reached the flapper. If it clicked after the 
second 40 reached the flapper, the wheel was "fast" and would go farther than 
average before stopping. If it clicked before the second 40 reached the flapper, the 
wheel was “slow”. 

For a given timer setting, a table was constructed empirically, giving the approxi- 
mate final position of the wheel as a function of the number of spaces the second 40 
was fast or slow when the click was heard. 

In practice one could determine with certainty which of the two 40s could not occur. 
Thus, one could always bet on the “right” 40. On a wheel observed in the Riviera 
Hotel there were 50 numbers, including 22 ones, 14 twos, 7 fives, 3 tens, 2 twenties 
and 2 forties. Betting on the "right" 40 would win on average 80 units in 50 trials and 
lose 48, for an expectation of 32/50 or 64%. 


Probabilistic summary 


Ignoring obvious refinements, we have repeated independent trials with probability 
p = 1/25 of success at each trial, a payoff for a 1 unit bet of — 1 or + 40, and an 
expectation of + 64%. 
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5. THE STOCK MARKET 


The stock market is a natural economic object for mathematical analysis because 
vast quantities of precise historical data are available in numerical form. There have 
been many attempts to mathematically predict future price behavior, using as a basis 
various subsets of the available information. Most notable are the attempts to predict 
future prices from past price behavior. These attempts have caused the view to be 
widespread in academic circles that, to first order, common stock prices are a random 
walk and changes in common stock prices are log normally distributed with a certain 
mean and standard deviation [5]. 

Practitioners hotly contest this view. Part of the dispute is caused by practitioners 
who are unwilling or unable to test their claims scientifically and part of it is due to 
the success of a few practitioners who use much more information than past price 
history alone. A recent study suggests strongly that "relative strength" in a price series 
is continued and, consequently, that past prices do have some value in predicting 
future prices [16]. 

Whether or not we can predict the future course of stock prices!, there are invest- 
ments in combinations of securities which can yield high expected return [23]. These 
investments involve convertible securities. À convertible security is one which, in 
some cases with the addition of money, is exchangeable (per share) for a certain num- 
ber of shares of another security. Convertible securities include convertible bonds, 
convertible preferreds, stock options, stock rights, and warrants. There are several 
billion dollars worth of convertibles listed on the New York and American Stock 
Exchanges. 

The analysis of other convertibles follows from the analysis of the common stock 
purchase warrant. We therefore restrict ourselves to these in our discussion, and shall 
refer to them simply as warrants. 

A warrant is the right or option to buy a certain number of shares of common for 
a certain price, until a certain expiration date (warrants which do not expire are 
called perpetual). The terms ordinarily read: A warrants plus E dollars buy C shares 
until D date. To avoid normalization problems, we suppose A = C = 1. Then Æ is 
the "exercise price" of the warrant. The prices of warrant and common are related 
and it is this which allows successful investments. One observes: (1) The price W of 
the warrant should increase as the price S of the stock increases. (2) If W + E < S, 
warrants can be bought and common sold short, simultaneously. The warrants are 
then converted to common which is delivered against the short position. Neglecting 
commissions, a profit of S — W — E per warrant results. The purchase of warrants 
tends to increase W and the sale of common tends to decrease S, until W + E z S. 
Thus W = S - E normally holds. (3) The common has advantages over the warrant 
such as possible dividends, or voting rights, hence we also normally expect W < S. 

Thus for practical purposes points (S, W) representing (nearly) simultaneous prices 
of a common stock and its warrant are confined to the part of the positive quadrant 
between the lines W = S and W = S - EF. 

The prices W and SS at a future time are random variables but they are related. 
As E( S) increases we would expect, and past history verifies ([12, 23]), that E(W) 
tends to increase. In fact the points (S, W) tend to lie on certain curves which depend 


! The great mathematician Karl Friedrich Gauss was successful in the market but we have little 
knowledge of his methods. On a basic salary of 1000 thalers per year he left an estate in cash plus 
securities of 170,857 thalers ([7], page 237). 
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on several variables, most notably the time remaining until expiration of the warrant. 
Thus, although we may not know the price S of the common, or the price W of the 
warrant at a given future time, we do know that (W, S) is near one of these curves. 
The family of curves qualitatively resembles the family W = (S7 + E")! — E, where 
z = 1.3 + 5.3/T and T is the number of months remaining until expiration. 

A historical study of expiring warrants (from here on we limit ourselves for con- 
venience to warrants traded on the American Stock Exchange) suggests that during 
the last two years or so before expiration they tend to trade at prices which are much 
too high. For instance the average loss from buying each of a certain 11 listed war- 
rants 18 months before expiration and holding until 2 months before expiration was 
46.0%, an annual rate of 34.5% ((23], page 37). Thus selling warrants short seems to 
yleld high expectation. However, it also happens to result in occasional large losses 
which, by the criterion of Part II, are extremely undesirable despite the high overall 
expectation. We can sharply reduce this high variance and yet retain a high expecta- 
tion by using the so-called warrant hedge. The technique is to simultaneously sell 
short overpriced warrants and buy common in a fixed ratio (generally from one to 
three warrants will be shorted for each share of common bought). The position is 
beld until just before expiration of the warrant (at which time the warrant sells at a 
"correct" price) and then it is liquidated. 

Here is the rationale. We are mixing two investments with positive annual expecta- 
tions of say 34.5% for the warrants and 10% for the common, resulting in an invest- 
ment whose overall expectation must therefore be somewhere between these figures. 
(We suggest 1077 for the common because this approximates the observed mean rate 
of return from common stocks during this century due to price appreciation plus 
dividends.) Buying the common leads to a gain when the common rises and a loss 
when it falls whereas shorting warrants leads to a gain when the common falls and 
leads to a loss only if the common rises substantially. Thus the risks tend to cancel 
out. In fact, the hedge generally yields a profit upon expiration of the warrant, for a 
wide range of prices of the common. 

If we make assumptions about the probability distribution of the price of the com- 
mon at expiration of the warrant, we get more precise information about the random 
variable representing the payoff from the hedge. Let the probability measure P with 
support [0, oo) describe the distribution of the stock price S, at expiration. Then 


E(S;)-— i x"d P(x). 


Let So be the present price and let E be the exercise price. Assume that P(S; 2 So 
+ t) > P(S; € So- t) for each t > 0, i.e. for any t, the chance of a price rise of at 
least ¢ is no less than the chance of a price drop of at least t. This is a very weak 
assumption. Note that it does imply E(S,) 2 So. 

Just before expiration W, = Oif S, S E and W, = Sp- Eif S; > E. Thus the 
final gain from shorting a warrant at Wy is Wo if S, x: E and is Wy - S, + Eif 
S, > E. The gain from buying a share of common at S, is, of course, S, — So. 

Hence if we assume one share of common is purchased at .5E and one warrant is 
shorted at .2E, the final gain G; is S, - .3Eif S, X Eand .7E if S, > E. A standard 
measure-theoretic argument yields E(G,) = .2E. Using 100% margin, the percent 
profit is E(Gy) / 7E > 28%. With 100% margin on the warrants and 70% margin 
on the common, it is at least .2E / .55E > 36%. With 70% margin on each, it is at 
least .2 / .49 > 40%, an annual rate of more than 20% if the warrant expires in 
two years. 
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It is interesting to calculate E(G,) by assuming that S, is log normally distributed. 
Letting s, = S,/ E, we thus assume that s, has the density function 


f(x) = (xo 2n)! exp | -0 x-p)? /20? |, where p and c are parameters de- 
pending on the stock. We note E(s,) = exp(p + 07/2). 


If t is the time in months remaining until expiration (which is when s, is realized), 
then we assume u = log s; + mt and o? = a?t, where Sy = sg E is the present stock 
price and m and a are constants depending on the stock. Thus E(s,) = so exp 
[(m + a?/2)t]. A mean increase of 10% per year is approximated by setting 12(m + 
a?/2) = .1. If we estimate a? from past price changes we can solve for m. 

Letting wp = W,/E, where W, is the final warrant price, a calculation yields 
E(w,) = E(s,) N(ulo + o) - N(u/c), where N is the normal distribution. (Compare 
the equivalent expression from pp. 464—466 of [5].) Now suppose that sọ = .5, that a 
has the realistic value of .lsọ or .05, and that 12(m + a?/2) = .1, whence m = 


.085/12. Then for t = 24 we have o = /.06 = 245 and u = log .5 + .17 = —523. 
This yields E(w,) = .0015 and E(s,) = .61, whence E(G,) = .20 + .11 = .31. Thus 
the profit, with 70% margin on both warrant and common, is .31/.49 or 63.3% and 
the annual rate is 31.6%. Note that the warrant is virtually worthless! 

Instead of selling one warrant short and buying one share of common, we can sell 
short w warrants and buy s shares of common. Neglecting commissions, which we do 
throughout for simplicity, the gain G at any point (S, W) is s((S — So) —w(W- Wo). 
Thus the line G = 0, the zero profit line, is the line through (Wo, So) with positive 
slope s/w = 1/m. We call m the mix. Points below the zero profit line represent gain 
and points above it represent loss. If 1 < m < oo, the zero profit line intersects the S 
axis at Sọ — m W, and it intersects the line W = S -— Eat S = [m( Wo + E) - Sg1/ 
(m - 1); W = (m Wy + E- S9)/ (m - 1). 

When the warrant expires the hedge position will yield a profit if S, is between 
the S values of the two intersections and it will yield a loss if S; is beyond the inter- 
sections. For instance, if Sọ = .5E and Wy = .2E, the choice m = 2 insures a final 
profit if .1E < S, < 1.9E. Such safety is characteristic of the warrant hedge. 

The final gain Gy is (Se — So) + wW if S, X Eanditiss(S,; — So) + w(Wo + 
E- S,)if S; > E. Thus as a function of S, it is an inverted "V" with apex above 
S, = E. With 100% margin, the initial investment is s Sy + w Wo so the gain per 
unit invested is g; = Gy/(5 So + w Wo). With margin of « on the common and f on 
the warrants it is g; = G,/(a s So + 8 w Wo). 

We have assumed so far that a hedge position is held unchanged until expiration, 
then closed out. This static or “desert island” strategy is not optimal. In practice 
intermediate decisions in the spirit of dynamic programming lead to considerably 
superior dynamic strategies. The methods, technical details, and probabilistic sum- 
mary are more complex so we defer the details for possible subsequent publication. 


Probabilistic summary 


The warrant hedge may offer high expectation with low risk. The gain per unit 
grisg,=[(Sy+ So) +m Wo] / (« So + Bm W,)when S, S Eandg,; = [6S5 — So) 
+ m(W, + E- S,)]/ (« So + Bm Wo) if S, > E. The gain per unit depends only 
on the random variable S,. This has an unknown distribution but it can be estimated. 
The other quantities are constants depending on circumstances. 
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PART II. A MATHEMATICAL THEORY FOR COMMITTING 
RESOURCES IN FAVORABLE GAMES 


]. INTRODUCTION: COIN TOSSING 


Suppose we are confronted with an infinitely rich adversary who will match all bets 
we make on repeated independent trials of a biased coin (whose two outcomes are 
"heads" and "tails".) Assume that we have finite capital Xy, that we bet B; on the 
outcome of the ith trial, where X, is our capital after the ith trial, and that the proba- 
bility of heads is p, where 4 « p < 1. (This is approximately the situation in Nevada 
blackjack, except that the game is played with a "mix" of biased coins.) Our problem 
is to decide how much to bet at each trial. A classic criterion is to choose Bj so that 
our expected gain E(X; — X;_,) is maximized at each trial, which is equivalent to 
maximizing E( X,) for all n. 

Define T; by T; = 1 if the jth trial results in success and 7, = —1 if the jth trial 


results in failure. Then X; = X,., + T;B; j = 1,2,...,and X, = X, + X TB; 
j*1 

We assume that T, X;, and B; are all random variables on a suitable sample space ©. 
If, for example, B; is a function of Xo, X,,..., X;—, as it is in the common gambling 
systems, e.g. Martingale, Labouchere, etc. (note that B, = | X, - X, , | so we need 
not add the B,, k = 1,...,j - 1), then we see by induction that B; is a function of 
Xo, Tis Ta, . . . Hence the underlying sample space can be taken to be the space of all 
sequences of successes and failures, with the usual product measure. 

Suppose, more generally, that the player determined B, by examining Xo,..., 
X;..,. and then "consulting" a chance device, e.g., a near-by roulette wheel. Then the 
sample space consisting of an infinite product of spaces, each of them a joint outcome 
of the roulette wheel and the latest trial, might be suitable. Such possibilities are in- 
cluded if we simply assume 7,, X; and B; are all random variables on some suitable 
sample space Q. 

When B; > X;.,, the player is betting more than he has. He is asking for credit. 
This is common in gambling casinos, in the stock market (buying on margin), in real 
estate (mortgages) and is not unrealistic. 

When B; < 0, the player is making a "negative" bet. To interpret this, we note 
that in our sequence of Bernoulli trials, or coin toss between two players, that what 
one wins, X; — Xo, the other loses. To make a negative bet may be interpreted as 
*backing" the other side of the game, to taking the role of the "other" player. 

In particular, the payoff B;T; from trial j may be written as (—B,) (—7)). If B; S 0, 
then —B, = 0 and may be interpreted as a nonnegative bet by a player who succeeds 
when —T = 1, i.e., with probability g, and who fails with probability p. The —-7; are 
independent so we have Bernoulli trials with success probabilities q, i.e., the other 
side of the game. 

For simplicity we shall assume in what follows that 0 < B; € X;.,, but we may 
wish at a future time to remove one or both of these limitations. 

We also assume that B; is independent of T; i.e., the amount bet on the jth out- 
come is independent of that outcome. 

Definition: A betting strategy is a family (B;) such thatO € Bj  X;-,,j— 1,2,... 

Theorem 1: The betting strategies B; = X;_, when p > 4; B; = 0, p < 4; B; arbi- 
trary when p = 4; are precisely the ones which maximize E(X;) for each j. 


Proof: Since X, = X, + E BjT,E(X,) = X,+ EZ E(B;T;) = 
j=l j=l 


j= 
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— X,- X (p-q)E(B;).Ifp-q =0,ie., p = $, then E(B;) does not affect E( X, ). 
j= 

Ifp-q > 0, ie. p > 4, then E(B;) should be maximized, i.e., B; = Xj, to maximize 

E(X,). Similarly, ifp -q < 0, i.e., p < 4, then E(B;) should be minimized to maxi- 

mize the jth term, i.e., B, = 0. Clearly, these maxima are not attained with other 

choices for B;. This establishes the theorem. 

Remark. In the foregoing discussion, the Bernoulli trials and the 7; can be general- 
ized, yielding a more general theorem. (The T, become “payoff functions" that are 
not necessarily identically distributed; roulette is the classic example.) The particular 
case of blackjack is covered, for instance, by replacing p and q throughout by p; 
and q; for the respective probabilities that 7; = 1 or —1. 

To maximize our expected gain we must bet our total resources at each trial. Thus 
if we lose once we are ruined, and the probability of this is 1 - p" — 1 so maximizing 
expected gain is undesirable. 


2. MINIMIZING THE PROBABILITY OF RUIN 


Suppose instead that we play to minimize the probability of eventual ruin, where 
ruin occurs after the jth outcome if Y; = 0. If we impose no further restriction on B,, 
then many strategies minimize the probability of ruin. For example, it suffices to 
choose B; < X;.,/2. The discreteness of money makes it realistic to assume B; > 
C > 0, where C is a non-zero constant. We further restrict ourselves to the subclass 
of strategies where B; equals C whenever 0 < X;., < a, B; = 0 if X; , S 0 or 
X;-; Z a, and C divides both a — z and z, where we have set z = Xo. This lets us use 
the gambler's ruin formulae ([8], page 314). 

Consider the gambler's ruin situation: Xy = z, B; = 1 if0 < X,- «a, B; —0 
if 3;., = 00r X¥;_, = a, a and z are integers. Let r be a positive number (necessarily 
rational) such that zr and ar are integers. Let R(r) be the ruin probability when z and a 
are replaced by zr and ar, respectively. This is equivalent to betting r ' units when 
0 < X;_, < a, in the original problem. 

We have R(r) = (6% — 07) / (0% — 1), where 0 < p + 4 and 6 = q/p. 

Theorem 2: (a) If 1 p > 4, R(r) is a strictly decreasing function of r. (b) If 
0 « p< 4, R(r) is a strictly increasing function of r. 

Proof: Follows from Lemma 3 below. 

Part (a) of the Theorem says that in a favorable game, the chance of ruin is decreased 
by decreasing stakes. Note that for p > 4, ie, 0 < 1, lim R(r) = 0, hence by 
making stakes sufficiently small, the chance of ruin can be made arbitrarily small. 

Lemma 3. Let a > z 0, x > 0. If 0 <6 < 1, then f(x) = (0* — 6%) / (1—6**) 
is strictly decreasing as x increases, x > 0. If 0 > 1, f(x) is strictly increasing as x 
increases, x > 0. 

Proof: Elementary calculations which we omit. 

Theorem 2(a) shows that, at least in the limited subclass of strategies to which it 
applies, we minimize ruin by making a minimum bet on each tria]. 

In fact, this holds for a broader class of strategies: 

Theorem 3': If 1 > p > 4, the strategy B, = lif 1 < z < a-1, B; = 0 otherwise 
(timid play), uniquely minimizes the probability of ruin among the strategies where B, 
is an integer satisfying 1 < B; < min (z, a-z) if 1 Sz i a-l, B; = 0 otherwise. 

Proof: We first show that if timid play is optimal, then it is uniquely so. Let g, be 
the probability of ruin, starting from z, under timid play. To establish uniqueness it 
suffices to show 
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d, < Pda dd,-o, 2S k S a2, z-k 20, z ka. (1) 
Using q, = (0-07) / (6"-1) and simplifying (1), we find that it is equivalent to show 
f(p) = p 4 gt- pk! gk“! > 0, 2«c«p£l (2) 


This follows at once from the observations f(4) = 0, f(1) = 1, and f'(p) > 0, 
t<psil. 

To show that timid play is optimal, let Q(z) = 1-9,, and adopt the terminology 
of [6]. Then Q(z) is the probability of success, both for z in our original game, and 
for z/n in a normalized game where the possible fortunes are F = { 0, I/a, 2/a,..., 
z/a,..., 1 = aja }, and the betting units and limits are 1/a as large as before. 

The establishment of (1) shows Q(z) is excessive. But obviously u(z) < Q(z) 
< U(z) so by [6, Theorem 2.12.3], Q(z) = U (z). 

Thus timid play is the one and only strategy in our class of strategies which mini- 
mizes the probability of ruin. 

Remark: In [6] it is shown that bold play is optimal but not necessarily unique 
when p < 4 (pages 2, 87ff, 101ff). If there is also a legal upper limit to bets, there may 
be more than one optimal strategy; whether bold play is one of them seems to be 
unknown (page 4). Betting systems which minimize the probability of ruin in certain 
favorable games are also discussed in [10]. 

The strategy which minimizes ruin has the unsatisfactory consequence that it also 
minimizes our expected gain. Some strategy is called for which is intermediate be- 
tween minimizing ruin (and expectation) and maximizing expectation (assuring ruin). 
A remarkable solution, in a certain sense very close to best possible, was proposed 
in [13]. 


3. THE KELLY CRITERION 


Consider Bernoulli trials with 1 > p > 4 and B; = fX;_,, where 0 Sf S lisa 
constant. (This is sometimes called "fixed fraction" or "proportional" betting.) Let 
S, and F, be the number of successes and failures, respectively, in n trials. Then 


X, = X, (1 f" (1- f). 


Observe that f = 0 and f = 1 are uninteresting; we assume 0 < f < 1. Note too 
that if f < 1, there is no chance that X, = 0, ever. Hence ruin, in the sense of the 
gambler's ruin problem, cannot occur. We reinterpret "ruin" to mean that for each 
e > 0,lim P[ X, Z €] = 0, and we shall see that this can occur. Note too that we 

n 


are now assuming that capital is infinitely divisible. However, this assumption is not 
a serious problem in practical applications of the theory. 

Remark: The min-max criterion of game theory is an inappropriate criterion in 
Bernoulli trials. If B; is a positive integer for all j, the maximum loss, i.e. ruin, is always 
possible and all strategies have the same maximum possible loss, hence all are equiva- 
lent. If capital is infinitely divisible, ruin is as we redefined it, and we restrict ourselves 
to fixed fractions, then for an infinite series of trials the min-max criterion (suitably 
probabilitistically modified) considers all f with 1 => f Z f. equivalent and all f with 
0 < f « f. equivalent. It chooses the latter class. For a fixed number n of trials, 
smaller f are preferred over larger f. The criteria of minimizing ruin or of maximizing 
expectation likewise fail to make desirable distinctions between the fixed fraction 
strategies. 
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The quantity log [ X,/ Xol!” = (S,/ n) log (1 +f) + (F,/ n)log (1 - f) mea- 
sures the rate of increase per trial. Since time is important it is plausible to in some 
sense maximize this. Kelly’s choice [13] was to maximize E log [ X,/ Xo]! = p log 
(1 4- f) - ¢log(1-f) = G(f), which we call the exponential rate of growth. The 
following theorems show the advantages of maximizing G (f). 

Theorem 4. If 1 > p > 4, G( f£) has a unique maximum at f* = p-4,0 « f* < 1, 
where G(f*) = p log p + q log q + log 2 > 0. There is a unique fraction f, > 0 
such that G (fc) = 0, and f, satisfies f/* < f, < 1. Further, we have G(f) > 0, 
O0Ocf«f,; G(f) «0, f » fa with G (f) strictly increasing, from 0 to G (f*), on 
[0, /* ], and G (f) strictly decreasing, from G ( f *) to - œ on [f*, 1]. 

Theorem S(a). If G(f) > 0, then lim X, = œ a.s., i.e., for each M, P [lim 


n 


X,>Mj=1. 


(b) If G(f) < 0, then lim X, = 0 a.s., i.e., for each £ > 0, P[lim X, < £] = 
(c) If G(f) = 0, then lim X, = oo a.s. and lim X, = 0 a.s. 


Thus for 0 < f < fa the player's fortune will eventually permanently exceed any 
fixed bounds with probability one. For f — f, it will almost surely oscillate wildly 


between 0 and + oo. If f > fa ruin is almost sure. 
1/n 


Proof: (a) By the Borel strong law([17], page 19), limlog[ X,/Xo9] = G(f)»0 


with probability 1. Hence, a.s., for w e Q, where Q is the space of all sequences of 
Bernoulli trials, there exists N (w) such that for n = N (o), 
log [X, / Xo] "^ z G(f)/2>0. 
But then X, > Xo e" $0? for n > N (o) so X,7 oo. 
(b) The proof is similar to part (a). 
(c) We use the fact that, given any M, lim S, 2 np + M + land 


nptM 
n 


1 
log(1-f) = G (f) + toe 1 t4 Mogi, 


lim S, € np - M~1. Then if S, > np + M,log[ X,/ Xy] ^ 7 
na (np + M) 


log (1 +f) + 


whence X, 2x ef 


-f 
Pram ` 1+ f M . ` . : ] 
lim X, 2 Xo 1-f a.s. Since the right side may be chosen arbitrarily large, 


lim X, = œ a.s. 
n 


E Since S, = np + M infinitely often, a.s., then 


The proof that lim X, — O a.s. is similar. 
Theorem 6: EGH) > G(fz), then lim X (i)i X,(f2) = was. 
Proof: log [ X, (J1) / Xo" - log LX, (fa) Xo)!" 
= log LX, (f) / X, SDF" = Ša log t uz). + Fr log (25). Therefore, by the 


1 tf; 1-f; 
Borel strong law of large numbers, lim log [ X, "m (01^ 6()-GC5) > 0 


with probability 1. Now proceed as in the proof of Theorem 5(a). 
In particular, we see that if one player uses f* and another, betting on the same favor- 
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able situation, uses any other fixed fraction strategy f, then lim X,(f*)/ X,(f) = œ 


with probability 1. This is one of the important justifications of the criterion "bet to 
maximize E log X,.” 

Bellman and Kalaba ([2], pages 200-201) show that f* not only maximizes E log X, 
within the class of all fixed fraction betting strategies but in the class of "all" betting 
strategies. 

This also is a consequence of the following theorem, part of which was suggested 
by a conversation with J. Holladay. Consider a series of independent trials in which 
the return on one unit bet on the ith outcome is the random variable Q;. Then 


X, -n (X,/X;-,) and E log X, = z F log (X; |X;-1). We have X; = Xj, + 


BiQj and Xıl Xj; = 1+ (Bi/ X,-1) Q.. Thus each term is of the form E log (1 + 
F,Q;) where the random variable F; depends only on the first i} trials, Q; depends 
only on the ith trial, and hence F; and Q, are independent. We are free to choose the 
F; to maximize E log (1 + F,Q;), subject to the constraint 0 < F; < 1. 

Theorem 7: If for each i there is an fp, 0 < f, < 1, such that E log (1 + Qi) 
is defined and positive, then for each i there is a number f? such that E log (1 + F;Q;) 
attains its unique maximum for F; = f¥ a.s. To avoid trivialities we assume Q; + 0 
a.s., each i. 

Proof: It follows that the domain of definition of E log (1 + /,Q;) is an interval 
[0, a;) or [0, a;], where a; = min (1, b;) and b; = sup ( f; : iQ; > 0 as. ) > 0. Since 
the second derivative with respect to f; of E log (1 + f,Q;) is -E(Q2 /(1 4 f,Q;) ), 
which is defined and negative, any maximum of E log (1 + f;Q;) is unique. The func- 
tion is continuous on its domain so if it is defined at a;, there is a maximum. If it is 


not defined at aj, then lim E log (1 + f,0;) = - oo so again there is a maximum. 
Sita 
By the independence of F; and Q;, we can consider F;(5,) and Q,(s2) as functions 


on a product measure space S, x S,. Then 
E log (1 + FiQ;) = j j log (1 + Fi(s,) Q;(s)) = E E log (1 + F;(s1) Qi) 


< Elog (1 f* Qi ) with equality if and only if E log (1 + F;(s,) Qj) = 

E log (1 + f** Qj) a.s., which is equivalent to f,* Q; = F;(s,) Q; a.s., and by the in- 

dependence this means either f* = F, a.s. or Q; = 0 a.s. hence f^ = F; a.s., and the 

theorem is established. 
We see in particular from the preceding theorem that for Bernoulli trials with suc- 

cess probability p; on the ith trial and 1 > p; > 4, E log X, is maximized by simply 

choosing on each trial the fraction f;,*= p; - q; which maximizes E log (1 + f,Q;). 


4. THE ADVANTAGES OF MAXIMIZING E LOG Xn 


The desirability of maximizing E log X, was established in a fairly general setting 
by Breiman [3]. Consider repeated independent trials with finitely many outcomes 


I= (1,...,s ) for each trial. Let P(i) = pp i = 1,...,5, and suppose that 
{ A,,..., A, } is a collection of (betting) subsets of J, that each i is in some Ax, and 
that payoff odds o, correspond to the A,. We bet amounts B,,..., B, on the respective 


A, and if outcome i occurs, we receive X Bjo; where the sum is over (j : ie A, 
We make the convention that 4, = Zand o, = 1, which allows us to hold part ofc our 
fortune in reserve by simply betting it on A,. We have, in effect, a generalized roulette 
game. 
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Roulette and the wheel of fortune, as described in Part I, are covered directly by 
Breiman's theory. 

The theory easily extends to independent trials with finitely many outcomes which 
are not identically distributed but which are a mix of finitely many distinct distribu- 
tions, each occurring on a given trial with specified probabilities. The theory so ex- 
tended applies to Blackjack and Nevada Baccarat, as described in Part I. 

Breiman calls a game (i.e., a series of such trials) favorable if there is a gambling 
strategy such that X, > œ a.s. Thus the infinite divisibility of capital is tacitly assumed. 
However, this is not a serious limitation of the theory. If the probability is "negligible" 
that the player's capital will at some time be “small,” then the theory based on the 
assumption that capital is infinitely divisible applies to good approximation when the 
player's capital is discrete. This problem is considered for Nevada Baccarat in [24], 
pages 319 and 321. 

Breiman establishes the following about strategies which maximize E log X,. 

1. Allowing arbitrary strategies, there is a fixed fraction strategy B; = (f,,...,f,) 
which maximizes E log X,. 

2. If two players bet on the same game, one using a strategy A* which maximizes 
E tog X, and the other using an “essentially different” strategy A, then lim 


X, (A*) / X, (A) > coas. 

3. The expected time to reach a fixed preassigned goal x is, asymptotically as x in- 
creases, least with a strategy which maximizes E log X,,. 

Thus strategies which maximize £ log X, are (asymptotically) best by two reason- 
able criteria. 


5. A STOCK MARKET EXAMPLE 


Though in practice there are only finitely many outcomes of a bet in the stock 
market, it is technically convenient to approximate the finite distributions by discrete 
countably infinite distributions or by continuous distributions. In fact it is generally 
difficult not to do this. The additional hypotheses and difficulties which occur are, 
from the practical point of view, artificial consequences of the technique. Hence the 
new theory must preserve the conclusions of the finite theory so again we apportion 
our resources to maximize E log X,. 

As a first example, consider the following stock market investment. It was the first 
to catch our interest, and was based on a tip from a company insider. 

Suppose a certain stock now sells at 20 and that the anticipated price of the stock 
in one year is uniformly distributed on the interval [15, 35]. We first compute f* and 
G (f *), assuming the stock is purchased and fully paid for now, and sold in one year. 
The purchase and selling fees have been included in the price. Thus, the outcome of 
this gamble, per unit bet, is described by dF (s) = C. 4, (s) ds, where F is the asso- 
ciated probability distribution and C,(s) is 1 for s in A and 0 for s not in A. 

The mean m of Fis 1 > 0. Also 

zis, and lim G'(f) = 


GY) = [ion (148) ds, GJ) = P3 P 


Therefore Theorem 8 below applies and rei is a unique f * such that 0 < f* < 4 and 
G'(f*) = 0. To obtain /*, it suffices to solve 
fs ds H ds 


i 
h(f) = 0 where h(f) = feque LIER j as- Fg 


-l1-(l/f)iog, (14 fs) which reduces to 1 - (1//f) log. t, 
-4 


H ET 


h(f). 
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Now A( f) has the same sign and root as G'( f) on (0,4). Since A(3) = 1 - 4 log, 
13 > 0, G'(f£) > 0 for 0x fx 3. Therefore 3 < f* < 4; calculation yields f* = 
3.60 . 

Thus if the maximum fraction of current capital which can be bet is [, we should 
bet all our capital. However, if margin buying is allowed, we should (consistent with 
our ability to cover later) be willing to bet as much as possible, up to a fraction f* 
which is 3.6 times our current capital. 

The mathematical expectation for buying outright is 0.25 V, if buying on margin 
is excluded and 0.90 Vj, if unlimited buying on margin is permitted, and additional 
coverings can be made later as required, and we bet f* = 3.60 of our current capital. 

Integration yields 
G(f) = [(00g; e) /f1C + 3//4) [In (1 + 3//4) -1]- (107/74) [In (10-//4) -1]] 
from which we find G (1) — 0.28 and G (3.60) — .59. 

Next, we compute f* and G (f *), assuming that calls are purchased for 2 points 
per share. Thus the outcome of this gamble per unit bet is described by the probability 
distribution F with mass 5/20 at -1, and dF (s) = 2/20 if -1 < s < 6.5. 


6.5 
The mean m = 1.8125 > 0. Also G(f) = (5/20) log; (1-f) + (2/20) J log, 


(1 + fs) ds and G’(f) cipes + (2/20) 1 E 


clear that lim G' (f) = — oo. Therefore, again by Theorem 8 below, there is a unique 


f 
f* such that G'(f*) = Oand0 < f* < 1. 

Jt suffices to solve h(f) = 0 where h (f) = 20G'(f)/log, e 
=+ ifr 5- Slog, (PS) H We find f* — 0.57. The mathematical expec- 
tation of the call purchase process is 1.8125 f* V, or about 1.03 Vo. 

Integration yields 
G Cf) = (4) log, (1-f) + Cn, eon + 6.5f) [In(1 + 6.5f)-1]-(1-f) 
[in(1-f)-1]. We find G (0.57 ) = 0.5 

Thus we have the interesting result that the expectation from buying calls is higher 
than from buying on unlimited margin but that the growth coefficient is higher from 
buying on unlimited margin. Our criterion selects the latter investment. 

For buying on margin G (37) = .55 so our criterion selects buying on margin if 
the margin requirement is less than 3^ and buying calls, if possible, if the margin 
requirement exceeds 47. 

In the preceding example we needed the following theorem to establish the unique- 
ness of f*. We define a = sup { t : F(- œ, t) = 0 ) and note that if 1 + fa > 0 and 


o [^] l 
the integral G (f) = f log, (1 + fs) dF (s) is defined, then G'(f) = E 082 © AF (s), 


a 1+ fs 
(See, e.g. [17], page 126). 
Theorem 8: The function G' (f) = 


ds, from which it is 


| 1 E dF (s) is monotone strictly de- 


creasing on [0, - 1/a). If the mean m = i sdF (s) > 0, then the equation G'(f) = 


a 


oo 
j UEM dF (s) = O has exactly one solution f* in the interval (0, — 1/a) iff Am, 


G' (f) < 0. In this event, G (f) is monotonely strictly increasing for f in [ f*, — 1/a). 


S S , 
. - ——À— > G' G . 
Proof: If 0 « f, « f; « -1/a, I4 ithe (0 52 a)soG' (f) > G'(f2) 
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From this and the right-hand continuity of G'( f) at 0, G' is monotone strictly de- 
creasing on (0, -1/ a). By hypothesis G'(0) = m log, e > 0. Therefore, from the 
continuity of G'( £) on [0, -1/ a), G'( f) attains all values ¢ on the interval 


lim G'(f) < t € G'(0) exactly once. Thus there is exactly one solution f* in 
f 7- Ma 
(0,-1/a)iff lim G'(f) <0. 
7-1/a 
The description of G (f) is now evident. 


6. WARRANT HEDGING 


We next apply the criterion of maximizing E log X, to the warrant hedge described 
in Part I. With the notation of Part I, and an assumed mix of 1, the gain X from a one 
unit bet is 

X = (Sf -So + Wo)/(% so +B Wo), s; i1, and 
X = (wo + 1-59)/( a 59 +B Wo), 5, 1. 


We wish to maximize the exponential rate of growth G (f), given by G(f) = 
Elog(1+/X). 

It can be shown that the situation is essentially the same as in Theorem 8 and that 
this depends on the a.s. boundedness of X ; we have in fact a.s. sup X = ( wọ + 1-59) / 
(a5 9 +B wo) and as. inf X = —(59~-Wo)/(a@59 + B wo). Thus f* can be com- 
puted when the mix is 1, though the details are tedious. 

When the mix is greater than 1, more serious difficulties appear. The payoff function 
X has a.s. inf XY = — oo and a.s. sup X < oo. This means that, no matter what fraction 
f > O of our unit capital is bet, there is positive probability of losing at least the entire 
unit. Thus any bet is rejected! Yet this is unrealistic. We now find out what is wrong. 

First, the assumption that arbitrarily large losses have positive probability of occur- 
rence 1s not realistic. (a) The broker will automatically act to liquidate the position 
before the equity is lost. (b) The strategies for investing in hedges automatically lead 
to liquidating the position after the common is substantially above exercise price. 

There is, then, a maximum imposed on X by practice but it is not easy in practice 
to specify this maximum. Further, this maximum will, in general, be a random 
variable (a.s. bounded, however) which is a function of the individual's investment 
strategy. It is not easy to determine the consequent probability distribution of sp, yet 
this is required to calculate E log (1 + fX). 

More generally, we might consider an individual's lifetime sequence of bets of 
various kinds. It is plausible to assume that X, = 0 only upon the death of the indivi- 
dual, for although the individual may have no cash equity at a given instant, he does 
have a cash “worth”, based on his future income, serendipity, etc., and this should be 
included in X,. This is true even of a (Billie Sol Estes) bettor who loses more than he 
owns. The subtlety here, then, is that the accountant's figure for net assets (plus or 
minus) is not an accurate figure for X, as X, decreases below small positive amounts. 

One can also object to Y, at death being assigned the value 0, by arguing that the 
chance of death in a time interval always has a small positive probability, thus making 
Elog X, — — oo always. Also, individuals when choosing between two alternatives 
each involving a low probability of death generally do not meticulously select the 
safer alternative (e.g., air travel versus train travel). Thus death should really be 
treated as an event with a large but finite negative value. 

Another common objection to E log X, as a measure of "utility" is that, like all 
such measures which are not a.s. bounded, it allows the St. Petersburg paradox. 
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The foregoing objections to E log X, only arise when we leave the case of finitely 
many outcomes. We say that these are artificial technical difficulties which can all be 
removed in the cases of practical importance. This may be tedious, as it is for the 
warrant hedge, so we defer such matters for a subsequent paper. 


7. PORTFOLIO SELECTION USING E LoG X 


The Breiman results were obtained for repeated independent trials with finitely 
many outcomes and finitely many ways to apportion our capital (amongst finitely 
many betting sets). The results extend, as we have remarked, to independent trials 
which are a mix of finitely many differently distributed trials (i.e., finitely many out- 
comes and betting sets) provided that as n tends to infinity, the number of trials with 
each distribution also tends to infinity. 

There are significant real world situations, such as the selection and continuous 
revision of a portfolio of securities, to which this extended theory does not generally 
apply. A ditficulty which we have already discussed is that it may be technically con- 
venient to introduce continuously distributed and possibly unbounded payoffs, but 
now generalized to the apportionment of capital among a finite number of alter- 
natives, rather than just betting a fraction on one alternative. Another problem is that 
the sequence of betting situations may change so that no two are ever the same. 
Further difficulties arise when we consider the possible dependence of trials. Still 
other problems appear when we consider that in the real world the spectrum of situa- 
tions is changing continuously and that a potentially continuous portfolio revision is 
part of an optimal approach. (Actually, because of the transactions costs which occur 
in practice, portfolio revision is likely to occur in discrete steps.) 

The extent to which Breiman's conclusions for the finite case can be generalized in 
these directions will be considered subsequently. For now we simply remark that the 
possible generalizations promise to be adequate for the real world problems of port- 
folio selection. 

Assuming this to be the case, we shall see in the next section that economists and 
others now have for the first time an accurate guide for portfolio selection and revision. 


8. THE KELLY CRITERION AND DEFICIENCIES IN 
THE MARKOWITZ THEORY OF PORTFOLIO SELECTION 


How to apportion funds among investments has endlessly puzzled economists and 
decision-makers. The literature was noted for its lack of instruction in such matters. 
When Markowitz’ work on portfolio selection appeared, first in articles and later in 
the monograph [18], it became the standard reference. 

Markowitz considers situations in which there are r alternative and, in general 
correlated, investments, with the gain per unit invested of X,,..., X, respectively. 
(It is so much more dignified to call bets investments; we shall try to remember to do 
this in this section.) One of the investments is, of course, cash. The gain is given by 
X, = 0 a.s. 

To select a portfolio is to apportion our resources so that f; is placed in the ith 
investment. Markowitz’ basic idea is that a portfolio is better if it has higher expecta- 
tion and at least as small a variance or if it has at least as great an expectation and has 
a lower variance. If two portfolios have the same expectation and variance, neither is 
preferable. As the f, range over all possible admissible values, the set of portfolios is 
generated. Typically the assumptions on the f; are 2 f, = 1, and f; 2 Ofori = 1,...,r. 
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If a portfolio has the property that no other portfolio in the set is preferable, then 
it is called efficient. Markowitz says that the investor should always choose an efficient 
portfolio. Which efficient portfolio to choose depends on factors outside the theory, 
such as the investor's "needs". 

The Markowitz theory has the obvious deficiency that if E; and o2, i = 1, 2, are 
the expectation and variance of portfolios 1 and 2, then if E, < E; and o? < 03, the 
theory cannot choose between the portfolios. Yet there are obvious instances where 
"everyone" will choose the second portfolio over the first, such as when F,( x) « 
F(x ) for all x. Specifically, let X, be distributed uniformly on [1, 3], let X; be uni- 
formly distributed on [10, 100] and let X4 = 0 a.s. represent the possibility of holding 
some of our resources in cash. Suppose X,, X2, and X, are independent. Then 
EE f, X; « 2f, + 55f; and o? Ef; X; = E f?o? = f2/3 + 675f2. All cash, or f = 1, 
is an efficient portfolio since this is the unique portfolio with zero expectation. The 
portfolio f; = | also is efficient since this is the unique portfolio with greatest expecta- 
tion. There are, in fact, infinitely many efficient portfolios. (They lie on a curve in the 
ti, f; plane connecting (0, 0) and (0, 1).) The theory doesn’t tell us which is best, 
yet fa = l is clearly preferable to any alternative. 

In the case where there are the two alternatives X, = 0 a.s. (cash) and X; with 
E, > O and c; > 0, all portfolios are efficient and Markowitz’ theory gives no infor- 
mation on which to choose. The Kelly criterion tells us to choose f, to maximize 
E log (1 + f2X2) and we know further from the theory of the Kelly criterion why 
this choice is good. As we have seen, repeated trials of such an investment with fz 
greater than the fraction f, will lead to ruin as. 

Remark: This incompleteness of Markowitz’ theory is understandable since he only 
uses probability information about first and second moments. We note though that 
the examples he gives, and the real world applications, generally assume that more 
detailed structure is known. Hence, it is reasonable that the criterion E log X,, which 
does use higher moment information, can provide a sharper theory. 

Next consider those two-point probability distributions with masses mm; located 
at x;, i = 1, 2, and with mean and variance 1. These are indistinguishable by Marko- 
witz’ criterion. A calculation shows, however, that for X, defined by x, = -1, x; = 
3/2, m, = 1/5, m, = 4/5, the optimal fraction f¥ is 4 and G(ff) is -(1/5) log 3 + (4/5) 
log 2. For X, defined by x, = -2, x; = 4/3, m, = 1/10, m; = 9/10, we have f? = 3 
and G(f*) = -(1/10) log 4 + (9/10) log (3/2), which is smaller than G(ff ). 
Hence if X7 , is the fortune after n repeated independent trials of an investor who 
invests ff in X, at each trial and X, 2 is the fortune after n trials of an investor who 
invests in any manner whatsoever in X, at each trial, we have lim X5 ,/X,,. = oo a.s. 


As a final example, suppose we are to apportion our resources between the fore- 
going X, and X, which we now suppose to be independent, and cash, represented 
by X,. We impose the constraints f; > 0, i = 1, 2, 3; fi +f +h = 1, and f, + 
2 f, € 1. The latter constraint prevents investments where our losses exceed our total 
resources. (The analysis and conclusion are essentially the same without this con- 
straint.) The admissible portfolios are represented by the closed triangular region of 
the positive quadrant bounded by the axes and the line f, + 2f; = 1. 

We have EXf,X, = f, + f; and, because of the independence of X, and X2, 
o Ef, X, = f? + f2. The efficient portfolios are the points of the f, f; plane on the 
two closed line segments joining (4, 4 ) to (0, 0) and to (1, 0). 

The function E log( 1 -- AX, +X.) = G (fi J2) is given by 50 G (fh) = 
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36 log(1 -3/,/2 c 4/2/3) + 4 log( 1-3/,/2-25,) + 9 log(1-f, + 
+ 45/3) + log(1-/,-2/;) This function is undefined on the line joining 
(0, 1/3) and (1, 0). It is defined and continuous elsewhere on the triangle of portfolios 
and as (fi, f2) tends to the segment from this triangle, G (fi, f2) > ~ oo. It follows 
(by the continuity) that G (fi, J2) attains an absolute maximum in the region of the 
triangle where it is defined. We also know that any such maximum is positive. It 
follows that, if an efficient portfolio maximizes G ( f1, f2), then it must be a portfolio 
from the interior of the segment joining (0, 0) and (1/3, 1/3). Hence the coordinates 
must simultaneously satisfy the equations ð G (fi, /2)/ 0f, = 0 and dG(f,,f,)/ 
a fa = 0. (We note that in repeated independent trials where the investor selects an 
efficient portfolio from the segment joining (1/3, 1/3) to (1, 0), he will be ruined with 
probability one.) 

Setting f, = f; = tin the equations 0 G / 3 f, = O and 0G / ð f; = O and attempt- 
ing to solve simultaneously yields, upon elimination between the two equations of the 
last of the four fractions, the necessary condition -2796 + 376 + 1111? = 0. Since 
this is negative at ¢ = 0 and ¢ = 1, there are no roots in the interval 0 < t < 1/3. 
Hence no efficient portfolio maximizes G (fi, f;). 

We conclude that if X% , is the fortune after n trials of a player who bets to maxi- 
mize G(f,, f2) on each trial, and X, ; is the fortune of a player who chooses any 
efficient portfolio on each trial, then lim X5 ,/X,, ; = œ a.s. Furthermore, the 
Kelly investor will reach a fixed goal x in less time, asymptotically as x — oo, than a 
Markowitz investor. 

The Kelly criterion should replace the Markowitz criterion as the guide to port- 
folio selection. 
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PORTFOLIO CHOICE AND THE KELLY CRITERION 


Edward O. Thorp, University of California at Irvine* 


1, Introduction. The Kelly (or capital growth) 
criterion is to maximize the expected value 
E log X of the logarithm of the wealth random 
variable X. Logarithmic utility has been widely 
discussed since Daniel Bernoulli introduced it 
about 1730 in connection with the Petersburg 
game (3, 28]. However, it was not until certain 
mathematical results were proven in a limited 
setting by Kelly in 1956 and then in much more 
general setting by Breiman in 1960 and 1961 
that logarithmic utility was clearly distinguished 
by its properties from other utilities as a guide 
to portfolio selection. (See also [2, 4, 15], and 
the very significant paper of Hakansson [11]. ) 
Suppose for each time period (nz1,2,:-.) 
there are k investment opportunities with re- 
sults per unit invested denoted by the family of 


random variables Xn, 2? see Xk . Suppose 


also that these random variables have only fi- 
nitely many distinct values, that for distinct n 
the families are independent of each other, and 
that the joint probability distributions of distinct 
families (as subscripted) are identical. Then 
Breiman's results imply that portfolio strategies 
A which maximize E log x: where x is the 


wealth at the end of the n-th time period, have 
the following properties: 
Property 1, (Maximizing E log x asymp- 


totically maximizes the rate of asset growth, ) If, 
for cach time period, two portfolio managers 
have thc same family of investment opportunities 
or investment universes, and one uses a strategy 
A* maximizing Elog x whereas the other uses 


an "essentially different" (i.e., Elog x (4%) - 
Elog X{A) 9) strategy A, then lim X (AX (A) >> 


almost surely (a. s.). 

Property 2, The expected time to reach a 
fixed preassigned goal x is, asymptotically as 
X increases, lcast with a strategy maximizing 
Elog x. 


The qualification "essentially different" con- 
ceals subtleties which are not generally appreci- 
ated, For instance [11], which is close in method 
to this article, and whose conclusions we heartily 
endorse, contains numerous mathematically 
incorrect statements and several incorrect con- 
clusions, mostly from overlooking the require- 
ment "essentially different." We intend to 
present a detailed analysis elsewhere and only 
indicate the problem here: If x; = XX then 


even though Elog x70 for all j it need not be the 


casc that P(lim x=) =]. In fact, we' can have 


215 


{just as in the case of Bernoulli trials and 
E log X,* 0; see [26]) P(limsup X -9)1 and 


PüiminfX =0)=1 (contrary to [11, p. 522, eq. 


(18)] and following assertions). Similarly, when 
Elog *;<0 for all j we can have these alterna- 


tives instead of P(lim X =0}=1 (contrary to [11, 


p. 522, eq. (17)] and the following statements; 
footnote l is also incorrect.) 

Note [6] that with the preceding assumptions, 
there is a fixed fraction strategy A which maxi- 
mizes Elog X. A fixed fraction strategy is one 
in which the fraction of wealth f£ , allocated to 


, 


investment x . is independent of n. 


We emphasize that Breiman's results can be 
extended to cover many if not most of the more 
complicated situations which arise in real world 
portfolios, Specifically, the number and distri- 
bution of investments can vary with the tim» 
period, the random variables need not be finite 
or even discrete, and a certain amount of depen- 
dence can be intreduced between the investmeat 
universes for different time periods, We have 
used such extensions in certain applications (e.g., 
[25; 26, p. 287]. 

We consider almost surely having more 
wealth than if an "essentially different" strategy 
were followed, as the desirable objective for 
most institutional portfolio managers. (It also 
seems appropriate for wealthy families who 
wish mainly to accumulate and whose consump- 
tion expenses are only a small fraction of their 
total wealth, ) Property i says that maximizing 
Elog x is a recipe for approaching this goal 


asymptotically as n increases, This is cur 
principal justification for selecting Elog X as 
the. guide to portfolio selection, 

In any real application n is finite, the limit 
is not reached, and we have P(X (AX (A)212 M) 


=)-e€(n,A,M) where €?*0 as n?9, M>O is 
given, A* is the strategy which maximizes 
Elog X and A is an "essentially different" strat - 


egy. Thus in any application it is important to 
have an idea of how rapidly'¢€*0. Work needs to 
be done on this in order to reduce Elog X toa 
guide that is useful (not merely valuable) for 
portfolio managers. Some illustrative examples 
for n=6 appear in [11]. ; 

Property 2 shows us that maximizing E log X 
also is appropriate for individuals who have a sect 
goal (ec. g., to become a millionaire). 

Appreciation of the compelling properties of 
the Kelly criterion may have been impeded by 
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certain misunderstandings about it that persist. 
in the literature of mathematical economics. 

The first misunderstanding involves failure 
to distinguish among kinds of utility theories. 
We compare and contrast three types of utility 
theories: (1) descriptive, where data on observed 
behavior is fitted mathematically. Many differ- 
ent utility functions might be needed, corres- 
ponding to widely varying circumstances, 
cultures, or behavior types!; (2) predictive, 
which "explains" observed data: fits (or ob- 
served data are deduced from hypotheses, with 
the hope future data will also be found to fit. 
Many different utility functions may be needed, 
corresponding to the many sets of hypotheses 
that may be put forward; (3) prescriptive (also 
called normative), which is a guide to behavior, 
i.e., a recipe for optimally achieving a stated 
goal, It is not necessarily either descriptive or 
predictive nor is it intended to be so. 

We use logarithmic utility in this last way, 
and much of the misunderstanding of it comes 
from those who think it is being proposed as a 
descriptive or a predictive theory. The ElogX 
theory is à prescription for allocating resources 
50 às to (asymptotically) maximize the rate of 
growth of asscts, 

Another “objection” voiced by some econo- 
mists to Elog X and, in fact to all unbounded 
utility functions, is that it doesn't resolve the 
(generalized) Petersburg paradox. The rebuttal 
is blunt and pragmatic: The generalized Peters- 
burg paradox does not arise'in the real world 
because any one real world random variable is 
bounded (as is any finite collection). Thus in any 
real application the paradox does not arisc.. 

To insist that a utility function resolve the 
paradox is an artificial requirement, certainly 
permissible, but obstructive and tangential to 
the goal of building a theory which is also a 
practical guide. 


2. Samuelson's objections to logarithmic utility. 
Samuelson (21, pp. 245-6; 22, pp. 4-5] says that 


repeatedly authorities [5, 6, 14, 15, 30] "... have 
proposed a drastic simplification of the decision 
problem whenever T [the number of investment 
periods]? is large. 

Rule: Act in each, period to maximize the 
geometric mean or the expected value of log x 


The plausibility of such a procedure comes 
from the recognition of the following valid asymp- 
totic result. 

Theorem: Acting to maximize the geometric 
mean at every step will if the period is "suffi- 
cienUy long," "almost certainly"3 result in 
higher terminal wealth and terminal utility than 
from any other decision rule."... 

"From this indisputable fact, it is apparently 
tenipting to believe in the truth of the following 
falsu corollary: i 

False corollary: If maximizing the geo- 


metric mean almost certainly leads to a betler 
outcome, then the expected value utility of its 

outcomes excecds that of any other rule, pro- 

vided T is sufficiently large." 

Samuelson then gives counterexamples to 
the corollary, We heartily agrec that the corel- 
lary is false. In fact we had already shown th 
for one of the utilities Samuelson uses, for we 
noted [26} that in the case of Bernoulli trials 
with probability 1/2<p<1 of success, one s&houla 
commit a fraction w=1 of his capita} at cach 
trial to maximize expected final gain E x, (pare 


283; the utility is U(x) =x) whereas to maximize 
E log X, ‘he should commit w -2p-1 of his capi- 


tal at each trial (page 285, Theorem +t). 

The statements which we have seen in print 
supporting this "false corollary" are by Latane 
[15, p. 151, fn. 13} as discussed in[21, p. 245, 
fn. 8), and Markowitz [16, pp. ix-x], Latane 
may not have fully supported this corollary for 
he adds the qualifier "... (in so far as certain 
approximations are permissible)... ". 

That there were or are adherents of the 
"false corollary" seems puzzling in view of th: 
following formulation. Consider a T stage in- 


vestment process, At each stage we allocate our 


resources among the available investments, Fov 
each sequence A of allocations which we choos: . 
there is a corresponding terminal probability 


A 
distribution Fr of assets at the completion of 


stage T, For each utility function U(-), con- 
sider those allocations A"(U) which maximize 
the expected value of terminal utility 


Jue arte. Assume sufficient hypotheses or 
U and the set of FA so that the integral is de- 


fined and that furthermore the maximizing allo- 
cation A"(U) exists. Then Samuelson says that 
A*(log) is not in general A*(U) for other U. 
This seems intuitively evident. 

Even more seems strongly plausible: that if 


Ur and y are inequivalent utilities then 


A > A TM . 
[UAE (x) and J U, (x) dF, (x) will in general 
be maximized for different Fr . (Two utilities 


U and U are equivalent if and only if therc are 


constants a and b such that U(x) =aU(x)+b, 
a70; otherwise u and U are inequivalent, ) In 
this connection we have proved: 

Theorem: Let U and V be utilities definud 
and differentiable on (0,9), with U(x) and V(x} 
positive and strictly decreasing as x increases. 
Then if U and V are inequivalent, there is a 
one period investment setting such that U and V 
have distinct optimal strategies. 4 

All this is in the nature of an aside for 
Samuelson's correct criticism of the "false 
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corollary" does not apply to our use of logarith- 
mic utility, Our point of view is: if your goal is 
property l or property 2, then a recipe for 
achieving either goal is to maximize ElogX. 
These properties distinguish log from the pro- 
lixity of utility functions in the literature, 
Furthermore, we consider these goals appro- 
priate for many (but not all) investors. Investors 
with other utilities, or with goals incompatible 
with logarithmic utility, will of course, find it 
inappropriate, . 

Property 1 implies that if K maximizes 
Elog X (A) and A’ is "essentially different, " 


then x (m) tends almost certainly to be better 
than X) as n?*, Samuelson [21, p. 246], 


apparently referring to this, says after refuting 
the "false corollary"; "Moreover, as I showed 
elsewhere [20, p. 4], the ordering principle of 
selecting between two actions in terms of which 
has the greater probability of producing a higher 
result does not even possess the property of be- 
ing transitive. ...we could have w**” better than 
wt, and w*"* betterthan w*, and also have w* 
better than w*# u 

For some entertaining examples, see the 
discussion of non-transitive dice in [9]. (Consider 
the dice with equiprobable faces numbered as 
follows: X= (3, 3, 3,3,3,3); Y= (4,4,4,4,1,1); Z = 
(5,5,2,2,2,2). Then P(Z»Y) 25/9, P(Y?X)- 25, 
P(X>Z)=2/,) What Samuelson does not tell usis 
that the property of producing a higher result 
almost certainly, as in property 1, is transitive. 
If we have w »w** almost certainly, and 
w** >w% almost certainly, then we must have 
wr »w* almost certainly. 

One might object [20, p. 6] that in a real in- 
vestment sequence the limit as n>® is not 
reached, Instead the process stops at some finite 
N. Thus we do not have X j/*) » X (A) almost 

n 
certainly, Instead we have P(X (RPK (A) )21- EN 
where w? as N>%, and transitivity can be 


shown to fail, 

This is correct, But an approximate form of 
transitivity does hold: Let X,Y,Z be random var- 
iables with P(X>Y)=1 -6 " P(Y>2Z)=1 -5 . Then 
P(X»Z)E1-(& +6) . To prove this, let A be the 
event X>Y, B be the event Y>Z, and C be the 
event X>Z. Then P(A)+P(B) = P(AUB}+P(ANB) 
£)+P(ACB), But ANBCC so P(C) 8 P(ANB) 
EP(AMP(B)-1, i.e., P(X»Z)E1- (4 + €,) . 


Thus our approach is not affected by the vari- 
ous Samuelson objections to the uses of logarith- 
mic utility, ` 

Markowitz [16, pp. ix-x] says '',.;in 1955-56, 
I concluded...that the investor who is currently 
reinvesting everything for "the long run" should 
maximize the expected value of the logarithm of 
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wealth." (This assertion seems to be regardless 
of the investor's utility and so indicates belief in 
the "false corollary,") Mossin [18] and Samuclson 
[20] "... have each shown that this conclusion is 
not true for a wide range of [utility] functions ... 
The fascinating Mossin-Samuelson result, com- 
bined with the straightforward arguments sup- 
porting the earlier conclusions, seemed paradox- 
ical at first. I have since returned to the view of 
Chapter 6 (concluding that: for large T, the 
Mossin-Samuelson man acts absurdly, ..." 
Markowitz says here, in effect, that alternate 
utility functions (to log) are absurd. This position 
is unsubstantiated and unreasonable, 

He continues "...like a player who would pay 
an unlimited amount for the St, Petersburg 
game..." If you agree with us that the St. Peters- 
burg game is not realizable and may be ignored 
when fashioning utility theories for the real world, 
then his continuation ''.,, the terminal utility func- 
tion must be bounded to avoid this absurdity; ..;" 
does not follow. 

Finally, .Markowitz says "...and the argv- 
ment in Chapter 6 applies when utility of terrni- 
nal wealth is bounded. " If he means by this that 
the "false corollary" holds if we restrict our- 
selves to bounded utility functions, then he is 
mistaken, Mossin [18] already showed that the 


optimal strategies for logx and xv, YyY+0, are 
fixed fraction for these and only these utilities. 


Thus any bounded utility besides x'/v » Y<O, will 

have optimal strategies which are not fixed frac- 
tion, hence not optimal for logx ; Samuelson(22] 
gives counterexamples which include the bounded 


tyes Y : : 
utilities x'/Y, Y<0. Since Mossin assumes U^" 
exists and our theorem only assumes that U' 
exists, it provides additional counterexamples. 


3, An outline of the theory of logarithmic utility 


&s applied to portfolio selection, The simolest 
case is Bernoulli trials with probability p of 
success, O<p<}, The unique strategy which max- 
imizes Elog x is to bet at trial n the fixed frac- 


tion f"zp-q of total current wealth x 1 if 


p ?1/2 and to bet nothing otherwise, 
To maximize Elog x is equivalent to maxi- 


mizlng Elog[x x ^ 8G(f, which we call the 


(exponential) rate of growth (per time period). It 
turns out that for p?l/2, G(f) has a unique posi- 
tive maximum at Í* and that there is a critical 
fraction f, O«f*«fxl, such that 60): 0, 


G(f)»0 if O<f<f, Gif)«0 if £<f#1 (we assume 


"no margin"; the case with margin is similar). If 
f<f, xc? a.s.; if fef, lim supX ct» a.s., 


and liminf X -0 a.s.; if {>f , limX_=0 a.s. 
: n c n 
("ruin") 
Bernoulli trials exhibit many of the features 
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of the following more general case, Suppose we 
have at each trial x 2,” the k investment 
opportunities x Wr n,2' see X nk and that the con- 


ditions of property l, section l are satisfied, 
This means that the joint distributions of 
X.X., "X are the same for all n, for 
Y ni ni ni 


2 
each subset of indices 15i P <: 


more Bar "X ad and ra (tX 2 are 


independent when m+n, and all random variables 
Xi have only a finite number of distinct values. 
Thus we have in successive time periods repeat- 
ed independent trials of "the same" investment 
universe. 

Since Breiman has shown that there is for 
this case an optimal fixed fraction strategy 
mz (a . £t) » we will have an optimal strategy 


if we find a strategy which maximizes Elog x in 


<i Sk, Further- 


the class of fixed fraction strategies. 


Let A= UD en A) be any fixed fraction strat- 
egy. We assume that ft Z Hl so there is no 


borrowing, or margin. The margin case is sim- 
ilar (the approach resembles [23]). Using the 
concavity of the logarithm, it is easy to show 
(see below) that the exponential rate of growth 


Elog [x (yx 1^ E ot, 


tion of UD E 4) , just as in the Bernoullitrials 


EL is a concave func- 


case. The domain of G(f) in the Bernoullitrials 
case was the interval [0,1) with G(f)!-9 as fl, 
The domain in the present instance is analogous. 
First, it is a subset of the k dimensional simplex 
. Ej z z 
&- fon Ad: ftt +5 1; 4z 0, “ALE o% 
To establish the analogy further, e R 

=X_.-1, j=1,°++,k, be the return per unit onthe 


i-th investment opportunity at an arbitrary time 
eriod n. Let the range of R, be t itm. 
P s j j.l il 


and let the probability of the outcome R Er om 
à) 


and Ry > d «++ and Re T. 
mma: ' ' . Then E log US =G(f, Ut 29 
f 
=E . log(1 «ts r + : 
| Vom ) 
lÉm$i; . denti), from which a concavity 
of Gif), . Lu can be shown. Note that Ges 


is defined if and only if EIE 


Ret 20 
Um Vim 
for each set of indices m, me. Thus the do- 
main of ett, ee A) is the intersection of all 


these open half-spaces with the k-dimensional 
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simplex 5. . Note that the domain is convex and 
includes all of 5. in some neighborhood of the 
origin. Note too that the domain of G is all of 5. 
if (and only if) Ry for all j, i.e., if there is 
no probability of total loss on any investment, 
The domain of G includes thc interior of ES if 

R E -1. 


and most cases of interest are included, 
If f,, ++: Ud are chosen so that 

1 
»-ím m te 


then PX 1 ee 


Both domains are particularly simple 


“im, 


$0)2e€70 for all n and 


s0 e 
HN , for some m, 


FEX CK 
ruin occurs with probability 1., 

Computational procedures for finding an op- 
timal fixed fraction strategy (generally unique in 
our present setting) are based on the theory of 
concave (dually, convex) functions [29] and will 
be presented elsewhere. (As Hakansson [11, p. 
552] has noted, ",..the computational aspects of 
the capital growth model are [presently] much 
less advanced" than for the Markowitz model, ) 

The theory may be extended to more general 
random variables and to dependence between dif- 
ferent time periods, Most important, we may 
include the case where the investment universe 
changes with the time period, provided only that 
there be some mild regularity conditions on the 


x ., such as that they be uniformly a,s, bounded 
, 


` and that they do not tend to 0 uniformly as i?*, 


(See [15], and the generalization of the Bernoulli 
trials case as applied to blackjack in [26],) The 
techniques rely heavily on those used to genera- 
lize the law of large numbers, 

Transactions costs, the use of margin, and 
the effect of taxes can be incorporated into the 
theory. Bellman's dynamic programming method 
is used here, 

The general procedure for developing the 
theory into a practical tool imitates Markowitz 
[16]. Markowitz requires as inputs estimates of 
the expectations, standard deviations, and co- 


variances of the X: 'E We require joint proba- 
à 


bility distributions, This would seem to be a 
much more severe requirement, but in practice 
does not seem to be so (16, pp. 193-4, 198- 201]. 
Among the actual inputs which Markowitz 
chose were (1) past history [16, ex., 8-20], (2) 
probability beliefs of analysts (pp. 26-33), and 
(3) models, most notably regression models, to 
predict future performance from past data (p. 33, 
pp. 99-100). In each instance one can get enough 


additional information to estimate Elog Qux. p, 


There are, however, two great difficulties 
which all theories of portfolio selection have, 
cluding ours and that of Markowitz. First, there 
seems to be no established method for generally 


in- 
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predicting sccurity prices which gives an edge of 
eveu a few per cent, The random walk is the best 
model for security prices today. (See (7, 10]. ) 

The sccond difficulty is that for portfolios 
with many securities the volume of inputs called 
for is prohibitive: for 100 securities, Markowitz 
requires 100 expectations and 4950 covariances; 
and our theory requires somewhat more infor- 
mation. Although considerable attention has been 
given to finding condensed inputs that can be used 
instead, this aspect of portfolio theory still 
seems unsatisfactory, 

In the fifth section we will show how both 
these difficulties were overcome in practice by 
an institutional investor, That investor, guided 
by the Kelly criterion, then outperformed for 
the year 1970 every one of the approximately 400 
Mutual Funds listed by the S & P stock guide! 

But first we relate our theory to that of 
Markowitz. 


4. Relation to the Markowitz theory; solution to 
problems therein, The most widely used guide 


to portíolio selection today is probably the Mar- 
kowitz theory, The basic idea is that a portfolio 


B is superior to a portfolio E if the expecta- 


tion ("gain") is at least as great, i.e., E(P) 
=E(P,) and the standard deviation ("risk") is no 
greater, i.e., oP) so(P) , with at least one in- 


equality, This partially orders the set @ ofport- 
folios, A portfolio such that no portíolio is 
superior (i,e., a maximal portfolio in the partial 
ordering) is called efficient. The goal of the 
portfolio manager is to determine the set of effi- 
cient portfolios, from which he then makes a 
choice based on his needs, 

This is intuitively very appealing: It is based 
on standard quantities for the securities in the 
portfolio, namely expectation, standard deviation, 
and covariance (needed to compute the variance 
of the portfolio from that of the component secur- 
ities). It also gives the portfolio manager ''chaice." 

As Markowitz [16, Chapter 6] has pointed out, 
the optimal Kelly portfolio is approximately one 
of the Markowitz efficient portfolios under cer- 
tain circumstances. If E- E(P) and R- P-1 is 
the return per unit of the portfolio P, let log P 
= log (1+R) = log ((14E)+(R-E)). Expanding in 
Taylor's series about 1+E gives log P=log(1+E) 


2 
t (R-E)J(14E) - (R-E) REP t higher orderterms. 
Taking expectations and neglecting higher order 


terms gives E log P= log(1+E) - F(PV2(14EF . 

This leads to a simple pictorial relationship 
with the Markowitz theory. Consider the E-¢ 
plane, and plot (E(P), o(P)) for the efficient 
portfolios, The locus of efficient portfolios is a 
convex non-decreasing curve which includes its 
endpoints (ligure 1). 

Then constant values of the growth rate 
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FIGURE 1. GROWTH RATE G (RETURN 


RATE R)IN THE E-o PLANE ASSUMING 
THE VALIDITY OF THE POWER SERIES 
APPROXIMATION. 
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G- Elog P approximately satisfy G=log(1+E) 


- ÅP . This family of curves is illus- 
trated in Figure 1 and the (efficient) portfolio 
which maximizes logarithmic utility is (approxi- 
mately) the one which lies on the greatest G 
curve, Because of the convexity of the curve of 
efficient portfolios and the concavity of the G 
curves, the (E,c) value where this occurs is 
unique. 

The.approximation to G breaks down badly 
in some significant practical settings, including 
that of the next section, But for portfolios with 
large numbers of "typical" securities, the approx- 
imation for G will generally provide an efficient 
portfolio which approximately maximizes asset 
growth, This solves the portfolio manager's 
problem of which efficient portfolio to choose, 
Also, if he repeatedly chooses his portfolio in 
successive time periods by this criterion he will 
tend to maximize the rate of growth of his asscts, 
i.e., maximize "performance," We see also that 
in this instance the problom is reduced to that of 
finding the efficient portfolios plus the easy step 
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of using Figure 1. Thus if the Markowitz theory 
can be applied in practice in this setting, so can 
our theory. 

| — We have already remarked on the ambiguity 
of the set of efficient portfolios, and how our 
theory resolves them. To illustrate further that 
such ambiguity represents a defect in the Marko- 
witz theory, let x be uniformly distributed over 


Ll, 3], let x be uniformly distributed over [10, 
100], let cor(X,X)]* 1, and suppose these are 
the only securities. Then X and x are both 
tea : < < tot 
efficient with 9 % and E E so Markowitz 


theory does not choose between them. Yet 
"everyone" would choose x over X because the 


worst outcome with x is far better than the best 
outcome from x . (We presented this example in 


[26]. Hakansson [11] presents further examples 
and an extended analysis, He formalizes the idea 
by introducing the notion of stochastic dominance: 
X stochastically dominates Y if P(X£Y)-1 and 
P(X>¥)>0. It is easy to prove the Lemma: An 
ElogX optimal portfolio is never stochastically 
dominated, Thus our portíolio theory does not 
have this defect, ) 

There are investment universes (X, E X) 


such that a unique portíolio P maximizes ElogP 
yet P is not efficient in the sense of Markowitz, 
Hence choosing P in repeated independent trials 
will outperform any strategy limited to choosing 
efficient portfolios. In addition, the optimal 
Kelly strategy gives positive growth rate, yet 
some of the Markowitz-efficient strategies give 
negative growth rate and ruin after repeated 
trials, We gave such an example in [26] and 
another appears in [11]. See also (11, pp. 553-4] 
for further discussion of defects in the Marko- 
witz model, 


5. The theory in action: results for a real insti- 
tutional portfolio. The elements of a practical 
profitable theory of convertible hedging were pub- 
lished in [27], Thorp and Kassouf indicated an 
annualized return on investments of the order of 
25% per year. Since then the theory has been 
greatly extended and refined with most of these 
new results thus far unpublished, 

The historical data which has been used to 
develop the theory includes well over 100,000 
observations of convertibles, 

A convertible hedge transaction generally 
involves two securities, one of which is conver- 
tible into the other, Mathematical price rela- 
tionships exist between pairs of such securities. 
When one of the pair is comparatively underpriced, 
a profitable convertible hedge may be set up by 
buying the relatively underpriced security and 
selling short an appropriate amount of the rela- 
tively overpriced security, 
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The purpose of selling short the overpriced 
security is to reduce the risk in the position, 
Typically, one sells short ina single hedge from 
50% to 125% as much stock (in "share equiva- 
lents') as is held long. The exact proportions 
depend on the analysis of the specific situation; 
the quantity of stock sold short is selected to 
minimize risk, The risk (i.e., change in assct 
value with fluctuations in market prices) in a 
suitable convertible hedge should be much less 
than in the usual stock market long positions. 

The securities involved in convertible hedges 
include common stock, convertible bonds, con- 
vertible. preferreds, and common stock purchase 
warrants, Options such as puts, calls, and 
straddles may replace the convertible security. 
For this purpose, the options may be either 
written or purchased, 

The theory of the convertible hedge is highly 
enough developed so that the probability charac- 
teristics of à single hedge can be worked out 
based on an assumption for the underlying dis- 
tribution of the common, (Sometimes even this 
can almost be dispensed with! See [27, App. C]. ) 
A popular and plausible assumption is that the 
future price of the common is lognormally dis- 
tributed about its! current price, with a trendand 
a variance proportional to the time, Plausible 
estimates of these parameters are readily ob- 
tained, Furthermore, it turns out that the return 
from the hedge is comparatively insensitive to 
changes in the estimates for these parameters, 

Thus with convertible hedging we fulfill two 
important. conditions for the practical application 
of our, (or any other) theory of portfolio choice: 
(1) We have identified investment opportunities 
which are markedly superior to the usual ones. 
Compare the return rate of 2075-257, per year 
with the long term rate of 8% or so for listed 
common stocks, Further, it can be shown that 
the risks tend to be much less, (2) The proba- 
bility inputs are available for computing 
G(fs of). 

On November 3, 1969, a private institutional 
investor decided to commit all its resources to. 
convertible hedging and-to use the Kelly criterion 
to allocate its assets, The performance record 
appears in the Table. 

The market period covered included one of 
the sharpest falling markets as well as one ofthe 
sharpest rising markets (up 50% in 11 months) 
since World War II. The gain was +16,3% for the 
year 1970, which outperformed all of the approx- 
imately 400 Mutual Funds Listed in the S & P 
Stock guide, Unaudited figures show that gains 
were achieved during every single month. 

The unusually low risk in the hedged posi- 
tions is also indicated by the results for the 200 
completed hedges, There were 190 winners, 6 
break-evens, and 4 losses. The losses as a per 
cent of the long side of the specific investment 
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TABLE - PERFORMANCE RECORD 


Change Elapsed ‘Growth Rate 
To Date Time To Date 
Date (99) (months) (%) t+ 
ll- 3-69 0.0 0 --- 
12-31-69 + 4.0 2 +26.8 
9- 1-70 +14.0 10 +17.0 
12-31-70 +21.0 14 +17.7 
6-30-71% +39.9 20 +22.3 


Assets on 7-1-71 were 5,2 million, 


DJIA Starting Gain Over 
Closing Chg. Even With DIA 
DJIA* (t+  — DJIA* (%) 
855 0.0 855 0.0 
800 - 6.3 889 * 10.3 
758 -11.3 974 *25.3 
839 - 1.8 1034 +22.8 
891 + 4,2 1196 + 35,7 


Preliminary-unaudited. 


* DJIA = Dow Jones Industrial Average. 


t+ Compound growth rate, annualized, 


ranged from 1% to 15%, 

A characteristic of the Kelly criterion is 
that as risk decreases and expectation rises, 
the optimal fraction of assets to be invested in 
a single situation may become "large." On sev- 
eral occasions, the institution, discussed above 
invested up to 30% of its assets in single hedge, 
Once it invested 150% of its assets in a single 
arbitrage. This characteristic of Kelly port- 
folio strategy is not part of the behavior of most 
portfolio managers, 

To indicate the techniques and problems, we 
consider a simple portfolio with just one conver- 
tible hedge, We take as our example Kaufman 
and Broad common stock and warrants, A price 
history is indicated in Figure 2. 

Price data shows that W=.455S isareason- 
able fit for $338 and that W=S~- 21.67 isa 
reasonable fit for S244, Between S- 38 and 
S-44 we have the line W= .845-15.5. For sim- 
plicity of calculation we replace this in our illus- 
trative analysis by W=.5S if S344 and W=S-22 
if S244, The lines are also indicated in Figure 2, 

Past history at the time the hedge was insti- 
tuted in late 1970 supported the fit for S538, 
The conversion feature of the warrant ensured 
WES-21.67 until the warrant expires, Thus 
WzS-21,67 for S244 underestimates the price 
of the warrant in this region, Extensive histor- 
ical studies of warrants [12, 13, 24, 27] showthat 
the past history fit would probably be maintained 
until about two years before expiration, i.e., un- 
til about March, 1972, 'Thus it is plausible to 
assume that for the next 1,3 years S may be 
roughly approximated by W=.5S for S $44 and 
W=S-22 for S244, 

Next we assume that S, , the stock price at 


time t>Q years after the hedge was initiated, is 
lognormally distributed with density 
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£e (xo/2ny | exp[-(logx - uy /2d] , hence 
t 
mean E(S)= epp tuac 72) and standard deviation 


.9(8) = E(S) e - j^, The functions u*u(t) and 


a= g(t) depend on the stock and on the time t, If 
t is the time in years until & is realized, it is 


2 
plausible to assume U(t)=log S tmt and O(t} = at, 
where S is the present stock price and m and a 


are constants depending on the stock. For a de- 
tailed discussion, see (1, 19], 


Then E(§)= EN exp[(m+a2)t] and a mean in- 
crease of 10%/year is approximated by setting 


2 2 
mta/2z.1. lf we estimate a from past price 
changes we can solve for m, 1n the case of 
Kaufman and Broad it is plausible to take 05,45 


whence as di .20. This yields m2 0, We then 


find o(S) = 528 . 


It is by no means established that the log- 
normal model is the appropriate one for stock 
price series (7, 10]. However, once we clarify 
certain.general principles by working through 
our example on the basis of the lognormal nodel, 
it can be shown that the results are substantially 
unchanged by choosing instead any distribution 
that roughly fits observation! 

For a time of one year, a computation shows 
the return R(S) on the stock to be 410.595, the re- 
turn R(W) on the warrant to be +34,8%, c(S)- .52, 
o(W)=.92, and the correlation coefficient 
cor(S, W)=.99. The difference in R(S) and R(W) 
shows that the: warrant is a much better buy than 
the common, Thus a hedge long warrants and 
short common has.a substantial positive expec- 
tation, The value cor(S, W)=.99 shows that a 
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FIGURE 2, PRICE HISTORY OF KAUFMAN AND BROAD COMMON, S, VERSUS THE WARRANTS, W, 


W5 
The points moved up and to the right until they 40, Shetton 7 


reached the neighborhood of (38, 17). At this tm 
point a 3:2 hedge (15,000 warrants long, 
11,200 common short) was instituted, 
As the points continued to move up 

and to the right during the next 

few months, the position was 

closed out in stages with the 

final liquidation at about 

(58, 36). 


“Best Lirenr Fn 
Siege 2.278 
Correlouon 5,99 


EIS li 48,03 
WIS, i 29.66 


A 


* n t 
^U Stone =.667 Corresponding To 
45 Zeo Profit Line For Hedge 
(Before Tronsactions Costs) 


W*S-2LET : S-E 
Terms of warrant: l warrant 


+ $21.67 > ] share common 
stock until 3-1- 74, Full pro- 
tection against dilution, The 
company has the rightto reduce 
the exercise price for icmpo- 
rary periods; 750,000 warrants 
and 5, 940, 000 common out- 
standing. Common dividends 
Q.05 ex 10-26-70, 


Ó 5 10 i5 20 

hedge corresponding to the best linear fit of W common from about 40 to almost 60. 

to s "^ standard deviation of approximately 6. Concluding remarks, As xemarked above, wc 

(1-.99) ^z.1 which suggests that o(P) for the do not propose logarithmic utility as descriptive 

optimal hedged portfolio is probably going to be of actual investment behavior, nor do wi: believe 

close to .1, The high return and low risk for any one utility function could suffice, It would be, 

the hedge will remain, it can be shown, under of Interest, however, to have empirical evidence 

wide variations in the choice of m and a. showing areas of behavior which arc character- 
To calculate the optimal mix of warrants ized adequately by logarithmic utility, Neitkerdo 

Iong to common short we maximize ef, 5) we intend logarithmic utility to be predictive; 


again, it would be of interest to know what it 


=E log (£545 W) . The detailed computational does predict, 


procedures are too lengthy and involved to be We only propose the theory to be normative 
presented here, We plan to present them or prescriptive, and only for those institutions, 
elsewhere, groups, or individuals whose overriding curreat 
Our institutional investor considered posi- objective is maximization of the ratc of asset 
tions already held, some of which might have to growth, Those with a different "prime directive" 
be closed out to release assets, and also other may find that another utility function is a bctter 
current candidates for investment, The decision guide. 
was made to short common and buy warrants in We have found ElogX to be valuable as a 
the ratio of three shares to four. The initial qualitative guide and suggest that this could be 
market value of the long side was about 14% of its most important use, Once familiarity with 
assets and for the short side about 20% of assets its properties is gained, many investment de- 
The net profit, in terms of the initial market cisions can be guided by it without complex 
value of the long side, was about 2075 in six supporting calculations, 
months, This resulted from a'move in the What sort of economic bchavior can be ex- 
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pected from followers of Elog X ? Insurance is’ 
"explained," i.e., even though it is a negative 
expectation investment for the insured and we 
assume both insurer and insured have the same 
probability information, it is often optimal for 
him (as well as for the insurance company) toin- 
sure [3). It usually turns out that insurauce 
against large losses is indicated and insurance 
against small losses is not, (Don't insure an old 
car for collision, take $200 deductible, not $25, 
etc.) 

We find that if all parties to a security trans- 
action are followers of ElogX they will often 
find it mutually optimal to trade, This may be 
true whether the transactions be two party (no 
brokerage), or three party (brokerage), and 
whether or not the parties have the same proba- 
bility information about the security involved, or 
even about the entire investment universe. 

Maximizing logarithmic utility excludes port- 
folios which have positive probability of totalloss 
of assets, Yet it can be argued that an impover- 
ished follower of Elog X might in some instances 
risk "everything." This agrees with some ob- 
served behavior, but is not what we might at first 
expect in view of the prohibition against positive 
probability of tetal loss. But consider each indi- 
vidual as a piece of capital equipment with an 
assignable monetary value. Then if he risks and 
loses all his cash assets, he hasn't really lost 
everything [3]. 

All of us behave as though death itself does 
not have infinite negative utility. Since the risk 
of death, although generally small, is ever pre- 
sent, a negative infinite utility for death would 
make all expected utilities negative infinite and 
utility theory meaningless. In the case of loga- 
rithmic utility as applied to the extended case of 
the (monetized) individual plus all his resources, 
death should be assigned a finite, though large 
and negative, utility. The value of this "death 
constant" is an additional arbitrary assumption 
for the enlarged theory of logarithmic utility. 

1n the case of investors who behave accord- 
ing to Elog X (or other utilities unbounded below), 
it might be possible to discover their tacit death 
constants." 

Hakansson [11, p. 551] observes that loga- 
rithmic utility exhibits decreasing absolute risk 
aversion in agreement with deductions of Arrow 
and others on the qualities of "reasonable'' utility 
functions. Hakansson says, "What the relative 
risk aversion index [given by -xU "[xyU'(x)) would 
look like for a meaningful utility function is less 
clear,.. In view of Arrow's conclusion that 
!,., broadly speaking, the relative risk aversion 
must hover around 1, being, if anything, some- 
what less for low wealths and somewhat higher 
for high wealths ...'the optimal growth model 
seems to be on safe ground, " As he notes, for 
U(x)-logx, the relative risk aversion is pre- 
cisely |. However, in both the extension to 
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valuing the individual as capital equipment, and 
the further extension to include the death constant, 
we are led to U(x)=log(x+c) where c is positive. 
But then the relative risk aversion index is 
x/(xtc) which behaves strikingly like Arrow's 
description, See also the discussion of U(x) 
zlog(x*c) in [8, p. 103, p. 112]. 

Morgenstern [17] has forcefully observed 
that assets are random variables, not numbers, 
and that economic theory generally does not in- 
corporate this, To replace assets by their ex- 
pected utility in valuing companies, portfolios, 
property and the like, allows for comparisons 
when asset values are given as random variables, 
We think logarithmic utility will often be eppro- 
priate for such valuation. 

I wish to thank James Bicksler for several 
stimulating and helpful conversations. 
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1 : ie Sv 

Information on descriptive utility is sparse; how 
many writers on the subject have even been able 
to determine for us their own personal utility ? 


2 
Parenthetical explanation added since we have 
used n, 


"Almost certainly" and "almost surely' are 
synonymous, 


4rhe proof of this theorem, and some further re- 
sults obtained with R, Whitley, wlll appear else- 
where. 
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OPTIMAL INVESTMENT AND CONSUMPTION STRATEGIES 
UNDER RISK FOR A CLASS OF UTILITY FUNCTIONS! 


By Nits H. HAKANSSON? 


This paper develops a sequential model of the individual's economic decision problem 
under risk. On the basis of this model, optimal consumption, investment, and borrowing- 
lending strategies are obtained in closed form for a class of utility functions. For a subset of 
this class the optimal consumption strategy satisfies the permanent income hypothesis 
precisely. The optimal investment strategies have the property that the optimal mix of risky 
investments is independent of wealth, noncapital income, age, and impatience to consume. 
Necessary and sufficient conditions for long-run capital growth are also given. 


l. INTRODUCTION AND SUMMARY 


THIS PAPER presents a normative model of the individual's economic decision 
problem under risk. On the basis of this model, optimal consumption, investment, 
and borrowing-lending strategies are obtained in closed form for a class of utility 
functions. The model itself may be viewed as a formalization of Irving Fisher's 
model ofthe individual under risk, as presented in The Theory of Interest [4] ; at the 
same time, it represents a generalization of Phelps' model of personal saving [10]. 

The various components of the decision problem are developed and assembled 
into a formal model in Section 2. The objective of the individua! is postulated to 
be the maximization of expected utility [rom consumption over time. His resources 
are assumed to consist of an initial capital position (which may be negative) and 
a noncapital income stream which is known with certainty. The individual faces 
both financial opportunities (borrowing and lending) and an arbitrary number of 
productive investment opportunities. The returns from the productive opportuni- 
ties are assumed to be random variables, whose probability distributions satisfy 
the "no-easy-money condition." The fundamental characteristic of the approach 
taken is that the portfolio composition decision, the financing decision, and the 
consumption decision are all analyzed simultaneously in one model. The vehicle 
of analysis is discrete-time dynamic programming. 

In Section 3, optimal strategies are derived for the class of utility functions 

$4 057 ufe), 0 < a < 1, where c; is the amount of consumption in period j, such 
that either the relative risk aversion index, —cu’(c)/u'(c), or the absolute risk 

! This paper was presented at the winter meeting of the Econometric Society, San Fraricisco, 
California, December, 1966. 

? This article is based on my dissertation which was submitted to the Graduate School of Business 
Administration of the University of California, Los Angeles, in June, 1966. [ am greatly indebted to 
Professors George W. Brown (committee chairman), Jacob Marschak, and Jacques Dréze for many 
valuable suggestions and comments and to Professors Jack Hirshleifer, Leo Breiman, James Jackson, 


and Fred Weston for constructive criticisms. | am also grateful to the Ford Foundation for financial 
support over a three-year period. 
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aversion index, —u"(c)/u(c), is a positive constant for all c 2 0, ie, uc) = c’, 
Ocycl,uc)- —c?",y»0,u(c) = loge, and (c)  —e *,y > 0. 

Section 4 is devoted to a discussion of the properties of the optimal consumption 
strategies, which turn out to be linear and increasing in wealth and in the present 
value of the noncapital income stream. In three of the four models studied, the 
optimal consumption strategies precisely satisfy the properties specified by the 
consumption hypotheses of Modigliani and Brumberg [9] and of Friedman [5]. 
The effects of changes in impatience and in risk aversion on the optimal amount to 
consume are found to coincide with one's expectations. In response to changes in 
the "favorableness" of the investment opportunities, however, the four models 
exhibit an exceptionally diverse pattern with respect to consumption behavior. 

The optimal investment strategies have the property that the optimal mix of 
risky (productive) investments in each model is independent of the individual's 
wealth, noncapital income stream, and impatience to consume. It is shown in 
Section 5 that the optimal mix depends in each case only on the probability dis- 
tributions of the returns, the interest rate, and the individual's one-period utility 
function of consumption. This section also discusses the properties of the optimal 
lending and borrowing strategies, which are linear in wealth. Three of the models 
always call for borrowing when the individual is poor while the fourth model 
always calls for lending when he is sufficiently rich. The effect of differing borrowing 
and lending rates is also examined. 

Necessary and sufficient conditions for capital growth are derived in Section 6. 
It is found that when the one-period utility function of consumption is logarithmic, 
the individual will always invest the capital available after the allotment to current 
consumption so as to maximize the expected growth rate of capital plus the present 
value ofthe noncapital income stream. Finally, Section 7 indicates how the preced- 
ing results are modified in the nonstationary case and under a finite horizon. 


2. THE MODEL 


In this section we shall combine the building blocks discussed in the previous 
section into a formal model. The following notation and assumptions will be 
employed: 


cj: amount of consumption in period j, where c, > 0 (decision variable). 
Ule, Cz 05. . ): the utility function, defined over all possible consumption programs (c,, C3, c,, .... 
The class of functions to be considered is that of the form 


(1) Uli Ca 0s...) = u(c)) + aU(es ey 64s...) 


a 
= J a'u), 0«a«l. 
jel 
It is assumed that u(c) is monotone increasing, twice differentiable, and strictly concave for c > 0. 
The objective in each case is to maximize E[U(c,,c2,...)], ie., the expected utility derived from con- 
sumption over time.3 
x;:amount of capital (debt) on hand at decision point j (the beginning of the jth period) (state variable). 
y: income received from noncapital sources at the end of each period, where 0 < y « oo. 


? While we make use of the expected utility theorem, we assume that the von Neumann- Morgenstern 
postulates (12] have been modified in such a way as to permit unbounded utility functions. 
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M: the number of available investment opportunities. 

S: the subset of investment opportunities which it is possible to sell short. 

Z4: amount invested in opportunity i, i = 1,..., M, at the beginning of the jth period (decision 
variable). 

r — |: rate of interest, where r > 1. 

B, : transformation of each unit of capital invested in opportunity i in any period j (random variable); 
that is, if we invest an amount 6 in iat the beginning ofa period, we will obtain 8,0 at the end of that period 
(stochastically constant returns to scale, no transaction costs or taxes). The joint distribution functions 
of the B, i = 1,..., M, are assumed to be known and independent with respect to time j. The (£,) have 
the following properties: 


(2) Bir, 

(3) Oxf,«o (i = 2,....M), 
M 

(4) Pr{ E Ø- na <0} >0, 
1-2 


for all finite 8, such that 0, > O for all i£ S and 9, # O for at least one i. 

f(xj: expected utility obtainable from consumption over all future time, evaluated at decision 
point j, when capital at that point is x, and an optimal strategy is followed with respect to consumption 
and investment. . 

Y: present value at any decision point of the noncapital income stream capitalized at the rate of 
interest, ie, Y — y/(r — 1) 


D = (v3, ..-, vy): a vector of real numbers. 
M 

hö) = E Eeer) 
i=2 


k : maximum of h(d) subject to (27) and (28) (see (26)). 
0* : vector 0 which gives maximum k of h(ò) (sce (26)). 
M 


v= ya. 
ial 
c*(x): an optimal consumption strategy. 
z,(x): an optimal lending strategy. 
zi(x): an optimal investment strategy for opportunity i, i = 2,..., M. 


$5; x, + Y. 


The limitations of utility functions of the form (1) are well known and need not 
be elaborated here. Condition (4) will be referred to as the ‘tno-easy-money 
condition." In essence, this condition states (i) that no combination of productive 
investment opportunities exists which provides, with probability 1, a return at 
least as high as the (borrowing) rate of interest ; (ii) that no combination of short 
sales existsin which the probability is zero that a loss will exceed the (lending) rate of 
interest; (iii) that no combination of productive investments made from the 
proceeds of any short sale can guarantee against loss. For these reasons, (4) may be 
viewed as a condition that the prices of the various assets in the market must 
satisfy in equilibrium. 

Consumption and investment decisions.are assumed to be made at the beginning 
of each period. The amount allocated to consumption is assumed to be spent 
immediately or, if spent gradually over the period, to be set aside in a nonearning 
account. We also assume that any debt incurred by the individual must at all times 
be fully secured, i.e., that the individual must be solvent at each decision point. 
In view of the "no-easy-money condition" (4), this implies that his (net) debt 
cannot exceed the present value, on the basis of the (borrowing) rate of interest, of 
his noncapital income stream at the end of any period. 
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We shall now identify the relation which determines the amount of capital (debt) 
on hand at each decision point in terms of the amount on hand at the previous 
decision point. This leads to the difference equation 


5 M 

6) xai = TZ + DY Byty (j=1,2,...) 
i=2 . 

where 

6 M 

K È nu-2x-c6 (j=1,2,..) 


i=1 

The first term of (5) represents the payment of the debt or the proceeds from 

savings, the second term the proceeds from productive investments, and the third 
term the noncapital income received. Combining (5) and (6) we obtain 


(7) < , 
Xj = Y Bi -— na, + rix -—¢) + y (j= 1,2,...). 
i=2 
This is the difference equation, then, which governs the process we are about to 
study. 
The definition of f{x,) may formally be written 
(8) fx) = max E[U(c, c), o Ejaz J]lxj- 
From (1) we obtain, by the principle of optimality,* for all j, 
(9) ffxj = max E[u(cj) + «(max ELU (cjs , €). 2. Aea Hx 


since we have assumed the (f) to be independently distributed with respect to 
time j. By (8), (9) reduces to 
(10) ffxj) = max (u(c) + Elf s(x}, allj. 


Since by our assumptions we are faced with exactly the same problem at decision 
point j + 1 as when we are at decision point j, the time subscript may be dropped. 
Using (7), (10) then becomes 


(11) f(x)2m ILE t Zh (B; — rjz,+ r(x — c) + jJ) 


a 
ett 
subject to 
(12) c 2 0, 
(13) 2,20, i£S, 
and 


M 
(14) PX nier dero cr) ui 
i22 


at each decision point. Expression (14), of course, represents the solvency constraint. 


* The principle of optimality states that an optimal strategy has the property that whatever the initial 
state and the initial decision, the remaining decisions must constitute an optimal strategy with regard 
to the state resulting from the first decision (2, p. 83]. 
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For comparison, the model studied by Phelps [10] is given by the functional 
equation 


05) — f(x) = max {u(c) + «EL (f(x — c) + y)}}. 

Osc&x 
In this model, all capital not currently consumed obeys the transformation fj, which 
is identically and independently distributed in each period. Since the amount 
oe x — c, is determined once c is known, (15) has only one decision variable 
c) 

Since x represents capital, f(x) is clearly the utility of money at any decision 
point j. Instead of being assumed, as is generally the case, the utility function of 
money has in this model been induced from inputs which are more basic than the 
preferences for money itself. As (11) shows, f(x) depends on the individual's 
preferences with respect to consumption, his noncapital income stream, the interest 
rate, and the available investment opportunities and their riskiness. 


3. THE MAIN THEOREMS 


We shall now give the solution to (11) for the class of one-period utility functions 


1 
(16) u(c) = —c’, O<y<1 (Model I); 
y 
] 
(17) u(c) = y^ «0 (Model II); 
(18) u(c) = log c, (Model III); 
(19) u(c) = -e™, y>0 (Model IV). 
5 Phelps gives the solution to (15) for the utility functions u(c) = c',0 < y < 1, u(c) = —c 7.7 <9, 
and for u(c) = loge when y = 0. Unfortunately, this solution is incorrect in the general case, i.e., 
whenever y > 0 and the distribution of f is nondegenerate. For example, when ufc) = =c", the 


solution is asserted to be, letting B = E(fi ^], 
(ay Mart qr y -y 
(15a) f(x) = — pi E +p wy] s 


d$) etx) = [1 - ane] + p] 


whenever af « 1. But for this to be a solution, it would be necessary that one be able to write 


(15c) E[f(x — e) + y "1 = E -et eub 


which is clearly impossible unless the distribution of f is degenerate or y = 0 or both. The right side 
of (15c) may, of course, be regarded as a first-order approximation of the left side when the variance of 
f is small, but this negates the presence of uncertainty. In fact, the preceding solution holds even under 
certainty only when «f > 1 and x > [(aJ)- !* * 9 — 1]y/(] — 1) i.e., when c(x) is less than or equal to 
x in all future periods. 

It appears that an analytic solution to (15) does not exist when y > O and the distribution of f is 
nondegenerate. It is ironic, therefore, that when one generalizes Phelps' problem by introducing the 
possibility of choice among risky investment opportunities and thc opportunity to borrow and lend 
(see (11)), an analytic solution does exist (as will be shown). It is the second of these generalizations which 
guarantees the solution in closed form. 
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Pratt (11] notes that (16)-(18) are the only monotone increasing and strictly 
concave utility functions for which the relative risk aversion index 


u" (c) 

u'(c) 

is a positive constant and that (19) is the only monotone increasing and strictly 
concave utility function for which the absolute risk aversion index 


Q0) q*(c) = — 


u'(c) 
~ ue) 


is a positive constant.® 


Q1) q(c) = 


THEOREM 1 : Let u(c), a, y, r, (B,), and Y be defined as in Section 2. Then, whenever 
u(c) is one of the functions (16)-(18) and ky < 1/a in Model I, a solution to (11) 
subject to (12)-(14) exists for x > — Y and is given by 


(22) f(x) = Au(x + Y) - C, 

Q3 — eX(x) = Bx + Y) — 

(24) zi(x) = (1 — B)(1 — v*)(x + Y) — Y, 

Q5 (x) = (1 — Bix Y) (= 2,..., M) 


where the constants v; (V = $., vj) and k are given by 


ZE Y (f — ry + j| 

(22 
M 

max p| u Y (B, — nv, + Jl. 
12 


vi) 


(26)  k 


subject to 
(27) u20, i¢sS, 
and 


M 
(28) PG mer» Oh = 1, 


{a2 


and the constants A, B, and C are given by 
(i) in the case of Models I-I], 


A = (1 — (aky) Ory?" 
Q9) Bal — (aky) "t-n, 
C = 0; 


* The underlying mathematical reason why solutions are obtained in closed form (Theorems 1 and 2) 
for the utility functions (16)-(19) is that these functions are also the only (monotone increasing and 
strictly concave utility function) solutions (see (8]) to the functional equations u(xy) = (x)w(y). 
u(xy) = v(x) + wy), u(x + y) = vixjw(y, and u(x + y) = v(x) + w(y, which are known as the 
generalized Cauchy equations (1, p. 141]. 
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(it) in the case of Model III, 
a=, 
l-«@ 
(30) B=1-a, 
a log a ak 
C= log (1 — . 
Pog BU M+ et may 


Furthermore, the solution is unique. 


In proving this theorem, we shall make use of the following lemma and corol- 
laries. 


LEMMA : Let u(c), {B;}, and r be defined as in Section 2 and let à = (v4, ..., Um) 
be a vector of real numbers. Then the function 


M 
Y (B, - rv; + r) | 


(31) TE 
subject to the constraints 
(27) v 20, i¢S, 
and 
M 
(28) PIG -merRO =i, 
i=? 


has a maximum and the maximizing v; (= vj) are finite and unique. 


PROOF: Let D be the (M — 1)-dimensional space defined by the set of points v 
which satisfy (27) and (28). We shall first prove that the set D is nonempty, closed, 
bounded, and convex, and that h is strictly concave on D.' 

The nonemptiness of D follows trivially from the observation that 
5? = (0,0,...,0) isa member of D. By the boundedness of the fj;'s and of r ((2) and 
(3)), there exists a neighborhood of i? in relation to D. That is, there is a neighbor- 
hood of points »' such that 


M 
PY mi +r ol = | 
i21 
where v; > 0 for all i¢ S. 


Now consider the point 5^ = 59? + Ab’ = Ab’ where A > 0 and 0” is one of the 
points in this neighborhood. Let b(») be the greatest lower bound on b such that 


Pr{ Y (B, — rw» < 2 0. 
i-a 


? The author gratefully acknowledges a debt to Professor George W. Brown for several valuable 
suggestions concerning the proof of the closure and the boundedness of D. 
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By the "no-easy-money condition" (4) b(») —r for ?5'eD,b(5?) = 0, and 
b(b) < Oforallà # 5°. Applying the ‘‘no-easy-money condition" with respect to the 
point ?? and using the inequality 


M 
Zb (f, = nan < 28} >0, 
ia2 


we obtain that Ab(b) = b(40'). But when 4b(5') < —r,or A > —r/b(V), the point p? 
cannot lie in D since 1 > —r/b(i*) implies that 


M 
pf È naj tre of <1. 


iz? 
Thus, 4, = —r/b(č') is the greatest lower bound on 4 such that i^ ¢ D. Since 
Agb(b) = —r, i^? e D and is in fact the point farthest from i? lying on the line 


through 5? and ?' and belonging to D. 

We shall only sketch the remainder of the proof establishing the closure and 
boundedness of D. Let 6 # 0° be the limit of a sequence of points 5" e D. Since each 
point in the sequence belongs to D, b(5) > — r for all n. It can now be shown, by 
utilizing the fact that ZM, (f, — r)i; is continuous at any v # D°, uniformly with 
respect to the B,’s on any bounded set, that lim, , ,, b(v'”) < b(v), which implies that 
ùe D. Consequently, D must be closed. 

The boundedness of D is established as follows. Let S, be the set of points v such 
that [0| = R > 0. S, is then clearly both closed and bounded. If D' = D A Sp is 
empty, the boundedness of D follows immediately. Let us therefore assume that 
D' is nonempty ; in this case D’ is also bounded and closed since D is closed and Sp 
is bounded and closed. If vis a limit point of the sequence <b such that 0” e D', 
we must have that v € D’ since D' is closed. But b(v) < 0 by the "no-easy-money 
condition” (4), since v + i? by assumption. Therefore, since we already have that 
lim,.. . (6) < b(»), 0 cannot be a limit point to the sequence <b(i)>, 5 e D'. 
Consequently, b(») for 6 D' is bounded away from zero, which implies that D 
must be bounded. 

To prove convexity, let 6” and »" be two points in D. Then, for any 0 < A < 1, 


M 
e| Y (B, - às + ar > of =l, 
i=2 
and 
M 
Pr Y (B-n — àw + (1 -— Ay > o} =1, 
1=2 
which implies 
M 
Prd Y (i - nx + (1 - Aoi") + r > o} =1, 
i=2 


so that Av" + (1 — Ay?" e D. Thus, D is convex. 
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Ww, = Y B-ri r (n= 1,2). 
Then 
(32) h(Av' + (1 — A?) = E[wAW, + (1 — A)94)] 
and 
(33) Aho) + (1 — ANA?) = AE(u(9,)] + (1 — A)E(u(9W,)]. 


For every pair of values w, # w, of the random variables #, and w, such that i! 
and i? e D, we obtain, by the strict concavity of u, 


(34) u(Aw, + (1 — A)w,) > Au(w,) + (1 — Auw), 0«A« I. 
Consequently, (34) implies 
E[u(A9, + (1 — 49,)] > AE[u(9,)] + (1 — AE[u(V,)]], $1 ¥ dF ED, 
0<å<l, 


which, by (32) and (33), in turn implies that h is strictly concave on D. 

Since our problem has now been shown to be one of maximizing a strictly 
concave function over a nonempty, closed, bounded, convex set, it follows directly 
that the function h has a maximum and that the v; are finite and unique, 


A number of corollaries obtain from this lemma which we shall also require in the 
proof of Theorem 1. 


COROLLARY 1: Let u(c), (f), and r be defined as in the Lemma. Moreover, let u(c) 
be such that it has no lower bound. Then the vj which maximize (31) subject to (27) and 
(28) are such that 


M 
Pr} È (i — not +r > of =|. 
i=? 


The proof is immediate from the observation that h + — oc as the greatest lower 
bound on b such that Pr ZA , (8; — r)v; + r < b} > 0 approaches 0 from above. 


COROLLARY 2: Let u(c), {8;}, and r be defined as in the Lemma. Then the maximum 
of the function (31) subject to the constraints (27) and (28) is greater than or equal to 
u(r). 


PROOF: When v; = 0 for all i, which is always feasible, we obtain by (31) that 
h = u(r). 


COROLLARY 3: Let u(c), {B;}, and r be defined as in the Lemma. Moreover, let u(c) 
be such that u(c) < b. Then the vectors ù which satisfy (27) and (28) are such that 


h(v) = 2Ep (B, — rw, + J| « b. 
i=2 
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The proof is immediate from the observation that u(c) is monotone increasing 
and that r, (f), and the feasible v, are bounded. 

We are now ready to prove the theorem. The method of proof will be to verify 
that (22)-(25) is the (only) solution to (11).? 


PROOF OF THEOREM 1 FOR Monpes I-11 : Denote the right side of(11) by T(x) upon 
inserting (22) for f(x). This gives, for all decision points j, 


M 
(35) T(x) = max [5e 4 a(1 — (gt yia Y (Bi — r)z 
AET i22 


J 


+x- ce)+y+Y 


subject to 
(12) c 20, 
(13) 420, i¢€S, 
and 
M 
(14) Pri S B-a- tyto- 1) > 0} ur. 
i=2 
Since (14) may be written 
M 
pr} $ (8, — r)yz + r(x + r-a»oj- 1, 
i=2 


it follows from the “‘no-easy-money condition" (4) that (14) is satisfied if and only if 
either 

(36) s—c=0 

and 

(37) z= 0 (i =2,...,M), 


M 
(39) | Y B- ns co +r > ob = 1, 
ix2 


where s z x 4- Y. 
Under feasibility with respect to (14), we then obtain 


1 
(40) Tix) = max zs To}, 0«y«1, 
max(—oo,T(x), y «0, 


* A proof based on the method of successive approximations may be found in [7). 
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where 


(41) T(x) = slie + a(l — (aky) 0Y Ys — cy 


i| AÀ y 
TEPORE 
Y\i=2 


subject to (12), (38), (39), and 
(42) z/(s—c)>0, i¢sS, 


since (42) is equivalent to (13) in view of (38). But by (31) the expectation factor in 
(41) may be written 


(43) h(zj/(s — c),..., zu/(s — c) 

and (26), the Lemma, and Corollary 2 give 

(44) ky 2r'»0 (Model J), 

(45) ky <r’ <1 (Model II), 

while (26), the Lemma, and Corollary 3 give 

(46) ky > 0 (Model II). 

Thus, 6T/dh > 0 always in Model II and in Model I whenever 


1 
(47 ky <= 
a 


under feasibility. When ky > 1/a in Model I, T(x) does not exist; when ky = 1/a, 
(41) and (40) give T (x) = (1/y)s? x f(x). Consequently, it remains to consider the 
case when ôT /dh > 0. 

Since the maximum of (43) subject to (42) and (39) is k by (26) and the Lemma, 
we obtain by the Lemma that the strategy 


or 
(48) z; = v(s — c) (i = 2,..., M) 
is optimal and unique for every c which satisfies (12) and (38) when (38) holds. 
It is clearly also optimal when (36) and (37) hold. Consequently, (40) reduces to 
1 

(49) T(x) = max l5? + ak(1 — (aky)!/G -P (s — ort. 

Os£c&s|Yy 
Since u(c) is strictly concave and w'(0) = œ in Models I and II, T(x) is strictly 
concave and differentiable with an “interior” unique solution c*(x) whenever 

, 0 (Modell), 

_ Uaü-yy-1 
60) akl (4k) "Y | <o (Model I), 
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and s > Q. In this case, setting dT/dc = 0 and solving for c, we get, 
c'^! — aky(1 — (aky) t y (s — cy! = 0 
Or 
(51) c*(x) = (1 — (xky)!! ?)(x + Y). 
In Model I, (50) is satisfied whenever (44) and (47) hold ; as noted earlier, no 
solution exists in Model I for those cases in which ky > 1/a. In Model II, (50) is 


always satisfied as seen from (45) and (46). 
Inserting (51) in (48) we obtain 


z(x) = (aky)! ""vi(x + Y) (i = 2,..., M) 
and (24) follows from (6) upon insertion of c*(x) and the z;(x). T(x) now becomes, 
upon insertion of c*(x) in (49), 


T(x) = “(1 — (aky) -Y»ys! + ak(1 — (aky)! 7» 7 1e? (gkyy/ 70 


= “(I — (aky) 0 ts? 


= f(x) 
and the solution clearly exists for s > 0 or 
(52) x,2 —Y. 


Since (52) is an induced constraint with respect to period j — 1, it remains to be 
verified that (52) is either redundant or not effective in period j — 1. Because (52) 
is already present in period j — 1 through (14), the induced constraint (52) is 
redundant, which completes the proof. 


PROOF OF THEOREM | FOR Mope IIT: Denote the right side of (11) T(x) upon 
inserting (22) for f (x). This gives, for all decision points j, 


T(x) = max {loge + peus 


e,lzi) 


M 
n (B; — zi 


[s 


a a? loga atk 
log (1 — — P p —— 
Ja 08! 0+ 1 ataa 


+rx—ec)+ y+ Y 


where 


K= 


subject to (12), (13), and (14). By the reasoning for Models I and II, we obtain 
(53) T(x) = max ( — oo, T(x)} 
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where 


a 


(54) T(x) = sup fiog c+ log (s — c) 


c.l 1—a 


a M 
+ pues | È (B; - Nes e) +r 
l—a i22 


In +K 
subject to (12), 
(38) s—c»0, 
(42) z/(s — c) 20 (i 25 2,...,M), 
and 
M 
(39) Zb (f, - nae - 0) + r > 0) = l. 
iz 
By (31), the next to last term in (54) can be written 


G5 — Le = 6), zufs — 0) 


where 07/dh > 0. Since the maximum of (55) subject to (42) and (39) is (ak/1 — a) 
by (26) and the Lemma, we obtain from the Lemma that 


(48) z(x) = v(x + Y - c) (i = 2,...,M) 


is optimal and unique for every c which satisfies (12) and (38). Thus, (53) reduces, 
in analogy with Models I and II, to 


a 


(56) T(x) = max {loge + 1 log (s — c) + — + x} 


O€c€s — Q 


where T(x) always exists sinceO < a < 1; furthermore, T(x) is strictly concave and 
differentiable. Setting dT /dc = 0 we obtain 


(57) c*(x) = (1 — a)(x + Y), 
zi(x) = avj(x + Y) (i = 2,..., M), 
and (24), al! unique. Inserting (57) into (56) gives 


a a 


T(x) = log(1 — a) + logs + i mL +; —, logs 
ak a a? loga a?k 
*i-a*rzQ4098079*4 7a tuc 


= f(x). 
Since f (x)exists for x, 2 — Y, which as an induced constraint with respect to period 
j — 1 is made redundant by (14) for that period, the proof is complete. 
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When y = 0, the solution to (11) reduces to 
f(x) = Au(x) + C, 


c*(x) = Bx, 
zY(x) = (1 — BY — v*)x, 
z(x) = (1 — Byix (i = 2,..., M). 


But then, letting 5 = x + Y, 
f(s) = Au(s) + C, 


c*(s) 2 Bs, 
z,(s) = (1 — B)(1 — v*)s, 
zs) (1 — Byis (i = 2,...,M). 


As a result, except for z}(x + Y), the solution to the original problem is not altered 
when the individual, instead of receiving the noncapital income stream in install- 
ments, is given its present value Y in advance. Thus, instead of letting x be the state 
variable when there is a noncapital income, one could let x -- Y bethe state variable 
(pretending there is no income), as long as Y is deducted from zj(x + Y). 

Note that it is sufficient, though not necessary, for a solution not to exist in 
Model I that r” > 1/æ (Corollary 2). 


THEOREM 2: Let a, (f;), r, y, and Y be defined as in Section 2. Moreover, let u(c) = 
—e^* for c > 0 where y > 0. Then a solution to (11) subject to (12)-(14) exists for 
x > —Y + [r/((r — 1)*)] log (—akr) and is given by 


(58) f(x) = —— (~ akr) n7 Me “Ir tiae m 
r—1 , 


r-i 


(59) c*(x) = log ( — akr), 


; (+ Y-- 


l 
yr-D 
(60 ze X 22,4 Lea (make) = rot 

ror yr — 1) 
6D z= xp (i= 2,..., M), 


where the constants k and v; (v* = X1, vj) are given by 

(62) k = E[—e715:0:7 1] = max E[—e Hf:0:77"] subject to (27) 
(vi) 

provided that 


(63) log (— akr) + (0*) 2 0 
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where b(*) is the greatest lower bound on b such that 


M 
zb (B, — rw; «uj 20 


i=? 


- . . è . . 
and v* = (v},..., Dy). Moreover, the solution is unique. 


Since the conditions under which Theorem 2 holds are quite restrictive, the 
reader is referred to [7] for the proof. Condition (63) insures that the individual’s 
capital position x is nondecreasing over time with probability 1 ; it must hold for a 
solution to exist in closed form. The condition ar > 1 is a necessary, but not 
sufficient, condition for (63) to be satisfied.? 


4. PROPERTIES OF THE OPTIMAL CONSUMPTION STRATEGIES 


. Ineach of the four models we note that the optimal consumption function c*(x) 
is linear increasing in capital x and in noncapital income y. Whenever y > 0, posi- 
tive consumption is called for even when the individual's net worth is negative, as 
longas it is greater than — Y in Models I-III and greater than — Y + [r/(y(r — 1)?)] 
log (— akr) in Model IV. Only at these end points would the individual consume 
nothing. 

Since x + Y may be viewed as permanent (normal) income and consumption is 
proportional (0 « B « 1) to x + Y in Models I-III, we see that the optimal con- 
sumption functions in these models satisfy the permanent (normal) income 
hypotheses precisely [9, 5, 3]. 

In each model, c*(x) is decreasing in a. Thus, the greater the individual's im- 
patience 1 — a is, the greater his present consumption would be. This, of course, is 
what we would expect. 

By (20) and (21), the relative and absolute risk aversion indices of Models I-IV 

- are as follows: 


q*(c) 1— y (Models I-II), 


q*()-21 (Model III), 
q(c) = y (Model IV). 
* For example, when ufc) = — e^ :999'*, a = .99, y = $10,000, r = 1.06, M = 2, and fl; assumes each 


of the values .96 and 1.17 with probability .5, a solution exists for x > $— 22,986. For selected capital 
positions, the optimal amounts to consume, lend, and invest in this case are as follows: 


x c*(x) zx) zx) 
$ —22,986 0 $— 102,488 $79,502 
0 $ 1,30) — 80,803 79,502 
50,000 4,31 - 33,633 79,502 
100,000 6,961 13,537 79,502 
500,000 29,601 390,897 79,502 
1,000,000 57,901 862,597 79,502 


The maximum loss in each period from risky investment is $3,180. 
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In Models I-II, we obtain 


xe cayo [Menit = yl Ten e 
al = y) e kyl — y) ü -» 


where d(ky)/d(1 — y) is negative whenever b(5*) > 1 — r; otherwise the sign is 
ambiguous. Since ky > 0 and «ky < 1, the sign of (64) is ambiguous in both cases; 
i.e., a change in relative risk aversion may either decrease or increase present 
consumption. In Model IV, on the other hand, c*(x) is increasing in y; i.e., a more 
risk averse individual consumes more, ceterus paribus. 

From (26) and (62) we observe that k is a natural measure of the “favorableness”’ 
of the investment opportunities. This is because k is a maximum determined by 
(the one-period utility function and) the distribution function (F); moreover, F 
is reflected in the solution only through k, and f(x) is increasing in k. Let us examine 
the effect of k on the marginal propensities to consume out of capital and non- 
capital income. 

Equation (29) gives 


9B sy (dyyt-»} 

êk y-1 
Thus, we find that the propensity to consume is decreasing in k in the case of 
Model I. This phenomenon can at least in part be attributed to the fact that the 
utility function is bounded from below but not from above ; the loss from postpone- 
ment of current consumption is smal] compared to the gain from the much higher 
rate of consumption thereby made possible later. In Model IT, on the other hand, 
where the utility function has an upper bound but no lower bound, the optimal 
amount of present consumption is increasing in k, which seems more plausible from 
an intuitive standpoint. 

In Model III, we observe from (30) the curious phenomenon that the optimal 
consumption strategy is independent of the investment opportunities in every 
respect. While the marginal propensity to consume is independent of k in Model IV 
also, the level of consumption in this case is an increasing function of k as is apparent 
from (59). We recall that the utility function in Modellilis unbounded while that in 
Model IV is bounded both from below and from above. Thus, the class of utility 
functions we have examined implies an exceptionally rich pattern of consumption 
behavior with respect to the "favorableness" of :ne investment opportunities. 


(64) 


<0 (Model D), 
> 0 (Model IT). 


5. PROPERTIES OF THE OPTIMAL INVESTMENT AND BORROWING-LENDING STRATEGIES 


The properties exhibited by the optimal! investment strategies are in a sense 
the most interesting. Turning first to Model IV, we note that the portfolio of 
productive investments is constant, both in mix and amount, at all levels of wealth. 
The optimal portfolio is also independent of the noncapital income stream and the 
level of impatience 1 — « possessed by the individual, as shown by (61) and (62). 

Similarly, we find in Models I-III that, since for all im > 1,2;(x)/z'{x) = vi/v,, 
(which is a constant), the mix of risky investments is independent of wealth, 
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noncapital income, and impatience to spend. In addition, the size of the total 
investment commitment in each period is proportional to x + Y. We also note that 
when y = 0, the ratio that the risky portfolio X% , z;(x) bears to the total portfolio 
EM , zi(x) is independent of wealth in each model. 

In summary, then, we have the surprising result that the optimal mix of risky 
(productive) investments in each of Models I-IV is independent of the individual's 
wealth, noncapital income stream, and rate of impatience to consume ; the optimal 
mix depends in each case only on the probability distributions of the returns, the 
interest rate, and the individual's one-period utility function of consumption. 

Ineach case, we find that lending is linear in wealth. Turning first to Models I-III, 
we find that borrowing always takes place at the lower end of the wealth scale; 
(24) evaluated at x = — Y gives — Y « Oasthe optimal amount to lend. From (24) 
we also find that zj(x) is increasing in x if and only if 1 — v* > 0 since 1 — B is 
always positive. As a result, the models always call for borrowing at least when the 
individual is poor; whenever 1 — v* > 0, they also always call for lending when he 
is sufficiently rich. 

In Model IV, we observe that lending is always increasing in x. Thus, when an 
individual in this model becomes sufficiently wealthy, he will always become a 
lender. At the other extreme, when x is at the lower boundary point of the solution 
set, he will generally be a borrower, though not necessarily, since z1(x) evaluated 
at x = — Y + [r/(y(r — 1)*)] log (— akr) gives 
r log ( — akr) rv* 

Xr-1?  yr-1 
which may be either negative or positive. 

We shall now consider the case when the lending rate differs from the borrowing 
rate as is usually the case in the real world. Letrg — 1 andr, — l denote the borrow- 
ing and lending rates, respectively, where rg > r,. Unfortunately, the sign of 
dv*/dr is not readily determinable. However, since f(x) is increasing in k, the 
analysis is straight-forward.'? 


-Y+ 


10 When rp > r,, the "no-easy-money condition" requires that the joint distribution function of 
fa... Bu satisfies 


M 
(4a) Pr Y. (8, — rW: «o >0 
t=2 


for all finite numbers 6, > O such that 8; > 0 for at least one i; 
(4b) Pr PO — ny < o} >0 
igs 


for all finite numbers 0; < O such that 6, < O for at least one i; and 


M 

(4c) Pr Y Be, - 2 BO, < o} >0 
[Lr kes” 
is 


for all finite numbers 6,, 9, > O and all S* © S such that 


M 
X 0, = X 8, 
i22 keS* 
ru 


and 6, > O for at least one i. When ry = r,, 4(a)-4(c) reduce to (4). 
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Consider first Models I-II] when noncapital income y = Q. In that case, it is 
apparent from (24) that when the individual is not in the trapping state (i.e, 
x » — Y), he either always borrows, always lends, or does neither, depending on 
whether 1 — v* is negative, positive, or zero. Let k, denote the maximum of (31) 
when the lending rate is used and the constraint 


M 
(65) asil 

iz2 
is added to constraints (27) and (28). Since the set of vectors č which satisfy (65) is 
convex and includes ù = (0,...,0), the Lemma still holds when (65) is added to the 
constraint set. Analogously, let kg denote the maximum of (31) under the borrowing 


rate rg subject to (27), (28), and 


M 
(66) Yuz1 
i-2 


Again, the Lemma holds since the set of ù satisfying (66) is convex and any 6 
such that Z4, v; = I, v; > 0, for example, satisfies all constraints. Setting 
k = max{k,,k,}, Theorem ! holds as before when y = 0. 

When y > Oin Models I-III and in the case of Model IV, no "simple" solution 
appears to exist when ry > r,. 


6. THE BEHAVIOR OF CAPITAL 


We shall now examine the behavior of capital implied by the optimal investment 
and consumption strategies of the different models. According to one school. 
capital growth is said to exist whenever 


(67) E[xj, i] > x; G = 1,20...) 


that is, capital growth is defined as expected growth [10]. We shall reject this 
measure since under this definition, as j > 20, x, may approach a value less than 
x, with a probability which tends to 1. We shall instead define growth as asymp- 
totic growth ; that is, capital growth is said to exist if 


(68) lim Prix» x,1 =l. 


When the > sign is replaced by the > sign, we shall say that we have capital non- 
decline. If there is statistical independence with respect to j, (67) is implied by (68) 
but the converse does not hold, as noted. 

Model IV will be considered first. From (63) it follows that nondecline of capital 
is always implied (in fact, the solution to the problem is contingent upon the 
condition that capital does not decrease, as pointed out earlier). It is readily seen 
that a sufficient, but not necessary, condition for growth is that there be a nonzero 
investment in at least one of the risky investment opportunities since in that case 
Pr (xj, > x] > 0,7 = 1,2,..., by (63). A necessary and sufficient condition for 
asymptotic capital growth is ar > 1, which is readily verified by reference to (62), 
(63), and the foregoing statement. 
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Let us now turn to Models I-III and let, as before, s, = x; + Y. From (7), (23), 
and (25) we now obtain 


M 
(69) Sj, = Sl -B| Si = not +r] 
i=2 
= sW (j= 1,2,...) 


where W is a random variable. By (28), W > 0. Attaching the subscript n to W for 
the purpose of period identification, we note that since 


j-1 
00 — s-2s [T]. 


(70) verifies that 
s,20 foralljwhenevers, 2 0 (Models I-11). 


Moreover, since Pr {W > 0! = 1 in Models II and III by Corollary 1, it follows 
that 


(71) 5; > 0. whenever s, > Oforall finite; (Models 11-11). 


From (70) we also observe that s; = 0 whenever s, = O forall j > k. Consequently, 
x = — Y is a trapping state which, once entered, cannot be left. In this state, the 
optimal strategies in each case call for zero consumption, no productive invest- 
ments, the borrowing of Y, and the payment of noncapital income y as interest on 
the debt. In Models II and III, it follows from (71) that the trapping state will 
never be reached in a finite number of time periods if initial capital is greater 
than = Y. 
Equation (70) may be written 


j- 
Sj = s,e* = log Wo 


The random variable 2/71 log W, is by the Central Limit Theorem asymptotically 


normally distributed ; its mean is (j — I)E[log W). By the law of large numbers, 


j-t 
> log W, 
NES > E(log W] asj oo. 
Thus, since s; > s, if and only if x; > x,, it is necessary and sufficient for capital 


growth to exist that E[(log W] > 0. 
It is clear that u given by p = eF'^*"! may be interpreted as the mean growth rate 


of capital. By (69), we obtain 


M 
E[log W] = log(1 — B) + E| tog 1 Y (B-ri; + 1| 
iz 
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For Model II], this becomes, by (30) and (26), 


M 
E[log W] = loga + max 2L E (Bi -nu- 3] 
lei) 1-2 


subject to (27) and (28). Thus, a person whose one-period utility function of 
consumption is logarithmic will always invest the capital available after the allot- 
ment to current consumption so as to maximize the mean growth rate of capital 
plus the present value of the noncapital income stream. 


7. GENERALIZATIONS 


We shall now generalize the preceding model to the nonstationary case. We 
then obtain, by the same approach as in the stationary case, for all j, 


My 
(72) f(x) = max T + TEN È (By — rey + rx- c) + yj 
1=2 


[JEEP 
subject to 


(73) c, 20, 


(74) 2,20, i£S, 
and 
(75) Pr Usa 2-— Yi) 


where the patience factor « the number of available investment opportunities M 
and S and their random returns fj; — 1, the interest rater, and the noncapital income 
y may vary from period to period; this, of course, requires that they be time 
identified through subscript j. Time dependence on the part of any one of the 
preceding parameters also requires that f (x) be subscripted. 

Asshown in [7], the solution to the nonstationary model is qualitatively the same 
as the solution to the stationary model. 

In the case of a finite horizon, the problem again reduces to (72-475) with 
fasi (x41) = 0 if the horizon is at decision point n + 1. In this case, f(x), x, c, Z, 
and Y must clearly be time identified through subscript j even in the stationary 
model. Under a finite horizon, a solution always exists even for Model I. Again, the 
solution is qualitatively the same as in the infinite horizon case except that the 
constant of consumption proportionality B, increases with time j, B, — 1, and 
zh = 0 for all i.!! 


University of California, Berkeley 


Manuscript received September, 1966; revision received January, 1969. 


!! The implications of the results of the current paper with respect to the theory of the firm may be 
found in (6]. 
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ON OPTIMAL MYOPIC PORTFOLIO POLICIES, WITH AND 
WITHOUT SERIAL CORRELATION OF YIELDS 


NILS H. HAKANSSON 


I. INTRODUCTION 


In a recent paper, Mossin! attempts to isolate the class of utility functions of 
terminal wealth, f(x), which, in the sequential portfolio problem, induces myopic 
utility functions of intermediate wealth positions. Induced utility functions of short- 
run wealth are said to be myopic whenever they are independent of yields beyond 
the current period; that is, they are positive linear transformations of f(x). Mossin 
concludes (1) that the logarithmic function and the power functions induce com- 
pletely myopic utility functions; (2) that, when the interest rate in each period is 
zero, all terminal wealth functions such that the risk tolerance index —f’(x)/f’’(x) 
is linear in z induce completely myopic utility functions of short-run wealth; (3) that, 
when interest rates are-not zero, the last class of terminal wealth functions induces 
partially myopic utility functions (only future interest rates need be known); and 
(4) that all of the preceding is true whether the yields in the various periods are 
serially correlated or not. With the exception of the last assertion, the same conclu- 
sions are reached by Leland.? The purpose of this note is to show that the second 
and third statements are true only in a highly restricted sense even when yields are 
serially independent, and that, when investment yields in the various periods are 
statistically dependent, only the logarithmic function induces utility functions of 
short-run wealth which are myopic. 


II. PRELIMINARIES 


In this and the next three sections, the following notation will be employed: 


xj; amount of investment capital at decision point j (the beginning of the jth period); 
M j; number of investment opportunities available in period 7; 
Sj the subset of investment opportunities which it is possible to sell short in period 7; 
r; — 1: rate of interest in period j; 

Bij proceeds per unit of capital invested in opportunity i, where + = 2,..., M; in the jth 
period (random variable); that is, if we invest an amount @ in i at the beginning of the 
period, we will obtain §;;8 at the end of that period; 

zı; amount lent in period j (negative zı; indicates borrowing) (decision variable); 

2,5; amount invested in opportunity i, i = 2, .. . , Mj, at the beginning of the jth period (de- 
cision variable); 

fix: utility of money at decision point 7; 
s] (xj): an optimal lending strategy at decision point j; E 
2;,(xj): an optimal investment strategy for opportunity $, i = 2,..., M, at decision point j. 


Fi(yu Y» © Yu) = Pr (Bu S Yn Bas S yn Bui S yu) i hom nem. 


l Jan. Mosin, "Optimal Multiperiod Portfolio Policies," Journal oj Business 41 (April 1968): 215-29. 
3 Hayne Leland, ‘Dynamic Portfolio Theory” (Ph.D. thesis, Harvard University, 1968). 
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As most portfolio models do, we assume, in addition to stochastically constant 
returns to scale, perfect liquidity and divisibility of the assets at each (fixed) de- 
cision point, absence of transaction costs, withdrawals, capital additions, and taxes, 
and the opportunity to make short sales. Furthermore, we assume, until Section VI, 
that the yields in the various periods are stochastically independent. 

Since the end-of-period capital position is given by the proceeds from current 
savings, or the negative of the repayment of current debt plus interest, plus the pro- 
ceeds from current risky investments, we have 


M; 
Xi = fizy + 2 bitis j732..., (0) 
where 
M, 
Dori = x j=1,2, (2) 
Combining (1) and (2) we obtain 
M; 
Xi 2.05 — rig d rx j= 1,2, (3) 


Let us now assume that f;(x,) is given for some horizon J. Then, as Mossin shows, 
we may write, by the principle of optimality,’ 


fx) = pmax Elia] g=i,...,J—-1 (4) 


where Z;(x;) is the set of feasible investments at decision point 7 given that capital 
is xj. When there are two assets in each period (i.e., M; = 2 for all j) and Z,(x;) = 
[za 0 € 22; € xj], that is, borrowing and short sales are ruled out, Mossin con- 
cludes that f;(x;) = a;fs(x;) + b; (where a; > 0 and b; are constants), 7 = 1,..., 
J — 1, that is, that the induced short-run utility functions at decision points 1,... , 
J — 1 are completely myopic, if and only if (1) fs(x) is either logarithmic or a power 


function, or (2) ri = 72 =... = ria 1 and f,(x) is one of 
Jia) = e; (5) 
f(x) = log (x + 1) ; (6) 
fale) = o (ne + yo AOAI, (7) 


where u = 0 and à are constants. Note that \ and u cannot both be negative. 

While the first conclusion is beyond dispute, the second is incorrect, as are the 
conclusions concerning partial myopia in general, except in a severely restricted 
sense. We shall first demonstrate the assertion in the preceding case and then show 
that it also holds when borrowing and short sales are not ruled out. 


III. AN EXAMPLE 


Assume that r, = r4 =... = fj = 1 and that there is only one risky oppor- 
tunity (ie. M; = 2) in each period. Moreover, assume that the proceeds f»; of 


* Richard Bellman, Dynamic Programming (Princeton, N.J.: Princeton University Press, 1957). 
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this opportunity are 
Bay = [o with probability 1/2 l; (g 
V = 13 with probability 1/2 allj (8) 
and that 
falas) = (x, + dy? d»0. (9) 
(9) clearly belongs to the class (7). (4) and (3) now give 
f(x) = ,102X Ef fillz = 1)zz; + xjl) j = 1, ttt) J—1 , (10) 
2:;j2xj 


where f;(x;) is given by (9). 
It is easily verified that 
does not exist xy, <0 
27.4 (X) 3) = " 0 < xja [S d. (11) 
l/2(x;:-- d) Xa 2d 


Thus, 1/2(3 dy + 1/2d! 
_ ty + 2 O<xa1<d 
fray) = lan stn 4 ayn xad ; (12) 
where 
aye, = 1/2/(1/2)? + 2:5]. (13) 


We now observe that fs1(x) is a positive linear transformation of f;(x) only for 
x > d; fora <d, fr-i(x) < aza fale). 
Proceeding with the solution to (10), we obtain 


(x;) O<a;< db; . 
fix) = ee +n ax» b; ^ get,...,J-1, (14) 


where a; is a positive constant, g,(xj) < a,(x; + d)!” for0 € x; < bj, 
b= dt 2b (;-0,j721...,7—1, (15) 


le) — hj(xj) O <x <b; 
nx) = WA +da) yb; 


and 
j=l,...,J—-1. (16) 


It is easily determined that #,(x,) is highly irregular except for j = J — 1. 

When J = 11 and d = 1,000, we obtain from (15) that b, = 1.023 million. Thus, 
when the horizon is ten periods distant, the optimal amount to invest in oppor- 
tunity 2 is, in this example, proportional to x; + d only if initial wealth x, exceeds 
$1 million by a substantial margin. Furthermore, while f;(x) is a positive linear trans- 


formation of f;(x) for x > 5, 3 2 1,...,J — 1, it is not for x < b;, that is, for 
xı < 1.023 million, x, < 511,000, etc., in the above example. Since the constant 
b; > 0 depends on the distribution functions F,,..., F;.,, the short-run utility 


functions induced by the terminal utility function (9) are clearly not myopic. In 
other words, to make an optimal decision at decision point 7, not only F; but Pj, 
..., Fy.i must be known. 
IV. BORROWING AND SOLVENCY 
The nonmyopic nature of the induced utility functions fi(xi), fa(22), .. . , fo-i(xo 3) 
in the preceding example is clearly attributable to the constraint 
0 < 2; < x; j2143,...,J—1, (17) 


which precludes borrowing and short sales. We shall now relax this constraint. 
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Case I.—(17) will first be replaced with 
0O<%;< m; ml j21,...,7—1, (18) 


that is, 100/m is assumed to be the percentage margin requirement. The solution 
to (10) for 7 — 1 now becomes 


1/2((1 — m)x;-, + d] 0 < rja < 


+ 1/21Qm + Vasa + d]? 2m — 1 
f1(x2-) = d 
ay—ifs(xs_1) XjaZ Im = 
(19) 
AXI. 0S xac j 
* ma 1 
2g a(ts) = d 
1/2(x5-1 + d) Xy2az im -—1' 
and the total solution is represented by (14)-(16) with à; 20,4 = 1,...,J7 — 1. 
Consequently, the optimal portfolio policy is nonmyopic in the case of constraint 
(18) also. 


Case II.—Let us now introduce an absolute borrowing limit of m, that is, sub- 
stitute 
0€ mj S2x;+m m»0 jz21,...,J—1 (20) 


for (17). When m < d/2 the solution to (9) is again given by (14)-(16) with b; > 0, 
j251,...,J — t. However, when m > L = d/2, the solution becomes 


fix) = a(x; + a)? j=l.’ Jl; 
zyl) = 1/2(x, — d) j5143..,J7—1; (21) 


zy;(x;) = 1/2(x; + d) j=l... J— 1; (22) 


that is, the optimal investment policy would seem to be completely myopic on the 
basis of our assumptions. But Z clearly depends on Fy41,..., F;-,. Thus to know 
whether m > L, knowledge of future returns is necessary. Consequently, the optimal 
investment policy is not myopic in Case II either. 

Let us now consider the realism of assumptions (18) and (20). With respect to 
(20), we observe from (21) that borrowing takes place, considering decision point 
J — 1, only when x,., < d. By (3), (8), (21), and (22), we obtain 


_ §1/2(xs_1 — d) with probability 1/2 


*7 = dox, + d with probability 1/2. 


Thus, the terminal wealth position has a 1/2 chance of being negative if and only 
if x», < d, that is, when borrowing takes place. If the first event (82,51 = 0) takes 
place and the investor declares bankruptcy at time J, the lender will stand to lose 
the entire loan of |1/2(x;.; — d)]. 

The point here is that it would be unreasonable for anyone to lend money to 
his investor when his optimal strategy calls for it; that is, & should be zero in (20)— 
which converts (20) to (17). In fact, (18) and (20) may be said to be inconsistent 
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with the portfolio model itself. This is because a wealthier investor (one whose wealth 
exceeds d) with the same preferences and probability beliefs as a poorer one is, by 
(21), a possible lender to the poorer one whose wealth is less than d. But the model 
assumes that lending is safe, that is, that all loans are repaid with probability 1 
while, as we have seen, the poorer investor may not be able to repay. 

Case I11.—4A borrowing arrangement that is consistent with the assumed riskless- 
ness of lending is one which permits borrowing to the extent that ability to repay, 
that is, solvency, is guaranteed. Thus, a reasonable constraint on borrowing and 
short sales, with considerable intuitive appeal as well, is given by 


Pr(x20)21 j214...,J—1. (23) 


When (23) is substituted for (17), the solution to (10) is the same as when (17) is 
used; that is, it is given by (14), (15), and (16). Thus, myopia is not optimal in this 
case either. 

V. THE GENERAL CASE 


It is readily verified that the conclusions of Sections III and IV are not changed 
if the number of risky investment opportunities is arbitrary. Moreover, the con- 
clusions hold for all of the functions (5), (6), and (7) whenever u > 0, both with 
no borrowing and in each of Cases I-III. Finally, when r; # 1,7 = 1,...,J — 1, 
partial myopia, as defined by Mossin, is not optimal either in any of the preceding 
cases. It should be noted that a solution need not exist in Case III unless the "'no- 
easy-money condition" holds.* A generalization of this condition (for the case when 
yields are serially correlated) is given in Section VI. In the most general version of 
Case III, the set Z,(x,) in (4) is given by those 2; which satisfy 

Zij > 0 1€; (24) 
and (23). 

When u = 0 in (6) and (7), [(5) is of no interest when u < 0], complete myopia is 
optimal in both Cases I and III but not in Case II, as is easily shown. The Mossin- 
Leland conclusions concerning complete myopia when rf, = n =... = ry—1 = land 
partial myopia do not apply in (6) and (7) when u < 0 either, except in Case III, 
as we shall demonstrate below. In doing so, we shall also show that, when a = 0, 
(5), (6), and (7) imply that the optimal investment policies at decision points 1, 
...,J — 1 are never myopic in the presence of explicit borrowing limits of any 
kind, with one exception. 

When a solution to the portfolio problem at decision point J — 1 exists in the 
presence of constraints (24) only, the optimal lending strategy 2;,;-1(x;:) has the 
form 21,4 1(x;-:) = (1 — MM i)us-: — AJ in the case of (6) (A = 1) and (7) and 
the form 21,7-:(%s-1) = xy, — By; in the case of (5), where 4y_, and By_, are 
constants, generally positive,? which depend on F,_,.° 

Let us consider (6) and (7) when A,u > 0. Since Ay_;, and hence, —£:,y~-1(%y-1), 

* Nils Hakansson, ‘Optimal Investment and Consumption Strategies under Risk for a Class of Utility 
Functions,” Econometrica 38 (September 1970): 587—607. 

$ Nonpositive A,_, and B, , imply that total short sales exceed or equal total long investments. 


* Nils Hakansson, “Risk Disposition and the Separation Property in Portfolio Selection," Journal of 
Financial and Quantitative Analysis 4 (December 1969): 401-16. 


118 N. H. Hakansson 


ON OPTIMAL MYOPIC PORTFOLIO POLICIES 329 


may be arbitrarily large, any finite borrowing limit has a chance to be binding. 
Consequently, for f; (x) to be a positive linear transformation of f;(x) in the pres- 
ence of a borrowing limit, it must be a positive linear transformation of f(x) whether 
the borrowing limit is binding or not. From Section IV, it is apparent that, when 
M, = 2 and (17) holds, a necessary and sufficient condition for f;..,(x) to be a posi- 
tive linear transformation of f;(x) is that sf y21(xy1) has the form zfj_\(xs-1) = 
02, (Ni + u), where as,- is a nonnegative constant. When there is more than 
one risky asset and (24) holds, this condition generalizes to 


ztga(3423) = aigaQxrad a) $22,..,M;za, (25) 


where the à;,,-—i are constants, nonnegative only for i € Sy_;.? It is now clear 
that the optimal investment strategy 2f, will have the form (25) if and only if 
(1) the borrowing limit is not binding or (2) the borrowing limit has the form 


M ja 
-as (= Y uoa- 224) € (AC. — Dx a + Coa 
fad 


Myra > 1, Ap>d. (26) 


The latter assertion follows from (2) and the fact that this form of the borrowing 
limit does give the solution (25) for any Fy_,, as is easily verified; moreover, only 
(26) is capable of giving a solution of form (25) when the borrowing limit is binding. 
Since knowledge of whether any given borrowing limit is binding or not requires 
knowledge of Fs_,, it follows that f,-,(x) is myopic in the presence of a borrowing 
limit only if this limit has the form (26). By induction, fi(x), . . . , fy-i.(x) are then 
myopic in the case of (6) and (7) for A,u > 0 if and only if the borrowing limit in 
period j is given by 

QC;— Dat Ce — M24, ag> je-1..,J—1. (27) 


When u < Oor À < Oin (6) and (7), any borrowing limit would, to be consistent 
with myopia, again have to have the form (27). But when u < 0, we must have 
À » 0 and vice versa so that (27) cannot be nonnegative for all x; > 0 for which 
borrowing may be desired, a basic requirement of any “true” borrowing limit. The 
situation in the case of function (5) is analogous. As a result, fi(xi), . . . , fu-i(%s-1) 
can never be myopic for (5), (6), and (7) when À < 0 or u < 0 in the presence of 
a borrowing limit. 

Turning now to the solvency constraint (23), we obtain whenever a solution exists 
for (6) and (7) that the greatest lower bound on b such that Pr[x; < b} > 0, for 
any decision at decision point J — 1 which satisfies (24), is Ky ixi + u) + 
721,5 (351), where Ks: is à constant which depends on F;.,. Since f7(—u/d) = œ 
for A > 0, we obtain, letting x; = Ky. xi + p) + rhua(x, 2), that Ax, + 
B > 0, which implies, since A and p cannot both be negative, x, > 0 when 
u <0. Thus the solvency constraint (23) is not binding when u < 0 but may 
be when 4 > 0. Consequently, the induced utility functions fi(z), . . . , f»-i(x) are 
myopic for the class (6) and (7) when u < 0 in the presence of (24) and the solvency 
constraint (23). 


7 Ibid. 
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In sum then, when u = 0 and interest rates are zero, the induced utility of wealth 
functions fi(xi), . . . , fy-1(”s-1) are myopic in the presence of borrowing constraints 
if and only if the borrowing limits are of the form (27) and f;(x) is of the form (6) 
(7) with A,n > 0; in the presence of the solvency constraint, fi(zxi), . . . , fz-i(xs-i) 
are myopic only if f;(x) has the form (6) or (7) with u < 0 (and hence A > 0). 

It should also be noted that, when u < 0, f,(x) is undefined for x < |u/A] and 
fi(x;), j 9 1,...,J — 1 is undefined for small x;, a significant drawback. In addi- 
tion, the relative risk aversion index — xf; (x)/f;(x) is decreasing for these functions, 
whereas Arrow,® for example, suggests that plausible utility functions of money 
exhibit increasing relative risk aversion. 


VI. SERIALLY CORRELATED YIELDS 


We shall now consider the sequential investment problem when yields are serially 
correlated. In contrast to Mossin's assertion,? we shall find that the optimal invest- 
ment policy is myopic in this case only for a small subset of the terminal utility 
functions which induce myopic short-run utility functions when returns are serially 
independent. 

For simplicity, we assume that yields and the interest rate obey a Markov process. 
A distinction between risk due to general market forces, called. the economy, and 
risk due to individual assets and periods is made. As a result, the assumptions and 
notation of Section II are modified as follows: 


xj amount of investment capital at decision point j; 
N; number of states of the economy at decision point 7; 
M jm: number of investment opportunities available at decision point 7, given that the economy 
is at state m at that time; 
Sjm: the subset of investment opportunities which it is possible to sell short at decision point j, 
given that the economy is in state m at that time; 
fim — 1: interest rate in period j, given that the economy is in state m at decision point j (rjm > 1); 
Bijmn: proceeds at the end of period 7, given that the economy is in state » at that time, per 
unit of investment in opportunity i, i = 2,..., Mjm, at decision point 7, given that 
the economy was in state m at that time; 
Pim: probability that the economy makes a transition from state m to state # in period 7 


Zijm: amount lent in period 7, given that the economy is in state m at decision point j (nega- 
tive zij4 indicate borrowing) (decision variable); 
Zija? amount invested in opportunity 4, 4 = 2, .. . , M js, at decision point j, given that the 
economy is in state m at that point; 
fim (xj): utility of money at decision point j, given that the economy is in state m at that time; 
|j»: an optimal lending policy for state m at decision point j; 


Z5; an optimal investment policy for state m at decision point j, 2 2,..., M jm 
` Zij . 
Zim = (Zijm, tey EM, im) P Vijm = Us > = 1, , M jm 


dim = (vajm; teeny UM jim) ; F jmn (Ya; sey YM, n) = Pr {Baimn < Sr) Bu, minn o < yu, E ., 


* Kenneth Arrow, Aspects of the Theory of Risk-bearing (Helsinki: Yrjö Jahnssonin Säätiö, 1965). 
? Mossin, p. 222. 
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Clearly, v;j4 represents the proportion of capital x; invested in opportunity 7 at 
decision point j, given that the economy is in state m at that point; thus 


Min 
Uy = i- > Usjm . 
im? 


It will be assumed that the joint distribution functions F mn» are independent with 
respect to 7. In addition, we postulate that the [8;;4.] satisfy the following condi- 
tions: 


Pr (0 S Bins < o) = 1, i-22,..., Mim, (28) 
all j, m, and n 
Mim 
Pr | >> (Bijmn — Fim); < o} >0 (29) 
inf 


for all 7, all m, some n for which imn > 0, and all finite 6; such that 6; > 0 for all 
i € Sim and 6; ~ O for at least one s. (29) is a modification of the “no-easy-money- 
condition" for the case when the lending rate equals the interest rate.!? This condi- 
tion states that no combination of risky investment opportunities exists in any 
period which provides, with probability 1, a return at least as high as the (borrowing) 
rate of interest; no combination of short sales is available for which the probability 
is zero that a loss will exceed the (lending) rate of interest; and no combination of 
risky investments made from the proceeds of any combination of short sales can 
guarantee against loss. (29) may be viewed as a condition which the prices of all 
assets must satisfy in equilibrium. 
(3) is now replaced by the conditional difference equations 


Min 
zip em = 5 (Biimn — Tim)Biim F fimt; J =1,...,J—1,all mn, (30) 
122 
and (4) becomes, for j = 1,..., J — 1 and all m, 
Nj 
Jinzi) = max 2 Pim El fiii (5| mn)] , (31) 
where fym(xs) is given for all m, subject to 
Zijm > 0 1€ S jn , (32) 
and 
Pr {xj41|mn > 0} = 1 n= 1,..., Nga. (33) 


VII. OPTIMAL MYOPIC POLICIES 


On the basis of the finite yield assumption (28) and the “no-easy-money-condi- 
tion" (29), we obtain the following: 

Theorem.—Let fim, Fim, and Pjmn be defined as in Section VI and let u(x) be a 
monotone increasing and strictly concave function for all x > 0. Then the functions 


him (jm) = ho is Y (Bim — Tim) Viim + rm |} ; (34) 


1? Hakansson, “Optimal Investment . . ." (see n. 4 above). 
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subject to 

Vg > 0 1€S;.; (35) 
and 


Mim 
Pr PCS — f jm) ijm T f jm > o} = 1 n = 1, e.g Nizi (36) 


have maxima for all j = 1,...,J — 1 and all m. Moreover, the maximizing vec- 
tors, 97, are finite and unique. The proof may be found in Hakansson (1968)." 
Let us now assume that 


fan(z) = x alm; (37) 


and let &;,, denote the maximum of (34) subject to (35) and (36) when u(x) = x1”; 
that is, 


N Mim ya 
Rim = 2 Pies E| [ 2a, (Bisma = 1 jm) tjm + rim | l j = 1, s.y J — 1 all "m. (38) 


By the theorem, we know that ky, exists. 
Let us now determine f; 1,4 (x;-:). From (31) we obtain for all m 


N, 
faim xu) = ymax Zaps -imn Elas min)i] D] (39) 
J-lm a= 
subject to 
Z1» Z 0 t€Sy_im (40) 
and 
M jm 
Pr i e (Bi ime — f) m)85J-1,m T oriainxaX of = 1 n= 1, e. N,. (41) 


By (29) and (41), fy-1(v-1) does not exist for xj. < 0. For z;.: > 0, (39) may be 
written, since (32) and (33) are equivalent to (35) and (36) when x,_, > 0, 


N, 
fj im (xy) = xj max 2 fum 
LR nul (42) 
Mim 1/2 
Ei Bist — FJ-im)PiJ-1.m F tsm] l ; 
, i=? 
subject to . 
Y.s-1.m 2 0 4€Sy-im (43) 
and 
M jim 
Pr i - (Bi s-i ms — Poem) sim T sein X of =1 nzl...,N,. (44) 


By the theorem, we now obtain that fy_1,4(%y-1) exists for all m and x; > 0 and 
is given by, using (38), 
frs (x) = ky Sx? allm. (45) 


u Nils Hakansson, “Optimal Entrepreneurial Decisions in a Completely Stochastic Environment," 
Management Science: Theory 17 (March 1971): 427-49. 
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By (31), the expression to be maximized at decision point J — 2, given that the 
economy is in state m at that time, becomes 
Moy 
»» Ps-amnks—1 SE[(x7-1| mn)! . (46) 
Since the constants kj—ı,n will in general be different for different n, (46) and, there- 
fore, the optimal portfolio 2$_2,.m. depend on the yields in period J — 1. Thus the 
optimal investment policy is not myopic at decision point J — 2; neither is it myopic 
at decision points 1,..., J — 3, which is easily shown by induction. 

The existence of positive constants ai,...,@s-1 and of constants bu,..., 
bs., such that 


Sim (2) = Af 1m (x) + Bin all "m, j =1,...,J-1 (47) 


are clearly both necessary and sufficient for myopia to be optimal in this model. 
As noted, (45) violates (47) for j = J — 1 whenever Ns_, > 1. However, when 


Nj-1, jahan” J, (48) 


(47) is satisfied; but (48) also implies that yields are statistically independent in 
the various periods. This confirms Mossin's result that the optimal investment 
policy is myopic when returns are stochastically independent over time and the 
terminal utility function is x1”. 

Let us now assume that /,,,(x) has the form 


fan(x) = log x allm. (49) 


Letting Hy_;,, denote the maximum of (34) subject to (35) and (36) when u(x) = 
log x, that is, 


J—1.m 


> (Bi. rims — A + "anl , ($0) 


N; M 
Hy. = pia SE] log [ 


we obtain from (31)-(33), solving recursively, 
f[m(x)-1logx-d-b alim j;-1...J—1, (51) 


(where bj... = Hs-i.m, all m), which is consistent with (47). As a result, the in- 
duced utility functions 


fu(x), ... , fin (2), eee fIal) J-i (2) 


are myopic when the terminal utility function is logarithmic, both when yields are 
serially correlated and when they are not. 

Just as in the case of (37), which is a special case of (7), the optimal investment 
policy is not myopic for any function (7) (whether u = 0 or not), nor for any func- 
tion (5), when yields are serially dependent, as is easily verified. Since interest rates 
are assumed to be positive and state-dependent, myopia is not optimal for log 
(x + u), u = 0 either, or any other nonlogarithmic function, under serial depen- 
dence. Thus, when yields are serially correlated, only the logarithmic utility func- 
tion of terminal wealth induces short-run utility of wealth functions which are 
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myopic when yields in the various periods are nonindependent, contrary to Mossin's 
assertion.” 


VHI. CONCLUDING REMARKS 


In view of the difficulty of estimating future yields and their apparent serial cor- 
relation, the myopic property of the logarithmic utility functions is, of course, 
highly significant. However, this function also has other attractive properties.!! Per- 
haps the most important of these is the property that maximization of the expected 
logarithm of end-of-period capita] subject to (32) and (33) in each period also 
maximizes the expected growth rate of capital, whether returns are serially corre- 
lated!* or not. 

As Mossin points out, the portfolio decision is in general not independent of the 
consumption decision. A realistic model of the investor’s decision problem must, 
therefore, include consumption as a decision variable and a preference function for 
evaluating consumption programs. Consumption-investment models of this type 
have been developed by Hakansson, both when investment yields are serially corre- 
lated! and when they are not." 


?! Mossin, p. 222. 

i1 Some of the properties are reviewed in Nils Hakansson and Tien-Ching Liu, “Optimal Growth Port- 
folios When Yields Are Serially Correlated," Review of Economics and Statistics 52 (November 1970): 
385-94, 

M Ibid. 

5 Henry Latané, "Criteria for Choice among Risky Ventures," Journal of Political Economy 67 (April 
1959): 144-55; and Leo Breiman, "Optimal Gambling Systems for Favorable Games," Fourth Berkeley 
Symposium on Probability and Mathematical Statistics (Berkeley: University of California Press, 1961). 

1 Hakansson, “Optimal Entrepreneurial Decisions . . .” (see n. 11 above). 

17 Hakansson, "Optimal Investment ..." (n. 4 above); and “Optimal Investment and Consumption 
Strategies under Risk, an Uncertain Lifetime, and Insurance," International Economic Review 10 (October 
1969): 443-66. 
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EVIDENCE ON THE “GROWTH-OPTIMUM” MODEL 


RicHARD Rorr* 


I. INTRODUCTION 


A PORTFOLIO OWNER may hope to maximize the long run growth rate of his 
real wealth. In 1959, Latané suggested maximum growth as an operational cri- 
terion for portfolio selection, contending that its (possible) suboptimality 
on theoretical grounds was practically unimportant and emphasizing Roy's 
[1952] warning that *A man who seeks advice about his actions will not be 
grateful for the suggestion that he maximize expected utility." 

In the past ten years, however, little work on growth-maximization of port- 
folio value has appeared in the academic literature of finance or economics. The 
neglect was due to competing norms for asset selection, particularly to norms 
based on two-parameter, two-period portfolio models deriving from the work 
of Markowitz [1959], Tobin [1958], Sharpe [1964], and Lintner [1965]. 
These were developed into full theories of capital market equilibrium and the 
empirical evidence collected in their support seemed at least sufficient to justify 
continued research along the lines of relaxing assumptions and performing more 
tests on observed portfolio behavior. 

Recently, Hakansson [1971] and Hakansson and Liu [1970] again brought 
the growth maximization criterion to our attention. Hakansson presented a 
persuasive theoretical argument that “. . . the mean-variance model [a special 
case of the aforementioned two-parameter model, was] severely compromised 
by the capital growth model in several significant respects."! 

Most readers will find the following a significant respect: Given temporally 
independent returns, a number of mean-variance efficient portfolios can be 
shown to bring complete ruin after an infinite sequence of re-investments. It is 
true, of course, that such sequences may indeed be optimal from an expected 


* Graduate School of Industrial Administration, Carnegie-Mellon University. The comments of 
Eugene Fama, Haim Levy, Robert Litzenberger and Myron Scholes are gratefully acknowledged. 
Remaining errors are due to the author alone. 

This project was supported by the Ford Foundation which does not necessarily agree with the 
results and opinions, 

1, Hakansson [1971, p. 517]. 
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utility viewpoint, despite ultimate ruin. No mathematician can prove the 
contrary, especially when it is recognized that a typical investor’s horizon will 
probably fall short of infinity and that he will likely consume a fraction of his 
assets each period. However, one of our basic concerns should not be with 
formal proof, but with practical and intuitively credible models of investor 
behavior. In this regard, "growth-optimum" portfolios possess appealing fea- 
tures even during a finite time span. For example, such portfolios maximize the 
probability of exceeding a given level of wealth within a fixed time.? 

Hakansson also pointed out other features of the growth-optimum model that 
may be more or less appealing. It implies a logarithmic (in wealth) utility 
function, which displays decreasing absolute risk aversion, and it implies 
optimal decision rules that are myopic. As a further embellishment, Hakansson 
and Liu derived a "separation theorem" that holds the optimal sequence of 
investments to be independent of the sequence of wealth levels even when 
returns are stochastically dependent across time. (Myopia also holds with tem- 
poral dependence). 

Although these are strong challenges to the practical superiority of two- 
parameter portfolio models, we should not abandon the latter too hurriedly. 
They have been successfully used in many empirical contexts and their com- 
petitors should be required to weather empirical examination; so the purpose of 
this paper is to report on some empirical tests of growth-optimum theory using 
common stock returns, 

Briefly, the growth-optimum model receives mixed support. In some tests it 
performs extremely well while the results of other tests are puzzling. In com- 
parison to the mean-variance model it also performs well but the test results are 
clouded by the close operational similarity of the two models. 


II. A Test STATISTIC FOR THE GROWTH-OPTIMUM MODEL 


The quantitative derivation of the growth-optimum rule will employ the 
following convenient 


NOTATION: 


n,—number of shares purchased of security j initially 
pi.c— price per share of security j in period t. 
N 


Vom > njp, — Value of a portfolio of N distinct securities in period t. 


Xt —np,/Z nP; fraction of resources invested in security j in t. 

Rje = ([(p,. + Dj)/p,4—1] —1}—rate of return to security j from t — 1 
to t. 

D,.c—per share dividend or coupon paid to security j between t — 1 and t. 

E—Mathematical expectation. 


2. Breiman [1961, section 5]. Hakansson seems to have erred slightly when he states that 
“Breiman has shown that if the objective is to achieve a certain level of capital as soon as possible, 
then the optimal-growth portfolio . . . minimizes the expected time to reach the given level,” 
Hakansson [1971, p. 540]. Breiman conjectured that this was true but was unable to state a proof 
for a fixed level of wealth. As wealth grows indefinitely large, however, the limited expected 
minimum time is in fact achieved by the "growth-optimum" portfolio. 
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Since the rule for maximizing the expected growth of portfolio value is 
myopic, one only needs to optimize between successive periods. Thus, the 
growth-optimum rule is 


maximize E[loge(V./Vi_1) ] = E (log [EX1 (1 + Rie) }} (1) 
Xin 


subject to 2:Xi:-1 = 1. Negative values of X represent short sales which, by 
assumption, can be accomplished without penalty. Transaction costs are ne- 
glected. ' 

Although the problem is easily solved without further qualifications, for 
historical comparison purposes it is worth assuming that the N‘ asset, denoted 
.F, is risk-free and that it receives the residual portion from the amounts in- 
vested in all risky assets; i.e., 


N—I1 N—1 
VN -1 = Y Xua (1+ R4) + (1+ Ret) (a — D Xa). 
jot jel 
First-order conditions from problem (1) show that the investor’s growth- 
optimum portfolio will be determined by proportions X* such that 
R,,—R 
E|—————— 7€ 2 0—0——-[|[20;j—1...,N—1 (2) 


N—1 


1+ Ree + Xa (Ris — Rr.) 


As illustrated by Hakansson [1971] and Breiman [1960], these optimal invest- 
ment fractions generally will imply a diversified portfolio but their exact values 
cannot be determined without specifying the joint probability distribution of 
the R,’s. 

To obtain a testable proposition from (2), however, it will not be necessary 
to specify that distribution. We can rely instead on the fact that in a given 
period, the value of 


1+, 
1+ Rea + =[Xse—1 (Rye — Rr) | 


expected by each individual is equal for every risky security. Advancing to an 
aggregate level will require either of two traditional assumptions: (a) that all 
investors hold identical probability beliefs or (b) that a “representative” in- 
vestor holds the expectations of equation (2) with the invested proportions 
(X*’s) being equal to relative values of existing asset supplies. In either case, 
the denominator of (2) is equal to 1+ Rm. a “market return" defined as a 
value-weighted average of all individual asset returns. The approximately ob- 
servable? variable 


1+ Ry 
1+ Ra 


3. It is only approximately observable because no comprehensive value-weighted asset indexes 
exist, 


Za. = (3) 
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will have the same mean for all securities. It will also be true, of course, that 
some unexplained variation may occur about these expectatioris so that the 
quantities Z;, will not be exactly equal in every period. However, an appro- 
priate test of the identity of expectations only requires that corresponding 
sample means be statistically equal; that is, the temporally averaged means 


(4) 


must be insignificantly different across securities. Testing the equality of N 
sample means is an analysis of variance problem. Its application to New York 
and American Stock Exchange listed securities will be reported in the next 
section. 


III. A SIMPLE TEST oF THE GROWwWTH-OPTIMUM Moper’s Basic VALIDITY 


Data used in this section are rates of return obtained from the Wells Fargo 
rate-of-return tape prepared by M. Scholes. The tape contains daily price 
changes for all common stocks listed on the New York and American exchanges 
from June 2, 1962 through July 11, 1969. It is a condensation and thus a 
tractable version of the ISL Quarterly Historical Stock Price Tapes. The 
Standard & Poor's Composite Price Index (The 500") is used to obtain the 
"market return." 

In the following test, returns were taken over weekly intervals. Thus, the 
statistic 


Ze = (1 + R,)/G + Rao) 


was calculated for stock j at the end of week t; where 1+ Rje = (Pie + 
D,4J/p,, i and 1 + R,, was similarly calculated using the S&P Composite 
Index. The null hypothesis requires the expected return ratios to be equal, 
E(Z = E(Z,,), for all i, j, and t. Usually, this would be tested by one-way 


= 1 
analysis of variance on the temporally-averaged means, Zsa Z,, but 


simple analysis of variance procedures requires that all the Z,,’s be uncorre- 
lated cross-sectionally and have equal variances under the null hypothesis. 
These assumptions are obviously too strong and are not required by the (null) 
growth-optimum hypothesis anyway. Furthermore, we have the evidence of 
many previous studies to confirm the existence of positive covariation between 
stock returns and returns on a market index. Some researchers (notably King, 
[1966]) have calculated directly a substantial co-movement among stock 
returns. Although the covariation between two individual stocks returns, say R, 
and R,, may be reduced in the return ratios (Z’s) as the result of division by 
1 + Rm, it would be too audacious to assert a complete elimination, Further- 
more, there is no a priori reason to suppose that Var(Z;) = = Var(Z,), as is 
required by a simple one-way of variance. 

Fortunately, the Hotelling T? statistic* is available for precisely those cases 


4, See Morrison [1967, pp. 117-124] or Graybill, [1961, pp. 205-206]. 
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where independence among observations and equal variances do not hold. At 
the expense of considerable computer time, which is patiently borne by Car- 
negie-Mellon's undergraduates and the Nation's taxpayers, and is thus free to 
its current beneficiaries, this statistic was calculated for groups of 31 stocks 
selected alphabetically according to the procedure described below. 

Hotelling's T? is computed as a quadratic form in the sample vector of 
differences between adjacent return ratios and the sample covariance matrix 
of those differences.” It is distributed as an F distribution with k — 1 and 
N — k + 1 degrees of freedom where N is the sample size (weeks) and k is the 
number of stocks in a group.’ Since stocks are not always traded over coinci- 
dental calendar periods, and since coincidental observations are required in 
order to calculate sample covariances, the sample size N of each group was 
reduced to the number of weeks when all 31 stocks had been traded and re- 
corded. At most, this was the number of weeks for the stock that had the 
minimum in its group and it was generally somewhat less. In fact, realizing in 
advance that some groups might be reduced to a very low number of coinci- 
dental observations, I decided to discard a stock if keeping it in the group 
meant reducing the second number of degrees of freedom, N — k + 1, below 
30. When a stock was discarded, the next alphabetical one on the tape was 
added to the group. Of course this meant that a stock was then missing from 
the subsequent group and another had to be taken from the group following 
that and similarly to the end of the tape. Finally, 68 groups of 31 stocks re- 
mained for analysis and this number comprises all the stocks on the tape with 
sufficient coincidental observations for the test procedure. These 68 F statistics 
are depicted in Figure 1 and tabulated in Table 1. 

The distribution of Figure 1 is stochastically below the expected null distribu- 
tion. For example, using a Chi-square goodness-of-fit test with 17 classes to 
compare the F sampling distribution with the null distribution, the test statistic 
is about 70, which is far above the .005 level of significant difference between 
the two distributions. It should be emphasized, however, that only high F values 
reject the null growth-optimum hypothesis. Thus, growth-optimum theory is 
strongly, even too strongly, supported by this test of its basic validity." 


5. As further explanation, recall that Z, = (1 + R, 0/7 + Rm.) is the ratio of return on 
stock j to the market return. For a given group of 31 stocks, the differences in return ratios are 
calculated as Yj = Ze — Zine for j =1,.... 30. The means of yj, and covariances of yj, 
and y, , are then calculated over time. Hotelling's statistic is based on the sample quadratic form 
$'S-!y where $ is the vector of sample mean differences and S is the sample covariance matrix 
of differences. Differences were calculated between adjacent alphabetic pairs but any other random 
scheme for selecting pairs would have been equally acceptable. Cf. Morrison [1967, pp. 135-138]. 

6. This is the rationale for using a group size of 31 stocks: since the number of degrees of 
freedom for the test is k—1, where k is the number of stocks, a group size of 31 is both large 
and makes tabular comparison easy. If the group size had been chosen larger, the second degrees 
of freedom parameter, N—k--1, (where N is the number of available time points), would be reduced 
to a low number. Thus, I thought k—31 would balance the two d.f. parameters and still leave 
them quite large. 

7. Because the Hotelling test supports the null hypothesis too strongly, I decided to check several 
potential causes. One obvious possibility is the thick-tailed distributions of stock returns that have 
been pointed out by many researchers, (Cf. Fama [1965], Blume [1970]). The appropriate way 
to check this problem is to use a non-parametric analysis of variance, Friedman's multi-sample 
test, (Bradley, [1968, p. 127]). This was done for exactly the same sample of stocks grouped in 
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Frequency 


Computed 
F Statistics 


F Distribution 
expected under 
null hypothesis 


Ficure 1 
Distribution of F Statistics from Hotelling's T? Test of Equality Among Mean Return Ratios 


IV. RELATIONS BETWEEN THE GROWTH-ÜPTIMUM AND THE SHARPE-LINTNER 


MonELs 
A. Theory 
When one compares the first order conditions for the growth-optimum model 
1 4- &, 1 
Egg, Eg i j=1,...N— 1 (5) 
with those for the Sharpe [1964]-Lintner [1965] model 
PUED pp) ee jol,....N-1 (6 


(where 8, = Cov (&, R,.)/Var (Ra)), some correspondence appears but it is 
rather difficult to evaluate fully just by inspection. For example, when the 
Sharpe-Lintner risk coefficient, B,, is equal to zero, equation (6) becomes 


EO +R) — 14 R 
EO + E) EQ ES 


But when D, = 0, Cov (R,, Ra) = 0, and the growth optimum condition ex- 
pressed in equation (5) becomes 


the same way. The results were identical to those obtained by using Hotelling’s T2, the growth- 
optimum model was too strongly supported. 

A second possible misspecification is a deficiency in the market price index. The Standard & 
Poor's Composite Price Index, used in the preceding tests, is heavily-weighted in favor of a few 
stocks, Also, since it is essentially a “buy-and-hold” portfolio, weights of individual stocks change 
over time as relative prices change. Evans [1968] and Cheng and Deets [1971] have provided 
empirical evidence that such an index performs quite differently from a "fixed-investment propor- 
tion" or "rebalanced" index, Therefore, a rebalanced index composed of stocks on the tape was 
constructed and used in reporting the tests already done. Again, no difference was detected. 

Thirdly, the possibility that an unrepresentative episode biased the entire sample period of seven 
years of weekly observations was checked by repeating the tests for annual sub-periods, There was 
no perceptible difference among the sub-periods or between the overall period and any sub-period. 
In every case the growth-optimum model was strongly supported. 

Further details of all the tests in this footnote are available in an earlier working paper. (It was 
edited to conserve Journal space.) 
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TABLE 1 
CALCULATED VALUES OF HOTELLING’s T? STATISTIC FROM COMMON STOCK 
MEAN RETURN Rats, 1962-1969 


Sample size Sample size 
Group (weeks) T? Group (weeks) T? 
1 113 36.0 36 98 30.8 
2 81 20.2 37 76 35.0 
3 61 53.3 38 101 29.2 
4 61 45.4 39 79 54.1 
5 143 28.4 40 69 . $6.3 
6 76 35.3 41 107 32.3 
7 132 55.5 42 61 68.9 
8 61 14.4 43 61 72.0 
9 148 29.7 44 78 32.0 
10 131 24.1 45 64 35.9 
11 90 22.6 46 97 46.3 
12 68 28.7 47 66 56.9 
13 108 41.3 48 62 57.1 
14 76 27.5 49 91 24.9 
15 109 41.5 50 60 50.1 
16 273 29.4 51 61 37.6 
17 111 33.5 52 64 40.3 
18 110 35.0 53 63 38.3 
19 175 36.5 54 65 25.6 
20 75 42.0 55 60 35.1 
21 95 27.9 56 67 63.8 
22 76 51.3 57 80 48.2 
23 112 224 58 60 107.0 
24 74 64.5 59 96 24.9 
25 112 23.4 60 79 39.0 
26 127 31.8 61 73 57.8 
27 158 38.6 62 82 56.2 
28 78 23.0 63 72 28.5 
29 123 39.6 64 66 65.7 
30 112 40.2 65 61 47.3 
31 104 49.7 66 65 53.5 
32 61 32.3 67 75 42.3 
33 115 35.2 68 63 52.9 
» o ee Note: The F statistic is calculated as 
l N — 30 2 
(N—1)30 ` 


1 I 
1+R, 1+ Ra 
Since the terms containing market returns cancel in both of the displayed 
equations just above, the growth optimum model provides a market diversifica- 
tion result which is well-known from Sharpe-Lintner theory; namely, a security 
whose portfolio risk is zero will sell at an expected return equal to the riskless 
rate. 
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FIGURE 2 
Stocks Satisfying the Growth-Optimum Model with Different Degrees of Market Response 


It is not true that the growth-optimum model implies a constant D coefficient 
although it may seem to after a first glance at equation (6).° A simple example 
is sufficient to show the contrary. Figure 2 illustrates the discrete probability 

8, Because (6) is 


EU +R) 1+Rp 3 [ 1+Rp ] 
—n = oo + 1 — ——_.—_ 
EO +a) EU+Rk,) ^! EU + Rp) 


1+R, 1+; 
E ———— = E ——Á—— , 
1T E, 14 R, 
a reader not cautious about quotients and reciprocals of random variables might think that the 
latter model implies f, = zero, independent of j. 


and (5) is 
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distributions of stocks A and B and of the market. In this example, only four 
equally likely returns are possible for the market and each stock can return 
corresponding amounts to those four. The growth-optimum model requires that 


CHE) CIE) 


1+ E, 1+ Ra 


which in geometric terms implies that the expected slopes of rays from the 
origin through each possible point in the 1 + R,, 1 + Ra plane is equal for both 
stocks. This is clearly satisfied by the radially symmetric solid lines which 
pass through the points of possible occurrence in Figure 2. Nevertheless, the 
slopes of regression lines of Rj on Rm, indicated by dashes, are quite different. 
Stock A has low portfolio risk because its D is low, while stock B has much 
greater response to the market and thus higher risk. Both securities’ returns 
are perfectly functionally related to the market return but it is already apparent 
that correlation per se will have the same relatively unimportant role in a 
growth-optimum as it has in the Sharpe-Lintner framework (i.e., the expected 
slopes of rays can be equal no matter what correlation occurs between a stock’s 
return and the market’s), 

A close correspondence between the two models can be made more ap- 
parent by rearranging a few terms. The result ( derived in the footnote?), is 


" " 1 " 1 ` 
E(R, — Rr) = | cov ( %,—— ) /cov (&, —.—)| E(R, — Rr); 
` 1+Rn 1 


+ Rn 
(7) 
which is very similar indeed to the Sharpe-Lintner equilibrium equation. In 
~ 1 
fact, since Cov E ze) is negative, one is tempted to suggest that 


a second implication of Sharpe-Lintner theory is also satisfied by the growth- 
optimum model: namely, that security j’s expected return will exceed the 
riskless return if and only if Cov (R,, R,,) is positive. One would need to prove, 


9. To obtain (7), note that equation (5) is equivalent to 
CoviR,, 1/1 + K,)1— (1+ ROK — EO 4- ÉDK (5a) 


where K = E[1/(1 + &,,)1. 


Multiplying both sides of (5a) by X, the proportion of wealth invested in security j, (or the 
fraction of aggregate economic wealth represented by security j), 


Cov EX R, 1/0 + £09] = -K FE [XER — Ry) | 
j 1 


and substituting for the definition of R, i.e., for Rẹ, = Rp + EX (R; — Rp), 
Covi Rm — Re(1 — EZ Xp, 1/0 + Rm) 1 = —K E(R,, — Ry). 
i 
Since Rp(1—2x;) is a constant, 
K = —CovIR,, 1/0 + ,)/E(R, — Rp). 


Substituting this for K in (5a) provides equation (7). 
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1 
however, that Cov (R;, Rn) > 0 implies Cov e —) « 0 and this 


+R, 


is definitely not true in general. Counterexamples can be displayed for highly- 
skewed distributions.!? 


B. Testing Sharpe-Lintner vs. Growth-Optimum 


The two models will provide approximately equivalent empirical implications 
if certain restrictions are placed on the ranges of individual rates of return. To 
verify this, one only needs to note the following facts: (a) quadratic utility 
functions and homogenous anticipations will lead to the Sharpe-Lintner equi- 
librium result of equation (6) and (b) the logarithmic utility function implied 
by portfolio growth maximization can be approximated by a quadratic as 


log (1 + Ry) f Ry — 1/2 Rr? 


provided that the portfolio’s total return is restricted to less than 100 per cent 
per period.’ It is indeed trivial to show that the two models are identical when 
this approximation to the logarithm is made. Thus, given the truth of one 
theory, we should not be surprised to find that an empirical test of the other 
supports it very well, especially when the observed rates of return used in test- 
ing fall predominantly near zero. Of course, this leads us to ask whether the 
strong empirical support for the growth-optimum model reported in the pre- 
ceding section is really damnation for Sharpe-Lintner or just an accidental 
stroke of choosing a short time interval (one week) which guaranteed that 
returns were never observed far from zero. 

An obvious way to test this is suggested by the logarithmic approximation. 
If growth-optimum theory appears to satisfy the data only because the loga- 
rithm approximates a quadratic when returns are near zero, one should choose a 
longer time interval for empirical testing so that many more large and small 
returns are observed.!? This was done for both four week and twenty-six week 
periods with the same common stock data as used previously and the conclu- 
sions were identical to those for weekly periods already reported. 

A more refined and direct test to discriminate between the two models can be 
based on equilibrium conditions of the two competing theories, 


" ~ 1 
E(R, — Re) ={ cov | &.- V |/ 


T Ra 
- 1 ~ 
Cov |) E(Rm — Re) = y; (8) 


m 


10. However, for at least one special asymmetric case, (lognormal distributions) the growth- 
optimum model does agree completely with the Sharpe-Lintner result that a security's expected 
return will be a linear function of systematic risk; i.e, of Bj. I am indebted to Robert Litzenberger 
for demonstrating this point. 

11. And more than minus 100 per cent per period. Without short sales, this last restriction is 
presumably satisfied for common stocks by the existence of limited liability. Samuelson [1970] has 
derived a broader "fundamental approximation theorem" which shows the close match of mean- 
variance to any correct portfolio theory when the joint distribution of returns has a small dispersion. 

12. This completely ignores the crucial question of investor horizon period that may have a sig- 
nificant effect on the form of the Sharpe-Lintner market model. Cf. Jensen [1969, pp. 186-191]. 
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which is the growth-optimum condition (7), and 
E(K, — Re) = {Cov [&, Ran] /Var (Rn) } E(Rm — Re) = ò; (9) 


which is the familiar Sharpe-Lintner condition. A suggested test procedure 
obtains the best estimates of all components of (8) and (9) from time series 
and then performs cross-sectional regression with those estimates. 

A series of comparative tests of these two equations was conducted with the 
common stock returns mentioned previously. In each case, time series were used 
to calculate Rj, the mean return, and 9, or $, the estimated risk measures im- 
plied by the growth-optimum or the Sharpe-Lintner model, respectively. 
Then, cross-sectional computations were performed for the regression models 


Ry = à, + 418, (10) 


and 


R, = bo 5,8, | (11) 


Estimated coefficients from (10) and (11) should be compared to their theo- 
retical counterparts: depending on which theory is correct, 8, or b, should equal 
Rr, the average risk-free interest rate, and 4, or 6, should equal unity. 

In the first test, all calculations were carried out with individual security 
returns during the same time period.* For example, 1192 separate values of 
R, 9, and 9 were obtained from weekly data covering the annual sub-period 
July, 1962 through June, 1963. Cross-sectional regressions using these 1192 
estimates gave 4, = 20.1 and 6, = 203 per cent. The value of Rp, as measured 
by the weekly average interest rate on short-term government debt obligations 
during the year, was only 3.27 per cent; so neither model satisfied its theoreti- 
cal prediction very well in this particular annual sub-period.?? Over all the 
seven years of available data, the estimated values of 4: and b: were very sig- 
nificantly positive and they were scattered around unity in nice accord with 
their expected level. à, and 6, were also reasonably close to Rp, at least on 
average. As a distinguishing test of the two competing theories, however, these 
results failed miserably; for the estimates were very highly correlated between 
the models. The two competitive intercepts were practically identical in every 


13. To be precise, 


and 
8, = [COv(RyR,,)/Var(Ry) VR, — Rp) 


where ^ indicates the sample analog of a population parameter, calculated from weekly observations 
over a specified period, and — indicates sample mean from the same period. 

14. To save computation expense, only New York Exchange listed stocks with at least 30 weekly 
quotations during a year were included in the sample. The risk-free rate was measured by a weekly 
“average of short-term government debt obligations” taken from Standard & Poor’s Trade Statistics. 
The market indexes used were: The S&P Composite Index and a rebalanced index constructed by 
weighting all NYSE Stock Returns equally each week, The results were very similar but only the 
results for the rebalanced index are quoted in the text. 

15. To save space, only one annual sub-period is reported here but all the results are available 
from the author upon request. ' 
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period as were the two competitive slope coefficients. They deviated much more 
from theoretical predictions than from each other. 

In order to sharpen the discriminatory resolution of the data, a different 
procedure!? was necessary. The first problem to be alleviated was an extremely 
low explanatory power which was indicated by low R*?'s (on the order of .05), 
for the cross-sectional models with individual stock returns. A technique to 
remedy this is to form portfolios of stocks and conduct cross-sectional tests on 
portfolios rather than on individual securities. A difficulty arises, however, be- 
cause randomly-selected portíolios would have very similar values of the risk 
measures ? and 9. To create a cross-sectional spread in risk measures, portfolios 
must be selected on the basis of risk by grouping stocks with the lowest $'s or 
5’s in one portfolio, stocks with higher 9’s or 5’s in the next portfolio, and so 
on. 

This procedure for forming portfolios makes another econometric problem 
obvious: In the cross-sectional regression, the explanatory variables contain 
errors which will tend to bias slope coefficient estimates toward zero." To 
alleviate this problem somewhat, stocks were placed in portfolios based on their 
risk measures calculated from weekly data of a given year and then the mean 
portfolio return and risk measures for the portfolio as a whole were calculated 
from weekly data in the subsequent year. Cross-sectional models (10) and (11) 
were then estimated using these latter estimates and the results are given in 
Table 2 and Figure 3. To recapitulate, the procedure which resulted in the 
output of Table 2 and Figure 3 was as follows: 


1. For each stock (j) in year t, the total risk premia ?, and 8, were calcu- 
lated from the time series of year t using the rebalanced market index 
(see note 7). 

2. Stocks were ranked from smallest to largest ?, and from smallest to 
largest 5,. 

3. Twenty portfolios were selected by assigning the lowest five per cent of 
ranked stocks to one portfolio, the next lowest five per cent to a second 
portfolio, etc. Thus, two sets of 20 portfolios each were formed; one set 
based on y rankings and one set on 5 rankings. 

4. Each portfolio’s mean return, R,, was calculated from time series in year 
t + 1. For each portfolio that had been formed on the basis of y rankings, 
the growth-optimum risk premium $, was calculated from year t + 1 
data. Similarly, the Sharpe-Lintner premium ô, was calculated from year 
t -+ 1 data for each portfolio that had been formed by 5 rankings. 

5. For each of six years (1963-64, 1964-65, . . . 1968-69), these calculated 
portfolio mean returns and risk measures are plotted in Figure 3 and 
regressions across portfolios are reported in Table 2. 


In Figure 3, the scatters of mean portfolio returns versus the two compet- 
ing risk measures are displayed for six different years. The solid lines mark 


16. Blume [1970], Miller and Scholes [1972], Black, Jensen, and Scholes [1972], and Fama 
and MacBeth [1972], have originated and developed the procedures used here in their work with 
the empirical validity of the two-parameter portfolio model. 

17. This is true, of course, for regressions using individual stocks as well as for those using 
portfolios. 
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Ficure 3 
Mean Portfolio Returns and Estimated Portfolio Risk Premia, New York Exchange Stocks, 
1962-1969, Rebalanced Index 
* (S-L) Denotes Sharpe-Lintner Risk Premia and (G-O) denotes Growth-Optimum Premia. 


the theoretical predictions, an intercept of Rp and a slope of unity.!* Table 2 
contains results from cross-sectional models (10) and (11) applied to these 
data. 

On average across the six years, both the growth-optimum and the Sharpe- 
Lintner model seem to have excessive intercepts, (4), b, > Ry), and deficient 
slopes, (A, b, < 1). The averages are given in footnote d of Table 2. These 
deviations from the anticipated can no doubt be attributed, at least in part, 

18. Since the axes are scaled differently in each plot, the lines do not appear to have slopes of 


unity upon first examination. Note that the plotted lines are the theoretical predictions and are 
not the regression lines reported in Table 2. 
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to errors in the measurement of risk premia. In the first two years and in 
year 5 (1967-68) the wide scatter suggests that either the mean portfolio 
returns or the risk premia were measured inaccurately. 

In the two years of best fit, 1965-66 and 1966-67, portfolios with low risk 
premia return less than anticipated while portfolios with high premia return 
more. This is unlikely to be just a sampling phenomena if the standard errors 
of à, and D, are credible. For example, the growth-optimum slope for 1965-66 
is 2; = 1.85 which is nearly eight standard errors above unity.'? 

Perhaps the most striking characteristic of the two models is their very 
close relation over time. The slope coefficients and intercepts given in Table 
2 vary widely across periods but are extremely close, between the two models, 
in each period. This is obvious from even a quick look at Figure 3.?? Based on 
these results, one can only conclude that the two models are empirically 
identical. A qualification is in order, of course; assets whose returns are much 
more highly skewed (e.g., warrants), may permit a finer discriminatory test. 


V. SUMMARY AND CONCLUSIONS 


If investors wish to maximize the probability of achieving a given level of 
wealth within a fixed time, they should choose the “growth-optimum”’ port- 
folio; that is, the portfolio with highest expected rate of increase in value. 
This paper has examined the implications for observed common stock returns 
of all investors selecting such a portfolio. 

Given some widely-used (and useful) aggregation assumptions, the growth- 
optimum model implies that the expected return ratio E[ (1 4- Ra + Ra)] 
is equal for all securities. This implied equality of expected return ratios 
was utilized in analysis of variance tests with New York and American Ex- 
change listed stocks from 1962-1969 in order to ascertain the basic validity 
of the growth-optimum model. The model was well-supported by the data. 

The growth-optimum model was compared algebraically to Sharpe-Lintner 
theory, which is probably the most widely-used portfolio result in empirical 
work, A close correspondence was demonstrated between their qualitative im- 
plications, For example, both models imply that an asset’s expected return 
will equal the risk-free interest rate if the covariance between the asset’s return 
and the average return on all assets, Cov(R,, Rm), is zero. For most cases, 
the growth-optimum model also shares the Sharpe-Lintner implication that an 
asset's expected return will exceed the risk-free rate if and only if Cov(R,, Rm) 
> 0. There are, however, some cases of highly-skewed probability distributions 
where this implication does not follow.for the growth-optimum model. 

A close empirical correspondence between the two models was demonstrated 
for common stock returns. The procedure (1) estimated returns and risk premia 

19. For a more detailed discussion of this point, see Friend and Blume [1970]. 

20. The greatest difference between ay and 6, is .02 which is only about 2.3 per cent of their 


average value. Between Ay $y, the greatest difference is about six per cent of their average value. 

A close connection between the two models was previously implied by the work of Young and 
Trent {1969}. They showed that the geometric mean of portfolio returns was closely approximated 
by functions of the arithmetic mean and variance of returns. These functions were developed as 
approximations to the geometric mean. For accuracy, they require a minima] amount of skewness 
and are, therefore, analogous to the truncated (after two terms), Taylor series expansions of 
log, (1-- R). 

21. R, is the rate of return on security j and R,, is the rate of return on a portfolio of all assets. 
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implied by the two models from time series; (2) calculated cross-sectional 
relations between estimated returns and risks; and (3) compared the cross- 
sectional relations to the theoretical predictions of the two models. They could 
not be distinguished on an empirical basis. In every period, estimated corre- 
sponding coefficients of the two models were nearly equal; and indeed, they 
deviated much further from their theoretically anticipated levels than they 
deviated from each other. 
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Introduction to the Classic Papers and Theories 


In this part of the book, we present papers that formalize, generalize, and extend 
the early results on the Kelly strategy. Although the early papers on the Kelly 
optimal growth strategy contained powerful results, the topic was not significant 
in mainstream financial economics. Unfortunately, despite lots of good evidence, 
it is still not a serious part of academic financial economics. Roll (1973), for ex- 
ample, shows that in his data set, capital growth and mean-variance portfolios are 
similar. Thorp (1971) had shown that the Kelly strategies were not necessarily 
mean-variance efficient and Markowitz (1976) argued that the Kelly strategy was 
the limiting mean-variance portfolio. These papers appear later in this book. Here 
we take up the work of those who extended the early results and begin to consider 
the good and bad properties of these strategies and the sensitivity of the results to 
data inputs and errors. 

Bell and Cover (1980) consider the static one period behavior of two investors. 
They each can allocate among m stocks and have one dollar to invest. The winner 
has the most money after one period's returns of the portfolio. They show that the 
optimal onc period policy is an appropriately randomized version uA* of the Kelly 
growth rate optimal portfolio A*. Then for any other portfolio A, Pr(W(A) > 
W(uA*) € i. So this establishes a good short run property in addition to the 
previously known good long term asymptotic Kelly criterion properties. In part IV, 
we review the good and bad properties of the E log Kelly capital growth criterion. 

Algoet and Cover (1988) generalize the classic Breiman (1961) results to gen- 
eral dependent asset return distributions. Breiman assumed iid asset returns. So 
maximizing conditionally expected log in each period t given information up to t 
is optimal in that it maximizes the asymptotic growth rate of wealth for arbitrary 
asset returns. Algoet and Cover (1988) and Thorp (2006), in part VI, present the 
most general Kelly asymptotic optimality properties. 

Cover (1991) presents an algorithm and theory for a universal portfolio that will 
perform as well as if the investor knew the realized distribution of the future asset 
returns. There are no assumption on these asset returns. The universal portfolio 
strategy is based on the past returns z;,..., x: .1 up to the given period t, and will 
perform asymptotically as well as the best constant rebalanced portfolio based on 
foreknowledge of the sequence of price relatives. The universal portfolio in period 
1 is uniform equally weighted over all the stocks, and the portfolio in period t is 
the performance weighted average of all constant rebalanced portfolios. Examples 
in Cover's paper show the exponential outperformance of the universal portfolio 
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with respect to the assets used to construct this portfolio. Cover assumes zero 
transaction costs. 

Ordentlich, E. and T. M. Cover (1998) greatly refine the statements in the 1991 
paper and provide exact results for the minimax relative behavior of an investor's 
wealth with respect to the wealth of the universal investor. 

When one sees the naturalness of growth optimality as a goal in investment and 
then finds the natural mathematics behind it (to maximize the geometric mean), 
it is tempting to think that the optimal strategy, which turns out to be Kelly 
gambling or, if you will, log optimal investment, has many other properties as well. 
Such correspondences certainly occur in mathematics. For example, numbers like 
mw and e occur in hundreds of different contexts and are characterized by the set of 
all problems that give 7 and e as answers. 7 and e do not have just one defining 
property but many. Looking at it from this point of view, we ask for the properties 
of growth optimal investment beyond that of growth optimality itself. 

The first question is what are the short run properties of growth optimal invest- 
ment? Is Kelly also good for just one investment period and if so in what sense? 
Bell and Cover (1980) find the strategy that maximizes the probability of outper- 
forming one’s opponent during a single investment period. The optimal one period 
strategy is to choose a fairly randomized version of the portfolio that maximizes 
the expected logarithm of wealth. A generalization of this to other payoff func- 
tions appears in Bell and Cover (1988). All of these generalizations are achieved 
by randomized version of the log optimal strategy. Logarithms are nowhere in the 
statement of the problem. Is this a fortuitous coincidence or is it a property of the 
naturalness of the approach? 

Next, we turn to Kelly and the value of side information. Kelly proves in his 
classic 1956 paper that the mutual information I(X;Y) between a horse race mar- 
ket! X and side information Y gives an increase, AW = I(X;Y), in the growth 
rate of wealth in repeated investments. This is generalized in Barron and Cover 
(1988) to general stock market distributions. The mutual information is always an 
upper bound, AW < I, on the increase in the growth rate of wealth, with equality 
if and only if the underlying stock market distribution is a horse race market. 

A summary of the properties of Kelly gambling and growth optimal investment 
is given in Cover and Thomas (2006), Chapter 6 (Gambling and Data Compression) 
and Chapter 16 (Portfolio Theory). 

Algoet and Cover (1988) use a simple sandwich argument to prove that the time 
average of the log wealth converges to a constant with probability one for any ergodic 
market. This, then, gives the proof of the famous Shannon MacMillan Breiman 
theorem as a special case in which the underlying market is a horse race, that 
is, there is one winner from all the entrants. This work also shows the asymptotic 
optimality of the investment scheme maximizing the conditional expected logarithm 
of wealth given the available past. These results are simplified and summarized in 
Cover and Thomas (2006, Chapter 16). 


1By a horse race market we mean a situation where out of N entrants, only one wins. 
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The idea of a universal portfolio is to asymptotically achieve the same long-run 
growth rate as if one had known ahead of time the realized distribution of the stock 
market outcomes. This parallels universal data compression, where one wants a 
data compression scheme that works for any source of data, like music or voice or 
text, and compresses it to the limit that one could achieve if one knew ahead of time 
whether it was music or voice or text. The answer is Yes, there is such a scheme, 
and with a negligible cost of learning which washes out over time. Cover (1991) is 
the first in the series of three papers. Cover and Ordentlich (1996) and Ordentlich 
and Cover (1998) refine the statements in the 1991 paper. The latter gives exact 
results for the minimax relative behavior of an investor's wealth with respect to the 
wealth of the universal investor. 

In summary, Kelly investing and its generalization to expected log optimal in- 
vesting have good short run as well as long run properties. Furthermore, W,/W; is 
a martingale, where W% is the wealth generated by Kelly investing, and W,, is the 
wealth generated by any other investment scheme. Moreover, there is an asymp- 
totic equipartition result that says that iln W7 converges to W, where W is the 
maximum expected log wealth conditioned on the infinite past. These are some of 
the natural properties of the Kelly criterion. 

Finkelstein and Whitley (1981) generalize the Breiman (1960, 1961) results to 
iid assets that are not necessarily discretely distributed as Breiman assumed, but 
to arbitrary iid random variables with finite expectations. They then show how 
Breiman's basic results can be easily proved. For non iid assets the reader is referred 
to Algoet and Cover (1988) and Thorp (2006) in this book. 

Chopra and Ziemba (1993) show how sensitive mean-variance optimization port- 
folios are to errors in the estimates of the input parameters. Mean-variance analysis 
assumes that all the means, variances and covariances are constant and known. In 
practice, they are estimated. Earlier studies by Kallberg and Ziemba (1981, 1984) 
showed that the errors in the means are the most important by far, being about 
ten times errors in variances, which are about twice as important as the co-variance 
errors. The loss is measured in certainty equivalent terms. Chopra and Ziemba 
(1993) refine the analysis and investigate the effect of the investor’s risk aversion. 
They show that the relative importance of the mean errors increases as the risk 
aversion decreases. So instead of a 20:2:1 ratio of importance its more like 60:3:1 
for low risk aversion and for extremely low risk aversion like log, it is well over 
100, so the ratio is about 100:3:1. So Kelly bettors must accurately estimate 
their parameter values especially the means so as not to get into an overbet sit- 
uation. Geyer and Ziemba (2008) show in a five year period asset-liability model 
that the sensitivity is huge in period 1, low in period 2 and by period 5, essentially 
non-existent. 

MacLean, Ziemba and Li (2005) consider à dynamic investment problem where 
there are upper and lower limits on wealth. Instead of typical rebalancing at discrete 
fixed or variable points in time one rebalances when one of the limits is reached. 
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Assets are assumed to be lognormally distributed and the investor's goal is to mini- 
mize the expected time to reach the upper goal while maintaining a high probability 
of reaching that goal before falling to the lower wealth limit. The optimal strategy 
is fractional Kelly, being a blend of the Kelly expected log maximizing strategy and 
cash and is, under the log normal asset assumptions, equivalently represented by a 
negative power utility function. One has in this case the handy rule f — iz where 
f is the Kelly fraction and a < 0, is the parameter of the negative power utility 
function aw®. By rebalancing when control limits are reached, the time to wealth 
goals approach provides greater control over downside risk and upside growth com- 
pared, for example, to an expected utility approach with fixed rebalancing times in 
an asset allocation problem with stocks, bonds and cash. 

Evstigneev, Hens and Schenk-Hoppé (2009) survey current research in evolution- 
ary finance focusing on the survival and stability properties of Kelly-type invest- 
ment strategies. The approach to the study of dynamic wealth evolution follows 
Darwinian ideas of selection and mutation in N-person game theoretic markets. 
The result is an alternative to general equilibrium economic models. The setting 
is more broad than general equilibrium as well because only historical and current 
data influence the agents’ behavior. No agreement about the future or coordinated 
behavior of the agents is required. The evolutionary model does not use unobserv- 
able agents characteristics such as subjective beliefs or utilities. Individual goals 
of investors are described in terms of properties such as survival with evolutionary 
stability holding almost surely rather than in terms of expected utility maximiza- 
tion. The Kelly rule insures the survival of those traders following this rule and 
yields global evolutionary stability. 

Lv and Meister (2009) discuss the Kelly criterion in continuous time when the 
asset returns are multivariate Ornstein-Uhlenbeck mean-reverting processes assum- 
ing that there is a complete market. They develop the existence of the optimal 
self-financing trading strategy and the explicit form of the associated optimal in- 
vestment fraction of the investor’s current wealth. 
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Consider the two-person zero-sum game in which two investors are each allowed to invest 
in a market with stocks (X5, X5,..., Xm) — F, where X, > 0. Each investor has one unit of 
capital. The goal is to achieve more money than one's opponent. Allowable portfolio 
strategies are random investment policies B € R",B 20, E»7.., B, = |. The payoff to player | 
for policy Bj vs. B2 is P(B1X » B5X). The optimal policy is shown to be B* = Ub*, where U 
is a random variable uniformly distributed on [0, 2], and b* maximizes £ In b'X over b 20, 
Do, = 1. 

Curiously, this competitively optimal investment policy b* is the same policy that achieves 
the maximum possible growth rate of capital in repeated independent investments (Breiman 
(1961) and Kelly (1956)). Thus the immediate goal of outperforming another investor is 
perfectly compatible with maximizing the asymptotic rate of return. 


1. Introduction. An investor is faced with a collection of stocks (Xp Xp ..., Xm) 
drawn according to some known joint distribution function F. We shall assume that 
stock values X; are nonnegative. A portfolio is a vector b=(b,,....6,,)'. b; 2 0, Xb; 
=|, with the interpretation that b; is the proportion of capital allocated to stock i. 
The capital return S from investment portfolio 6 is 


S-bX-DbX. (1) 


How should b be chosen? A currently accepted procedure is the efficient portfolio 
selection approach of Markowitz (1952, 1959). A portfolio b is said to be efficient if 
(Eb'X, Var b'X) is undominated. Criticisms of this approach are many. Only the first 
two moments are used in the analysis; there is no optimality of this procedure with 
respect to other obvious investment goals, and no choice procedure among the 
efficient portfolios is provided. (See Thorp (1971) and Samuelson (1969) for further 
comments.) Also, such a portfolio is not necessarily admissible [Hakansson (1971. p. 
529), Thorp (1971, p. 20)] in the sense that it may be stochastically dominated by 
some other mixture S. 

Another criterion for selecting b, that of maximizing £ In S, has been put forth by 
Kelly (1956) and Breiman (1961), and persuasively advocated by Thorp (1969, 1971. 
1973). (Also see Latané (1959) and Williams (1936).) This portfolio is admissible, since 
it maximizes the expectation of a monotonic function of S. The resulting portfolio 
investment policy b* has been demonstrated by Breiman to have the following 
properties: 

Pl. In repeated independent sequential investment, b* maximizes lim inf(1/ »)In S,. 
Thus the asymptotic “interest rate" 1s maximized. 
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P2. The time required to achieve a certain capital 4 is minimized by b7 (in a sense 
that can be made precise). in the limit as A > x. 

Yet b* is not accepted in current economic practice. Perhaps one reason is that 
maximizing Eln S suggests that the investor has a logarithmic utility for money. 
However, the criticism of the choice of utility functions ignores the fact that maximiz- 
ing E In S is a consequence of the goals represented by properties PI and P2, and has 
nothing to do with utility theory. (See Thorp (1971).) A thorough discussion of 
investment strategies and their relation to utility theory is developed in Arrow (1971). 

We are not interested in utility theory in this paper insofar as utility theory deals 
with the consistency of subjective preferences. We wish instead to emphasize the 
objective aspects of portfolio selection, i.e., properties of optimal portfolios that have 
some objective appeal. In particular we wish to add another goal to the list Pl, P2. 
namely that of outperforming another investor (or even of outperforming oneself with 
respect to what one could have done). If we can show that all three goals are uniquely 
achieved by a given policy, we are on our way to making an objective case for 
utility-independent optimality of the stated portfolio. 

An objection to Pl and P2 and, by extension, to 5*, held by Samuelson (1967, 1969) 
and others, is that not all investors are interested in long term goals. Samuelson (1969, 
p. 245) writes, “Our analysis enables us to dispel a fallacy that has been borrowed into 
portfolio theory from information theory of the Shannon type. Associated with 
independent discoveries by J.B. Williams (1936), John Kelly (1956), and H.A. Latané 
(1959) is the notion that if one is investing for many periods. the proper behavior is to 
maximize the geometric mean of return rather than the arithmetic mean. [ believe this 
to be incorrect (except in the Bernoulli logarithmic case where it happens to be correct 
for reasons quite distinct from the Williams-Kelly-Latané reasoning)... . [t is a 
mistake to think that. just because a w** decision ends up with almost-certain 
probability to be better than a w* decision. this implies that w** must yield a better 
expected value of utility.” 

Another possible objection is that b* may be too conservative, since it optimizes a 
concave (risk averse) function of the return S. One interpretation of "too conserva- 
tive" could be that 6* will be outperformed (i.e.. PX >b*' X) with high probability by 
a more ambitious policy b. Alternatively. too conservative might mean that with 
substantial probability 6* will be outperformed by a large factor (i.e., fX > cb*'X, for 
some constant c > |) by a more risky policy b. Thus a reasonable goal for an 
individual investor or a mutual fund would be good short term competitive perfor- 
mance. 

With the above objections in mind, we are led to the analysis of one-stage 
investments. Consider the two-person zero-sum game in which two investors seek 
portfolio policies that are competitively best in the sense that at least half the time one 
achieves more capital than one's opponent. Surprisingly. the game theoretic optimal 
strategy will be shown to be Ub* where U is an independent uniform [0, 2] random 
variable, and 6* is the same log optimal policy as before. Furthermore. among 
non-randomized strategies, b* is shown to be competitively best in the sense that it 
will not be beaten by very much very often. Thus the alleged conservatism of 6* must 
be established on other grounds; and the short term value of Ub* is established 
competitively. 

In the next section. we shall argue for the naturalness of the random variable U in 
the competitive investment game. Theorem 1, establishing C^* as the solution of the 
game, will be proved in $3. 


2. A game-theoretic digression. Before proceeding, we establish the necessity 
for randomization in the competitive investment game. 
Suppose 2 players each have | unit of capital. Their competitive positions are equal. 
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However. let us now suppose that player 2 has available to him any fair gamble 
(EX = |, X > 0). By selecting the distribution of the gamble X judiciously, he can 
beat player 1 with probability 1 — e. Simply let P(X =1/(1 — €) = 1— e, P(X =0) 
=e. Then P(X > 1) = 1 — e. Therefore, player | must protect himself by randomizing 
his capital. This is a purely game theoretic maneuver and has nothing to do with 
maximizing investment return. 

We now solve the following two-person zero-sum game. Let players 1 and 2 choose 
df.’s F and G, respectively, [x dF = fy dG = 1, F(0 ) = G(0 ) 2 0. Assume X ~F 
and Y — G are independently drawn. The freedom of choice we allow in the choice of 
F and G makes physical sense, since any capital distribution F(x), [x dF(x) = 1, 
F(0 ) = 0, is achievable from initial capital 1 by a sequential gambling scheme on fair 
coin tosses (Cover (1974)). The payoff to player 1 is 


P(X > Y)- [GaF. (2) 


Lemma. The value of this game is 1, and the unique optimal strategies are 


2 O0O<t<2 
F*(r)- G*(r) = K > , 3 
(= 6*0) 7| 32 3) 

Proor. For F* and for any G, 
[^] 
P(Y > X)» f F*dG= [ min(1t/2, 1) dG(r) 
0 

< 1 [raeQ) =i. (4) 


Thus F* achieves } against any G. 

Uniqueness of the optimal distribution F*(x) is proved by assuming P(Y > X) <4 
for (i) Y uniform [0, 2], Gi) Y a two point distribution at O and a point c € [1. 2], and 
(ii) Y a two point distribution at c € [0, 1] and 2. Then (1) — F*(2) = 1; (1) => F*(c) 
€&c/2, 1&c€2; and (iii) F*(c) < c/2, O< c < 1. Since ft dF*(r) 2 l|, we see 
F*(t) = 1/2, 0 < t < 2. The proof of the uniqueness of G* follows by symmetry. 

We see that a gambler must exchange his unit capital for a r.v. U uniformly 
distributed on [0, 2] in order to protect himself. We mention parenthetically that one 
way to achieve this on a sequence of fair coin flips is to divide the one unit of initial 


capital into piles of size (1), ¿= 1,2, ..., then bet the / pile on the outcome of the 
i" coin flip. Letting ww»... beiid. Bernoulli (1/2) r.v.'s, we have the return 
9o " 
S = 3 2o27'7 1.00304... 


il 


in binary, which is clearly uniformly distributed on [0, 2]. 


3. The competitive investment game. Let B be the set of all r.v.'s B — (B, 
B,,..., B,), B20, ae, EO, B= 1. Note that the random investment policy B 
can be achieved by first exchanging the | unit initial capital for a fair random return 
W drawn according to the distribution of S7_, B,. Observe that W > 0, EW -— I. 
Then W is distributed across the stocks according to the conditional joint distribution 
of (B,, B4, . - . , B,) given 7, B; = W. The latter distribution can be performed on 
paper. Happily, the allowed conditional randomization is not necessary in the game 
theoretic optimal policy in Theorem 1 below. 

Let the investment vector X = (Xp Xz.. X,) bea r.v. with known distribution 
function F(x). We assume that X > 0, a.e. To eliminate degeneracy, we also assume 
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that 


~œ «sup£ In bX« x, 
b 


Consider the two-person zero-sum game in which players 1 and 2 choose B® 
€ 34,8? © 9$, and player 1 receives payoff 


P {B° x > BOX). (5) 
It is assumed that BÓ), B® and X are jointly independent. 


THEOREM |. The solution for the competitive investment game is B* = Ub*, where U 
is unif. on [0, 2], independent of X, and b* maximizes Ein P'X. The value of the game 
is i. 


PRoor. The Kuhn Tucker Theorem (1951) implies that the 6* maximizing 


E In $7. | bj X, subject to the constraint Xb, = l. b, > 0, satisfies 
E x =A), bë > 0. (6) 
D OFX, | <A, b* =0,7= 1.2,....m, 
where A is chosen so that X b* = 1. But we see A = 1. since 
A= DFA = MOF EX, /(S HF X;) 
= E(X bf Xj) = 1. (7) 


We now investigate the payoff of B* = Ub* against any other tnvestment policy 
Bes: 


U «( B' X)/(b*' X)) < 1/2E(( B' X)/(b*' X)) 


| 


=1/2 5 EB,E(X,/(X b? X;)) < 1/25 EBA 


i=l 
=A/2EDB, =d/2= 1/2. (8) 
Thus B* = Ub* achieves the value of the game against any B, and the proof is 
complete. 
The above strategy B* can be implemented by first exchanging the | unit initial 
capital for the fair gamble U, uniformly distributed over [0, 2], then distributing U on 
the investments according to the solution 6* maximizing E In b'X. 


This result can be generalized to show that B* — Ub* will not be beaten by very 
much very often: 0 


COROLLARY l. P(B'X > cUb*X) «ic. for all B & h.c » 0. 


PROOF. P(cU«B'X/b*'X) < (O)E(B'X/b*' X), and the proof proceeds as in 
Theorem I. 
Dropping the randomization U increases this probability by at most a factor of 2: 


COROLLARY 2. 

P(B'X»cb* X)«l/c. forall B€5,c»Q. (9) 
PRoor. By Markov’s inequality and (8), 

P( BX > cb" X) «(VMo)E(B X/ b*' X) < Uc. 
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REMARK |. This is the best that can be attained by any nonrandomized strategy. 
as can be seen from the discussion at the beginning of 82. 

Remark 2. Corollary 2 bears a strong resemblance to Markov’s lemma, ie.. 
Y 2>OQ EY = p= P(Y > ch) € l/c. Vc > 0. This suggests that 5*'X acts like the fixed 
amount of capital p in Markov's lemma and that 5*'X can be changed from a 
competitive standpoint only by fair randomization. Inequality (9) is true despite the 
fact that EBX may be greater than Eb*'X. 


4. Example: The St. Petersburg paradox. In the St. Petersburg paradox, a 
gambler pays an entry fee c. He receives in return a random amount of capital X. 
where P(X 22) 22 im 1.2..... oo. Note that EX = oc. 

Suppose that a gambler has total initial capital Sọ. He is allowed to receive | unit of 
St. Petersburg investment for each c units that he pays as an entry fee. Let him invest 
the amount 6S), 0 < b < 1, and retain (1 — 6)S, in cash. Thus his return S is given by 
S = S,((1 — b) + (b/c)X). In the framework of the previous sections, the investment 
vector is X = (X X5 = (1. X/oy.. 

Let b* € [0, 1] maximize E In S. We calculate 


dE\nS  , ~1+X/e 
db > (1—b) + (b/c)X 
~Syfiy Q/e-0 
-X() b/c +(1-— b) ` (19) 


Letting b = 1, we see that dE ln S/db = 1 —(c/3), which is 20 for c « 3. Thus 
b* = |, forO < c < 3. Fore > 3. the solution b* to (10) tends monotonically to zero as 
the entry fee c > oo. Finally it can be seen that 6* and max Eln S are always strictly 
positive. 

Investing a proportion of capital b* guarantees that 

(1) The investor is acting in accordance with an investment policy maximizing 
lim inf(1/ n)In S,. regardless of whether or not the other investment opportunities are 
of the St. Petersburg form; and 

(2) the investor investing Ub* is competitively optimal in the St. Petersburg game. 

Moreover, we see that all entry fees c are "fair." However, the proportion b* of 
total capital invested varies as a function of c. Also. b* is independent of the total 
initial capital So. 

Finally, if the investment fee is low enough, i.e., 0 < c € 3, then b* = | and all of 
the capital is invested. This results in S, ~ S(4/c)" in the sense that (1/n)log, S, 
252-log;c,for0« c <3. 


5. Conclusions. 1t should now be clear that the investment policy 5* achieving 
max £ 1n b'X has good short run as well as good long run properties. In addition, b* is 
admissible in the sense that no other policy P stochastically dominates 6*. 

We wish to comment on the use of Ub* (as opposed to b* alone) in practice. We 
have seen in $2 that the use of U is a purely game theoretic protection against 
competition and has nothing to do with increasing a player's capital. Thus we feel that 
b* alone is sufficient to achieve all reasonable competitive investment goals, and we 
do not choose to advocate the additional randomization U. 

Finally, it is tantalizing that 5* arises as the solution to such dissimilar problems as 
maximizing lim inf(1/7)In S, and maximizing P ( BiX »B5X). The underlying reason 
for this coincidence will be investigated. 
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Abstract —It will be shown that each bit of information at most doubles 
the resulting wealth in the general stock market setup. This information 
bound on the growth of wealth is actually attained for certain probability 
distributions on thé market investigated by Kelly. The bound will be shown 
to be a special case of the result that the increase In exponential growth of 
wealth achieved with true knowledge of the stock market distribution F 
over that achieved with incorrect knowledge G is bounded above by 
D(F||G), the entropy of F relative to G. 


l INTRODUCTION 


Let X 20, X € R” denote a random stock market vector, with 
the interpretation that X, is the ratio of the price of the ith stock 
at the end of an investment period to the price at the beginning. 
Let B= (be R”; b, 2 0, Y 3b; —1), be the set of all portfolios 
b, where b, is the proportion of wealth invested in the ith stock. 
The resulting wealth is 


S=} bX -Vx. 
1-1 
This is the wealth resulting from a unit investment allocated to 
the m stocks according to the portfolio b. 


IL DOUBLING RATE 


Now let F(x) be the probability distribution function of the 
stock vector X. We define the doubling rate W(X) for the market 


by 
(2) 


The units for W are “doubles per investment.” All logarithms in 
this correspondence are to the base 2. Let b* = b*( F) denote a 
portfolio achieving W(X). Note that W(X) is a real number, a 
functional of F; the apparent dependence of W on X is for 
notational convenience. 

Necessary and sufficient conditions for b to maximize E log / X 
are 


W(X) 7 max flog /xdF(x). 


X 
Egy ^b for bj» 0 
X 
EX x1, for b, - 0. (3) 


These are the Kuhn- Tucker conditions characterizing 5*( F) (sec 
Bell and Cover [3], Cover [4], and Finkelstein and Whitley [5]). 
If current wealth is reallocated according to b* in repeated 
independent investments against stock vectors X,, X,,--- inde- 
pendent identically distributed (i.i.d.) according to F(x), then the 
wealth $5 at time n is given by 
So = F] "x, 


i=l. 


(4) 
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The strong law of large numbers for products yields 
(s?) e 20/mZf-ilog "X, , 2, 


(5) 
with probability one. Moreover, no other portfolio achieves a 
higher exponent (Breiman (1]; Algoet and Cover [9]. 

Now suppose side information Y is available. Here Y could be 
world events, the behavior of a correlated market, or past infor- 
mation on previous outcomes X. Again we define the maximum 
expected logarithm of the wealth, but this time we allow the 
portfolio b to depend on Y. Let the doubling rate for side 
information be 


W(XIY) = max f fog (y) xdF( x, y) (6) 


and let 5*( y) = b°( Fy,,) be the portfolio achieving W(X|Y). It 
can be shown that 5*(y) maximizes the conditional expected 
logarithm of the wealth E(log FX|Y — y). 

In repeated investments against X,, X, -:, X, where (X, Y,) 
are ii.d. ~ F(x, y), and b*(Y,) is the portfolio used at investment 
time i given side information Y, we have resulting wealth 


a 


Sot = I P" (Y)x, () 
with asymptotic behavior 
(527) S 2m (8) 


with probability one. It follows that the ratio of wealth with side 
information to that without side information has limit 


g*a” 
E 2, DHONI) WOO) 


S^ (9) 
with probability one. 
Let the difference between the maximum expected logarithm 
of wealth with Y and without Y be 


(10) 


Thus À is the increment in doubling rate due to the side 
information Y. It is this difference that we wish to bound. As an 
example, if A=1 then the information Y yields an additional 
doubling of the capital in each investment period. Finally, we 
observe from (6) and (2) that Az 0. Information never hurts. 
Kelly [6] identified A with the mutual information for a “ horse- 
race” stock market, a result we will generalize here. 


A«W(X|Y) -W(X). 


TI. MUTUAL INFORMATION AND RELATIVE ENTROPY 


The relative entropy (or Kullback Leibler information number) 
of probability distributions F and G is 


D( FIG) = ftog( 4/8) dF a) 


where f and g are the respective densities with respect to any 
dominating measure. (Nove: D is infinite if g(x) is zero on a set 
of positive probability with respect to F.) 

The relative entropy may be interpreted as the error exponent 
for the hypothesis test F versus G (Stein’s lemma; see Chernoff 
[2]. Another interpretation of the relative entropy for a discrete 
random vector X ~ P is that D(P||Q) is the expected increase in 
description length of the Shannon-Fano code based on the 
incorrect distribution Q. 

Let X,Y be two random variables with joint distribution Pyy. 
The relative entropy between the conditional distribution Pyy 
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and the marginal distribution Py is the mutual information 
IOG Y) = f DU Psy IP) Py (dy) = D(PyvllPxP 3). (12) 


Of the many alternative expressions for I, the most evocative'is 

the identity 

I(X;Y)  H(X)—- H(X|Y) (13) 
where H(X) is the entropy of X and H( X|Y) is the conditional 
entropy. Thus / is the amount the entropy of X is decreased by 
knowledge of Y. One can compare (13) with (10) to see why a 
relationship between A and / might be expected. 

The mutual information / can also be interpreted as the 
information rate achievable in communication over the commu- 
nication channel P(x, y) There is also an interpretation of 
I( X; Y) in terms of efficient descriptions. Since H( X) bits are 
required to describe the value of the random variable X (if X is 
discrete), and since H( X|Y ) bits are required to describe X given 
knowledge of -Y, the decrement in the expected description length 
of X is given by H(X)— H(X|Y) = 1(X; Y). 

In summary, the mutual information /( X; Y) is 1) the decrease 
in the entropy of X when Y is made available, 2) the number of 
bits by which the expected description length of X is reduced by 
knowledge of Y, 3) The rate in bits at which Y can communicate 
with X by appropriate choice of Y, 4) the error exponent for the 
hypothesis test ( X, Y) independent versus ( X, Y) dependent. 


IV. PORTFOLIOS BASED ON INCORRECT DISTRIBUTIONS 
Suppose that it is believed that X — G(x) when in fact X — 
F(x). Thus the incorrect portfolio 5*(G) is used instead of 
b*( F). The doubling rate associated with portfolio 5 and distri- 
bution F can be written 


W(b, F) = " log FxdF(x) (14) 
with resulting growth of wealth 
S e 20s P) (15) 
The decrement in exponent from using 5*(G) is 
AW(F,G) - Wr F),F) - W(b'(G), F). (16) 
The following theorem is central to our results. | 
Theorem 1: 
Os AW( F,G) s D( FIG). (17) 


Proof: The first inequality 0 < A follows by the optimality of 
b*(F) for the distribution F.. The second inequality A < D is 
shown to be a consequence of the optimality of 5*(G) for the 
distribution G. Let F and G have densities f and g with respect 
to some dominating measure. The result A < D is trivially true if 
D(F||G) = oc, so it is henceforth assumed that D is finite (whence 
F«G) 

Let. 


(18) 


- be the wealth factors, corresponding to the optimal portfolios 
with respect to F and: G. From the Kuhn- Tucker conditions the 
wealth factor $, is’strictly positive with probability one’ with 
respect to G (and with ‘respect to F since F « G). It follows 
(again since F & G) that the set A={x: S,>0, f(x)» 0, 


St-b"(F)X S-b"(G)X 
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8(x) > 0) has probability one with respect to F. Then 


AW(F,G) = fos 
-fpl È 


-f os| $ s) dE + D( FIG) 


Hb 
se 


slog f S do D( FIIG) 


s D( FIIO) (19) 
where the first inequality follows from the concavity of the 


logarithm and the second from the Kuhn-Tucker conditions for 
the optimality of 5*(G) for the distribution G. 


We can improve Theorem 1 by normalizing X. Let P denote 
the distribution of X/LX,. We note that E(log /X/b, X) de- 
pends on the distribution F(x) only through the distribution of 
X/Vj- X - F. 

Corollary: 

AW( F,G) < D( FyjG). 


Remark: Another relationship between W and D is shown by 
Mori [13]. The doubling rate W = W(6*(F), F) is equal to the 
minimum of D( FIG) over all distributions G for which E; X, <1, 
for i-1,2,---,m 


V. THE INFORMATION BOUND FOR SIDE INFORMATION 


We now. ask bow A and / are related for the stock market. Wc 
have 


A= Elog c (20) 
and " 
f 

BIOL ` (21) 


where (X, Y) ~ F(x, y). The first involves wealth and depends 
on the values X takes on. The second involves information and 
depends on X and Y only through the density f(x, y). The 
following theorem establishes that the increment A in the dou- 
bling rate resulting from side information Y is less than or equal 
to the mutual information /. 


Theorem 2: 
0sAxI(X;Y). (22) 
Proof: For any. y, let Py, be the conditional distribution for 
X given that Y= y and let Py be the marginal distribution for X. 
Also let 5** = 5*(Py,,). Apply Theorem 1, with Py,, and Py in 
place of F and G, respectively, to obtain 


war 
os elos tur r->] S D( PriyllPx)- (23) 
Averaging with respect to the distribution of Y yields 
O<AsI. (24) 


Remark: An alternative proof of this theorem, based on money 
ratio tests and Stein's lemma, appears in [7]. 


VI. SEQUENTIAL PORTFOLIO ESTIMATION 


Here we show that a good sequence of estimates of the true 
market distribution lcads to asymptotically optimal growth rate 
of wealth. First we generalize Theorem 1 to handle the scquential 
setting. 


A Bound on the Financial Value of Information 
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Let X,, X;,---, X, bea sequence of random stock vectors with 
joint probability distribution P". The log-optimal sequential 
strategy uses the portfolio b* = b*( Pe, ,, ), which maxi- 
mizes the conditional expected value of log PX; given that X, = 
xut Xia m xia Suppose that instead of 5*, we use portfolios 
b; = b*(Qx s, ,) Which are optimal for an incorrect distribu- 
tion Q” for the sequence X,,---, X,. 

Let Pyiy x, and Qyjy ., x, , be the regular conditional 
distributions associated with P" and Q", respectively. Wc com- 
pare the resulting wealth 


$7 TI 5x (25) 
i-i 
with thc wcalth 
Sst = [|X (26) 
i~i 
Theorem 3: 
s; 
Os E log = € < D( P10"). (27) 
Proof: Application of Theorem 1 shows that 
hae 
O< E| Hog z— |XX, 
E og BX, ') pO’ Aia 
S D( Py yx Oxy). (28) 


Averaging with respect to the distribution of X'^! =(Xp ^, 
X,_,) and then summing for | —1,2,- -:, n yields 


n b*X 
0< Y; E| teg —— 
E | 271 


SY ED( Pye llOxjx) 
i=l 


= D(P"jjQ") (29) 


by the chain rule, completing the proof. 


Suppose X,,X,,:-- are independent with unknown density 
p(x). Clearly, the optimal portfolio 5* does not depend on the 
time i or on the past. However, if p(x) is unknown, a series of 
estimators of the distribution BG ) corresponding to density 
estimators p,(x) based on the past X'^' may be used to obtain 

asymptotically optimal portfolios 5, = = b*(P,). i is often the case 

see Barron [11], [12]) that there exists a sequence of estimators 
^ converging to P in the sense that ED( PHP, ) 2 0, at least in 
the Cesaro sense, i.e., 
1 a 
lim — r ED( PI) - 0. (30) 
n- o0 n, 
In this case Theorem 3 applies with Qy yi: given by the 
estimator LIO +) to yield 
Sx 
“tog = 0. (31) 
S, 
It follows that the actual wealth S is close to the log-optimal 
wealth $^ as shown in the following theorem. 


Theorem 4: Let X, X}, +> beiid. ~ P. Let B, be a sequence 
of estimators of the true distribution P such that 


- X ED( PIB.) +0 - (32) 
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and let 
Š, = TI bx, (33) 
i-1 : 
where 
3, = b°(#). (34) 
Let 
Sy = D1 4"(?) x, (35) 
i=l 
be the optimal wealth sequence. Then 
$a r, (39 


where o(1) — 0 in probability. 


, Consequently, if S* has an exponential growth rate W*, then 
S, has the same asymptotic exponent. 

Proof: To see that $,/S,* — 2") in probability, first ob- 
serve that by Markov's inequality 


P s 2 27"E— $ 2 37 
D yon =m -m 

AJS Uses e» 
where the inequality E(S,/$) x1 follows from the Kuhn- 
Tucker conditions for the optimality of b* (sce Bell and Cover 


(8). 
On the other hand, using ‘the notation y* = max{0, y}, y^ = 


max (0, — y), 
P E 2 P EN 
me = > 
$ > $ ne 


Aad 
— E| log -= 
< (vs £j 
i[a Ss 
si [sete + | (38) 


where the first inequality follows from Markov’s inequality and 
the second from 


E(log S"/S,) = Elogmax{ S/S," 1} 
< Elog(1+ S/S") 
<log(1+ E($,/57)) slog2=1 (39) 
by the concavity of the logarithm and the Kuhn- Tucker condi- 
tions. Combining (31), (37) and (38), we have S, /S* — 2^* in 
probability, as claimed. 
VH. EXAMPLES 


We first give an example due to Kelly [6] in which A = 7. Here 
the stock market is a horse race, which, in the setup of (1), 
consists of a probability mass function P(X = O.e;) = p, i= 
1,2,-+-, m, where e, is a unit vector with a 1 in the /th place and 
O's elsewhere, O, equals the win odds (O, for 1), and p, is the 
probability that the ith horse wins the race. 

Then 


W(X) 7 max E log YX 


l = max E p; log b,0, 
i=l . o 
=} p; log0,- H(X) (40) 


where H(X)=—Z7,p,log p,. Also b*—p, ie, the optimal 
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portfolio is to bet in proportion to the win probabilities, regard- 
less of the odds. ! 

For side information Y, where (X, Y) has a given distribution, 
a similar calculation yields - 


W(X|Y) = X. pog0, - H(XIY) (41) 


and 
br = P(X = Oey), 


Here the optimal portfolio is to bet in proportion to the 
conditional probabilities, given Y. Subtracting (40) from (41), we 
have : 


A-W(X|Y) -W(X) = H(X)- B(X|Y) = I(X;Y). (42) 


Consequently, the information bound on À is tight. 

Of course, it sometimes happens that the information Y about 
the market is useless for investment purposes: The next example 
has A=0, /=1. Let X=(1,1/2) with probability 1/2, and 
X = (1,3/4) with probability 1/2. Let Y= X. An investment in 
the first stock always returns the investment, but an investment 
in the second stock may cut the investment capital to cither 1/2 
or 3/4 depending on the outcome X. It would be foolish to invest 
in the second stock, since the first. stock. dominates its perfor- 
mance. Thus 6* = 6*(y) = (1,0) for all y, and 4=0. On the 
other hand, since the outcomes of X are equally likely, and 
Y= X, we see 


I(X;Y) = (X; X) = H(X) - H(X|X) 
-H(X)-1 bit (43) 


Thus a bit of information is available, but A = 0 and the growth 
rate is not improved. 


i71,2,--:,m. 


VIIL CONCLUSION 


We offer one final interpretation. Recall that H(X) — H(X|Y) 
= I(X; Y) is the decrement in the expected description length of 
X due to the side information Y. Hence the inequality A < Z bas 
the interpretation that the increment in the doubling rate.of the 
market X is less than the decrement in the description rate of X. 
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ASYMPTOTIC OPTIMALITY AND ASYMPTOTIC 
EQUIPARTITION PROPERTIES OF 
LOG-OPTIMUM INVESTMENT 


By PAUL H. ALGOET! AND THOMAS M. COVER? 


Boston University and Stanford University 


We ask how an investor (with knowledge of the past) should distribute his 
funds over various investment opportunities to maximize the growth rate of 
his compounded capital. Breiman (1961) answered this question when the 
Stock returns for successive periods are independent, identically distributed 
random vectors. We prove that maximizing conditionally expected log return 
given currently available information at each stage is asymptotically opti- 
mum, with no restrictions on the distribution of the market process. 

If the market is stationary ergodic, then the maximum capital growth rate 
is shown to be a constant almost surely equal to the maximum expected log 
return given the infinite past. Indeed, log-optimum investment policies that 
at time n look at the n-past are sandwiched in asymptotic growth rate 
between policies that look at only the k-past and those that look at the 
infinite past, and the sandwich closes as k > oc. 


1. Introduction. Suppose an investor starts with an initial fortune S, = 1. 
At the beginning of each period ¢ (where t takes on discrete values 0, 1,...), the 
current capital S, is distributed over investment opportunities j= l,..., m 
according to some portfolio 5, — (b cj em a vector of nonnegative weights 
summing to 1. Let X7 > 0 denote the return per monetary unit allocated to 
stock j during period t, and X, = (XP ijs m the vector of returns. The yield 
per unit invested according to portfolio b, is the weighted average of the return 
ratios of the individual stocks, i.e., the inner product 


(1) (b X) = È UX. 


l<jsm 


Given that S, units are invested at the beginning of period t, the total amount 
collected at the end of the period when the random outcome X, is revealed is 
5S,,, = S(b, X,). This capital is redistributed at the beginning of the next round, 


tt1 
and the compounded capital after n investment periods is 


(2) S,- |I (5, X). 


Oxt«n 
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Portfolio b, must be chosen on the basis of ¥,, a o-field that embodies what is 
known at the beginning of period ¢. It obviously makes a difference whether 
decisions may depend on the history of an aggregate quantity like the Dow-Jones 
average, on detailed records of the past, or perhaps on inside information or help 
of a clairvoyant oracle. Our default assumption is that F, = o( Xo,..., X,_,) is 
the information contained in the past outcomes. We wish to distinguish an 
optimum strategy (b*),.,.,, among all nonanticipating strategies (5,), <<% 
such that b, is ¥,-measurable for all ¢ > 0. 

We are dealing with a sequential version of the portfolio selection problem 
that has received much attention in the literature (not to speak of financial 
practice). Economic theory promotes the maximization of subjective expected 
utility as a guiding principle toward its solution, and this is certainly appropriate 
if the investor’s preferences are sufficiently well elucidated so that they can be 
captured in a well-defined utility function. But subjective utilities are difficult to 
assess and many investors may prefer a less elusive and more objective criterion 
if there is some rationale for its use. The mean-variance analysis of Markowitz 
(1952, 1959) trades off expected return with risk as quantified by the standard 
deviation of the return. This approach is mathematically and computationally 
tractable, but it lacks generality [cf. Samuelson (1967, 1970)] and it fails to single 
out an optimum among the portfolios located on the efficient frontier. However, 
its economic foundation becomes more solid when cast in the form of the capital 
asset pricing model [cf. Sharpe (1985) ]. Breiman (1960, 1961) considered a market 
with m stocks and independent, identically distributed discrete-valued return 
vectors X, = (X/ Ais j«5 and proved asymptotic optimality of the portfolio b* 
that attains the maximum expected log return w* = sup, E(log( b, X)}. Thorp 
(1971) exhibited certain optimality properties of the log return as a normative 
utility function, and Bell and Cover (1980, 1986) proved that log-optimum 
investment is also competitively optimum, from a game-theoretic point of view. 
Although some authors [e.g., Samuelson (1967, 1971)] have suggested that the log 
return should be considered just one among many possible utility functions, we 
hope to convince the reader of its more fundamental character. 

We consider arbitrarily distributed outcomes (.X,) and prove that maximizing 
the conditional expected log return given currently available information at each 
stage is optimum in the long run. A nonanticipating portfolio b* = 


b*(X,,..., X, ,) is called log-optimum (for period ¢) if it attains the maximum 
conditional expected log return 
(3) w* = E(log(5b*, X F} = sup E (log b, X) X, ,,..., X;]. 


b-b(Xs,..., Xp) 
Such b, also attains the maximum (unconditional) expected log return 


(4  W,*-E(wf*) = E(log(b*, X,)} = sup E{log(b, X,)). 
b-b(Xo,..., Xp) 


A log-optimum portfolio b* always exists, and is unique if the conditional 
distribution of X, given F, has full support not confined to a hyperplane in 2”. 
In any case, the return (bt, X,) is always uniquely defined, even if b* is not. 
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The results of Breiman (1960, 1961) and Finkelstein and Whitley (1981) for 
independent, identically distributed (.X,) are enhanced by the following theorem, 
which proves that b* is optimum to first order in growth exponent. 


THEOREM (Asymptotic optimality principle. LetS* — [Io .,.,(b*, X,) and 
S, = IIo 2, (6, Xj), respectively, denote the capital growth over n periods of 
investment according to the log-optimum strategy {b }o<1< and a competing 
strategy (bj), .,.,. Then (S,/S*, £ 19... ,, is a nonnegative supermartingale 
converging almost surely to a random variable Y with E(Y) < 1, and 


(5) limsupn !log(S;/S*) <0 a.s. 


Thus S, < exp(ne)S,* eventually for large n and arbitrary « > 0, which means 
that no strategy can infinitely often exceed the log-optimum strategy by an 
amount that grows exponentially fast. 


The asymptotic optimality principle will be deduced from the Kuhn- Tucker 
conditions for log-optimality using Markov's inequality and the Borel-Cantelli 
lemma. 

_ Now suppose (X,) ,.,-, 18 a two-sided sequence of return vectors, and 
bf = b*(X_,,..., X_,) is a log-optimum portfolio for period 0 based on the 


t-past ¥,= o(X_,,..., X_,). Portfolio 6; attains the maximum conditional 

expected log return for period 0 given F, 

(6) m* = E(log(b*, XoF} = sup E(log(b, X)|X_,,..., X ,). 
b-b(X i, X) 


'The maximum expected log return for period 0 given F, is given by 


(7) W,* = E({w*} = Ellog(bz, X,)} = sup E{log(b, X,)}. 
b=0(X_1,-.., X_,) 

The supremum is taken over a larger set of portfolios as ¢ increases, so that W,* 

is monotonically increasing and (w;*,.7,)9 .,.,, is a submartingale [strictly 

speaking only if all W,* are finite]. 

_ The information fields 7, = o(X ,,..., X ,) increase to a limiting o-field 

Zo = 0(X_,, X2...) Any accumulation point of {6} is a log-optimum port- 

folio for period 0 based on Z p, and br = b*(X_,,..., X_,) almost surely 

converges to bx = b'(X ,, X_o,...) if the log-optimum portfolio for period 0 

given £. is unique. Furthermore, W,* increases to the maximum expected log 

return given the infinite past, 

(8) W,* a W,* = E(log( 6x, X,)) = sup E{log(b, X,)). 
b=0(X_,, X_a,---) 

We may use the expanded notation W,* = W*( XX, ,,..., Xo) and W,* = 
W*(XoX .,,..., X_,). Setting E(log S*} = W*(X,,..., X,_,) yields the chain 
rule 
(9) W*(Xo,...,X,.)) = Y; W( XIX,_1,---, X9). 


Oxt«n 
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If (X,) is stationary, then W,* = W*(X,|X_,,..., X ,) is equal to W,* = 
W*( XX, .,,..., Xo) and these definitions are equivalent: 


Wt = W*(XQJX 4, X 3s) = limtW'GIX iss X_,) 
t 
(10) = lim 1 W*CX4X, sss, Xo) 
= lim ? a7'W*(X,, X,,..., X, 4). 


These identities for maximum capital growth rate generalize those for relative 
entropy rate in information theory. Indeed, suppose one stock will return m 
times the amount invested in it, whereas all other stocks return 0. Thus we must 
gamble against uniform odds, on the identity of the winning stock (indicated by 
the direction of X,). Placing proportional bets, b/ = Prob( X7 * O|X, ,,..., Xo} 
is log-optimum, and W*( XX, ,,..., Xo) = log m — H*(X,X, ,,..., Xo), where 
H*(X,X, ,,..., Xo) is the conditional entropy of X, given X, ,,..., Xy. Now 
H*(Xo|X_,, X_9,...) = lim, | H*(XJX, ,,..., Xo) is the entropy rate of ( X,), 
and 


(11) W*(XjX i, X S,...) = logm - H'(XyX 4, X_9,-..). 


The following AEP for log-optimum investment in a stationary ergodic 
market generalizes the Shannon-McMillan-Breiman theorem of information 
theory. 


THEOREM (Asymptotic equipartition property or AEP). If (X,) is sta- 
tionary ergodic, then S* — I1, .,. (b, X,) grows exponentially fast with con- 
stant asymptotic rate almost surely equal to the maximum expected log return 
given the infinite past, i.e., 


(12) n logS* > W* = W*(XgJX ,, X ,,...) as. 


Equivalently, S* = exp[n(W.* + o(1))], where o(1) > 0 a.s. The rate W¥ is 
highest possible. 


The AEP is an immediate consequence of the ergodic theorem if ( X,) is finite 
order Markov. A sandwich argument and the asymptotic optimality principle 
will reduce the proof of the general case to applications of the ergodic theorem. 

In the first half of the paper we discuss log-optimum investment for a single 
period. The Kuhn-Tucker conditions for log-optimality of a portfolio b* are 
recalled in Section 2, and in Section 3 we examine log-optimum portfolio 
selections and the maximum expected log return as functions of the distribution 
P of the random outcome X = (X5) < j«m 0n. 227. To simplify the analysis we 
use a divide-and-conquer approach. Namely, we consider the decomposition 
X = (B, X)U, where B = (Bi) sjem is a fixed reference portfolio and U = 
X/(B, X) is the scaled outcome in the simplex Y= {u = (u/).,., € 47: 
(B,u) — 1). The return (b, X) factors as (B, X)(b, U), and the maximum 
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expected log return w*(P) = sup, Ep(log( b, X)) decomposes as the sum of a 
reference level r(P) = Ep{log( 8, X)) that is affine in P and an extra term 
w*(Q) = sup,Eg{log(b,U)} that depends on P only through the marginal 
distribution Q of U. The term w*(Q) is nonnegative, bounded and continuous in 
Q when the space of probability measures on 4& is equipped with the weak 
topology, whereas the irregular term r(P) = Ep(log(B, X)} is irrelevant for 
portfolio selection. 

We need the decomposition w*(P) = r(P) + w*(Q) to show that the maxi- 
mum conditional expected log return w,* is always attained by an .Z, measurable 
portfolio b*. Furthermore, the nonnegativity and lower semicontinuity of w*(Q) 
are essential in Section 4 when we argue that the maximum expected log return 
given the £-past converges to the maximum expected log return given the infinite 
past (i.e, W,* 7 W* at t > oo). 

'The asymptotic optimality principle is proved in Section 5, for an arbitrarily 
distributed sequence of return vectors. In Section 6 we argue that S* has a 
well-defined growth rate if {X,} is stationary ergodic, and in Section 7 we 
examine whether the same is true if the market is stationary, or stationary in an 
asymptotic sense. Although the ergodic theorem is generally valid for asymptoti- 
cally mean stationary processes (whose definition is recalled in Section 7), the 
AEP will hold for an asymptotically mean stationary market only if the investor 
can recover from transient losses before reaching the asymptotic regime. Finally, 
in Section 8 we specialize the investment game to gambling on the next outcome 
of a random process. 


2. The Kuhn-Tucker conditions for log-optimality. When managing 
funds during a given investment period, an investor may diversify his risk by 
building a portfolio that includes several assets. The allocation of one unit of 
capital over elementary investment opportunities 7 = 1,..., m is conveniently 
described by a vector of weights b = (b^); < j<m The weights must be nonnega- 
tive (since no borrowing is allowed) and sum to 1. Thus a portfolio is a vector b 
in the unit simplex 


(13) B= {b= (bi) sjem E AP: bxc bn -i). 


Let X/ > 0 denote the return per monetary unit invested in stock j, and let 
X = (X7),,;.,, denote the vector of returns. Capital invested according to 
portfolio b will grow by the factor (b, X) = X, < ; < mb/X’, that is, the weighted 
average of the per-unit returns of the individual stocks. Portfolio b must be 
selected at the beginning of the investment period, before the actual value of the 
random outcome X is revealed. However, the distribution of X on 27 is 
assumed to be known. 

Let the expected log return of a portfolio b be denoted by 


(14) w(b) = E{log(b, X)). 


We set w(b) = — oo if the expectation is not well defined in the usual sense. 
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DEFINITION. A portfolio b* is called log-optimum if no competing portfolio 
b can improve the expected log return relative to b*, i.e., if 


(5, X) 
(15) hos rg <0, foral beg. 


Every log-optimum portfolio b* attains the maximum expected log return 
(16) w* = sup E{log(b, X)}. 
beg 


Conversely, if w* is finite, then every portfolio b* attaining w* = sup,w(b) is 
log-optimum. However, condition (15) may single out a unique log-optimum 
portfolio b* even if w(b) is infinite for all b € 2. 

We recall the Kuhn- Tucker conditions for log-optimality derived in Bell and 
Cover (1980). Let the expected score vector be defined for each portfolio b as 


(17) a(b) = E( X/(b, X)). 


THEOREM 1. Let a* = a(b*) denote the expected score vector for portfolio 
b*. Then b* is log-optimum iff the Kuhn-Tucker conditions a*/ < 1 hold for all 
1 <j <m, or equivalently, iff 

(5, X) 


(18) (ba) = BL | <1, forallbe Z. 


Proor. Forbe 4 and0 <A =1-—A <1 let b, 2 Ab* + Ab. Then 
(b,, X) x (b, X) (b, X) 
—— =A +A =1+)AZ, where Z = — 
(b*, X) (b*, X) waere PT (b, X) 

Using a Taylor series expansion we obtain, for any a > 0, 
AZ>log(1 + AZ) > log(1 + A(Z ^ a)) 


=(ZAa)-102(Z Aa)” (for some 0 « 0 < 1) 
> A(Z Aa) — Xa’. 


Choosing a = a(A) so that a(A) > oo and Aa(A) > 0 as A N 0, we see that 
A^!E(log(1- AZ)} > E(Z) as AN 0. But E{Z} = (5,a*) — 1, so the right 
derivative at A = 0 of u(b,) = E(log( b,, X)) is given by 


d . Eflog(1 + AZ)} 
(19) gn bi) cos = lim rr a 


The Kuhn- Tucker conditions assert that b* is log-optimum iff the directional 
derivative of the expected log return is nonpositive when moving from 5* to any 
competing portfolio b (in particular, when moving from 5* to any extreme point 
of #). The infinitesimal conditions dw({b,)/dA|,_94< 0 are necessary for log- 
optimality of b*, and they are also sufficient because w(b) is concave 
in b. B 


E(Z) = (b,a*) - 1. 
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The set B* of log-optimum portfolios is never empty [cf. Cover (1984)]. In 
fact, let Y denote the linear hull of the support of the distribution of X, that is, 
the smallest linear subspace of #” such that X € Z with probability 1. Then 
w(b) is strictly concave when restricted to .Z and constant along fibers per- 
pendicular to .Z. It follows that B* is a polyhedral set (the intersection of 4 
with a fiber orthogonal to £), and the log-optimum portfolio b* is unique if X 
has full support (.Z— 22"). The return (5*, X) and log return log(5*, X) are 
unambiguously defined, independent of the choice of log-optimum portfolio 5* in 
B*. 


3. Continuity and attainability of the maximum expected log return. 
We make explicit how various quantities depend on the distribution P of X on 
227. Let w(b, P) = Ep(log(b, X)) denote the expected log return of portfolio b, 
w*(P) = supyw(b, P) the maximum expected log return and B*( P) the set of 
log-optimum portfolios. It is clear that w*( P) is convex in P, since w*( P) is the 
supremum of functions Ep(log( 5b, X)) that are affine in P. 

The direction of the return vector X embodies everything an investor needs to 
know in order to maximize the expected log return. To justify this claim, we 
choose a fixed reference portfolio B = (87), <j .,, with 87 > 0 for all j, and we 
define the scaled return vector 


(20) U-u(X), where u(x) — x/(f, x). 
Thus U is obtained by projecting the return vector X on the simplex 
(21) U= fu = (wW)icjem © 97: (B, u) = 1]. 


If X = 0, then we set U = u(0) = uy for some arbitrary ug € X. 

The distribution Q of U = u( X) on € is obtained by integrating out the 
distribution P of X along rays through the origin. All mass accumulated along a 
ray is collected at the point where the ray crosses the simplex 4, except that 
mass found at X = 0 is transferred to u(0) = uy. Thus Q is the image measure of 
P through u: 22? —> Y, and for any Borel subset A C 4&/ we have 


(22) Q(U € A) = P(u(X) € A) = P(u (A)). 


Since X = (B, X)u( X), the expected log return may be decomposed as the 
sum Epj(log(b, X)) = Ep(log(B, X)) + Ep(log( b, u( X))), or equivalently, 


(23) w(b, P) = r(P) + w(b,Q). 


Here r( P) = w(ß, P) denotes the expected log return of the reference portfolio 
P. We interpret r( P) as a reference level for the expected log return, since it is an 
inherent property of the market over which the investor has no control. Whereas 
r(P) = Ep(log(B, X)) is affine in P, it is also a very irregular function of P, 
possibly infinite or ill defined. Since our choice of 5 cannot affect its value, we 
shall subtract r(P) from the expected log return w(b, P). The remaining 
quantity w(b, Q) = Eg{log(b,U)} depends on P only through the marginal 
distribution Q of the scaled outcome U, and represents the relative improvement 
in expected log return that results when portfolio b is chosen instead of 8. The 
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maximum expected log return can be expressed as the sum 
(24)  w*(P)-r(P) * w*(Q), where w*(Q) = sup Eg(log( b, U)). 
beg 


Maximizing w(b, P) or w(b,Q) are equivalent operations, so that B*(P) = 
B*(Q). Notice that w*(Q) = w*(P) — r(P) = 0, with equality iff the reference 
portfolio B is log-optimum. 

It is an interesting fact that the maximum expected log return w*(Q) is 
always attained by some portfolio choice. However, we need a stronger result, 
namely, the existence of log-optimum portfolios b*(Q) that depend measurably 
on Q. To prove the existence of a measurable selection of log-optimum portfolios, 
we make use of topological properties, including compactness of 4 and upper 
semicontinuity of the expected log return w(b, Q) = Ec(log( b, U )} in (b, Q). 

The space 2 of probability measures on the compact metric space 4? is 
compact and metrizable when equipped with the weak topology [that is the 
weakest topology on 2 such that Q > Eo{ f(U)) is continuous in Q € 2 for all 
bounded continuous functions f: Y — 4]. Its Borel o-field is the smallest o-field 
on 2 such that A > Q(A) is measurable in Q for all Borel subsets A c 4. 


THEOREM 2. The maximum expect log return w*(Q) = sup, c gEQ{log( 5, U)) 
is convex, bounded [between 0 and max ( — log B7)] and uniformly continuous 
when the space 2 of probability measures on € is equipped with the weak 
topology. The set of log-optimum portfolios B*(Q) is a nonempty compact convex 
subset of B for every distribution Q on %, and a log-optimum portfolio b*(Q) € 
B*(Q) can be selected for each Q € 2 so that b*(Q) is measurable in Q. 


PnRoor. Clearly w*(Q) is convex in Q for the same reason that w*(P) is 
convex in P. We argue that w*(Q) is bounded below and lower semicontinuous 
on 2, because (8, u) is bounded below on 4 and (b, u) is concave in b € 4 and 
lower semicontinuous in u € 4. We also prove that w(b, Q) is bounded above 
and upper semicontinuous, using compactness of # and boundedness above and 
upper semicontinuity of (b, u) on Z x 4. Boundedness and uniform continuity 
of w*(Q) and existence of a measurable selection of log-optimum portfolios 
b*(Q) will follow automatically. 

First, we argue that w*(Q) is nonnegative and lower semicontinuous on 2. 
For 0 < À < land b E€ Z, let A = 1 — À, b, = AB + àb, B = (by: b E€ 4) and 


(25) w*(Q) = sup Eę{log(b, U)} = sup Eg{log(b,, U)]. 
bed, beg 


Observe that w,;*(Q) is monotonically increasing in A, since the supremum is 
taken over a larger set Z, as à increases. Furthermore, #, = Z, Z, = {8} and 
(B, u) = 1 for all u € €, so that 


w*(Q) = wt (Q) = we(Q) = uf (Q) = Eg(log( 8, U)) = 0. 


If À < 1, then log( b,, u) is bounded below (by log A) and lower semicontinuous 
in u, so that w(b,, Q) = Ec(log(b,,U)) and w*(Q) = SUP au(b,, Q) are 
lower semicontinuous in Q. On the other hand, the inequality (b, u) > A(b, u) 


Asymptotic Optimality and Asymptotic Equipartition Properties of Log-Optimum Investment 165 
884 P. H. ALGOET AND T. M. COVER 


implies that 
wx (Q) x w*(Q) < wt (Q) — log à, 


and hence w*(Q) 7 w"(Q) as A 7 1. Since w*(Q) is the supremum of lower 
semicontinuous functions w;*(Q), it follows that w*(Q) is lower semicontinuous 
as well. 

The expected log return w(b, Q) = Eg{log(b, U)) is bounded above and upper 
semicontinuous on 4 x 2, since (b, u) is bounded and upper semicontinuous on 
Bx U. Since 4 is compact, it follows [cf. Bertsekas and Shreve (1978), Proposi- 
tion 7.33] that w*(Q) = sup, gw(b, Q) is bounded above and upper semicon- 
tinuous on 2. Furthermore, log-optimum portfolios b*(Q) € B*(Q) can be 
selected in a measurable fashion for all Q € 2 by the measurable selection 
theorem of Kuratowski and Ryll-Nardzewski (1961). The upper bound w*(Q) x 
max (—log B’) holds since (b, u) < Eju’ x max;(1/87) for all be 2 if u 
satisfies (B, u) = 1. 0 


It is impossible to select a portfolio b*(Q) € B*(Q) for all distributions Q on 
% so that b*(Q) is continuous in Q. However, if Q, > Q.. and b* € B*(Q,, ) for 
all n, then any accumulation point b* of the sequence (br) is a point in 
B*(Q,,). Furthermore, (b*, U) > (b*, U) almost surely under Q,,. These con- 
tinuity properties of the multivalued correspondence Q ^ B*(Q) follow from the 
following. 


THEOREM 3. The set Gr(B*) = ((Q, b*): b* E€ B*(Q)} is closed in 2 x &. 
Consequently, any selection of log-optimum portfolios Q > b*(Q) € B*(Q) is 
continuous at any Q € 2 such that B*(Q) = (b*(Q)) is a singleton set. 


PRoor. Since Z is compact, the theorem will follow from the following 
gam: If Q,> Qo in 2, b% > b* in @ and be e B*(Q, ) for all n, then 
€ B'(Q,,). 
aso prove the claim we consider the sequence of maximum expected log returns 
w*(Q,,) = w(b*, Q,). It is clear that w*(Q,) > w*(Q,,) since Q, > Q, in 2 
and w*(Q) is continuous in Q. On the other hand (see the proof of Theorem 2), 
w(b, Q) is upper semicontinuous on Z x 2 and hence 


lim supw(b£, Q„) < w| limbs, imĝ,) = w(bž, Q..). 
The claim bx € B*(Q,) and Theorem 3 follow, since 


w(b*, Qo) > lim sup w(B*, Q,) = limw*(Q,) = w*(Q,,) = sup w(b, Q..): o 
n n b 


The maximum expected log return w*( P) is neither bounded nor continuous 
(for the weak topology) as P ranges over the space of probability measures on 
227. But if the support of P is constrained to a closed subset X of 227" then 
w*( P) is lower semicontinuous and bounded below iff 2# is bounded away from 
0, upper semicontinuous and bounded above iff % is bounded, and bounded and 
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uniformly continuous iff X^ is bounded away from 0. In particular, if #= 4 (i.e., 
if X is distributed on the simplex 4), then P= Q and w*(P) = w*(Q) is 
bounded and continuous. 

Some of the conclusions of Theorems 2 and 3 continue to hold if the investor 
may distribute his funds over a countable set or even a separable metrizable 
space of investment opportunities. Indeed, suppose every realization of the 
return X is a nonnegative lower semicontinuous function x(a) on £. (This is no 
restriction if »/ is a countable set with the discrete topology.) The average 
return (b, x) = f ,x(a)b(da) is then well defined for every portfolio b [i.e., for 
every normalized measure b(da) on the Borel o-field of $7]. Further assume the 
existence of a reference portfolio B such that (8, x) > 0 is strictly positive for 
any return function x(a) that is not identically 0. [Such £ exists if æ is locally 
compact, and, in particular, if . is countable.] If P and Q denote the distribu- 
tion of X and U = X/(B, X), then the maximum expected log return w*(P) 
admits the decomposition r( P) + w*(Q), and w*(Q) is nonnegative and lower 
semicontinuous by the argument presented in the proof of Theorem 2. If, 
moreover, ./ is compact and the return functions x(a) are continuous and 
bounded by a fixed constant, then w*(Q) is bounded and continuous and a 
measurable selection of log-optimum portfolios b*(Q) exists by Theorem 2, and 
Gr( B*) is closed by Theorem 3. 


4. Martingale properties. It will be shown that the maximum expected log 
return given increasing information fields tends to the maximum expected log 
return given the limiting o-field. We assume that the random return vector 
X(w) € @™ is defined on a perfect probability space (Q, F , P), so that X admits 
a regular conditional probability distribution given any sub-o-field of F. See 
Jifina (1954) for a proof of this fact, and Ramachandran (1979) for a complete 
discussion of perfect measures. 


THEOREM 4. Suppose the random vector X is defined on a perfect probability 
space (Q, £,P), and {F,}o .,..,, is an increasing sequence of sub-o-fields of F 
with limiting o-field F, CF. 

(a) If P, is a regular conditional probability distribution of X given F, then 
(26) P, P, weakly a.s. 

(b) If b*(-) is a measurable selector of log-optimum portfolios, then bx = 
b*(P,) is an Z-measurable portfolio attaining the maximum conditional ex- 
pected log return given F, Moreover, (b*, X) > (bx, X) a.s., and hence 
(27) log(b*, X) > log(b*, X) a.s. 

If the log-optimum portfolio given Z, is unique [B*(P,,) = (b*)], then b* > bz 


a.s. as well. 
(c) If w*(-) denotes the maximum expected log return function, then the 


maximum conditional expected log return given F, is given by 
(28) w* = w*(P) = sup E{log(b, X))Z,) = E(log(b*, X))Z,) a.s. 
bez 
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Furthermore, (0, £,), << is a submartingale and 


(29) D* > wx a.s.(andin L if Wt < œ). 
(d) The maximum expected log return given F, is given by 
(30) W,* = E(i5*) = sup E{log(b, X)} = E{log(b*, X)}. 
bes, 
Furthermore, 
(31) W,* W^, ast co. 


Proor. Lévy's martingale convergence theorem for conditional expectations 
of a bounded continuous (or nonnegative measurable) function f(x) states that 


JidP, = E(fGXIE) > fidP, = E(IGOIEZ) as. 


This proves (a) and assertion (b) follows in view of Theorem 3. Notice that 
by = : b*(P,) and w,* = w*(P,) are Z.measurable, since P, is measurable on 
(Q, F,) and both b*(-) and w*(-) are measurable functions. 

If0 x s < t € co, then & C F, so that every ¥-measurable portfolio (includ- 
ing b*) is also F, “measurable. It follows that 


E(log(b*, X)|%,} x w* = sup E{log(b, X)|¥,}. 
bez, 


Taking _ F,-conditional expectations proves that w,* = E{log(b*, XJF} < 
E{W*|F,}, and hence (E, Z),.,., isa submartingale. The maximum ex- 
pected log returns W,* = Sup, c z, E(log( 5, X)} increase with ¢ since the supre- 
mum is taken over larger and larger sets (DEF = be, ;). More information 
does not hurt! o 

It remains to show that W,* 7 W.* and i* > w* a.s. (and in L! if W.* is 
finite). For this purpose we choose a reference portfolio f (with f > 0 for all 
1 <j <m), and we recall the decomposition w*(P) = r(P) + w*(Q) of the 
maximum expected log return into a reference level r( P) = E,{log(B, X)} anda 
relative improvement w*(@) that only depends on the distribution Q of the 
scaled return vector U = u( X) = X/(B, X). 

Let Q, designate a regular conditional probability distribution of U = u( X) 
given - Fo for 0<t< œ. Then Q, > Qo weakly almost surely and 
{w*(Q,); Filo ., «,, is à submartingale. Since w*(Q) is bounded and continuous 
in Q, it follows that w*(Q,) > w*(Q,,) as. and in L', and E{w*(Q,)} 7 
E(w*(Q,,)}. The sequence {r(P,), Z,), <<. [where r(P,) = E(log( B, X)|F,}] is 
a martingale [at least if log(8, X) has finite expectation], and the martingale 
convergence theorem for conditional expectations asserts that 


r(P,) = E (log( B, X)Z) > r(B,) = E (log( 8, X)£.]) as. 
[and in L' if E(log(B, X)) is finite]. Since i5* = r(P,) + w*(Q,), we may 
conclude that (1, £), < i<» is a submartingale such that ip^ > i5* a.s. (and in 


S 
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L! if W.* is finite). The expectations satisfy 


oo 


W,* = E{log( 8, X)) + E{w*(Q,)} ^ Wt = E(log( 8, X)) + E{w*(Q,,)}. 
m 


The main conclusion of Theorem 4 is that no gap exists between lim, t W,* 
and We. Thus the limit of the expectations W,* — E (log( 5f, X)) coincides with 
W* = "E(log(bs, X)) which is the expectation of the limit log(b*, X) = 
lim jlog(b*, X). 

We have shown that ij* > w* as. and W,* 7 W,*, using boundedness and 
continuity of w*(Q). These convergence theorems also hold for a market with 
infinitely many investment opportunities, when w*(Q) is only nonnegative and 
lower semicontinuous. Indeed, {W,*, 7,), .,..,, is still a submartingale, so that 
w,* < E(w|7,) and lim TE") < E(w7) and hence, by Lévy’s martingale 
convergence theorem for conditional expectations, 


(32) lim sup àj* ES lim E {a K[Z)-E(wi£)-iwut as 


Since {w*(Q,), Fijo <1< is a submartingale also, one similarly obtains 


limsupw*(Q,) < w*(Q,) a.s. 
(33) ELA - 
and lim TE{w*(Q,)} < E{w*(Q,,)}. 


But Q, > Q,, weakly a.s. and w*(Q) is lower semicontinuous in Q, so that 
(34) liminfw*(Q,) > w*(Q,) as. 
t 


We conclude that w*(Q, )2 w*(Q.. ) as. Since w*(Q) is also nonnegative 
Fatou’s lemma implies that E(w*(Q)) 7 E(w*(Q,, )). It follows that w,* > w> 
a.s. and W,* 7 W.*, at least if E(log( B, X,)} > —oo or sup,E(w*(Q,)) < oo. 


5. The asymptotic optimality principle. We now prove the asymptotic 
optimality principle for sequential log-optimum investment. The market is 
described by a sequence of return vectors (X,)9.,.,, defined on a perfect 
probability space (Q, .7, P), and capital invested according to a portfolio b, at 
the beginning of period ¢ will grow by a factor (b, X,) when the random 
outcome X, is revealed at the end of that period. If the initial fortune is 
normalized to S, = 1, then the compounded capital S, after n periods is given by 
(35) S, — II (5, X). 

O<t<n 
The objective is to select portfolios b, so as to maximize the capital growth rate 
lim inf,~ ‘log S,. Portfolio b, must be selected on the basis of an information 
field Z, that embodies what is known at the beginning of period f. In other 
words, b, must be ¥-measurable (b, € F, for short). 

Let P, denote a regular conditional probability distribution of X, given F, 
and let b*(-) be a measurable selector of log-optimum portfolios. Then bf = 
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b*(P,) is an ¥F,-measurable portfolio attaining the maximum conditional ex- 
pected log return 


(36) w* = w*(P,) = E(log( bf, X,)|F,} = sup E (log(b, XDF} 
bes 


The expectation of the log return log( b*, X,) and of its conditional expectation 
w, are both equal to the maximum expected log return for period t given F, 


(37) W,* = E(w*) = E(log(b*, X,)} = sup E(log(5, X,)}. 
bes, 
We argue that (5^), .,<,, is optimum in the long run. 


THEOREM 5 (Asymptotic optimality principle) Suppose the random out- 
comes (X,)9 ,,.,, are defined on a perfect probability space (Q, £,P), and 
(4)o.,-.,, is an increasing sequence of sub-o-fields of F such that 
o( X,,..., X, 0 EF, for all 0 x t « œ. Let the compounded capital after n 
periods of investment according to the log-optimum strategy (b*),..,.,, and 
some competing nonanticipating strategy {bi} .,..,, be denoted by 
(38) St = II (bf , Xj) and S, = H (5, X,). 

Oxtxn O<t<n 
Then (S,/S,*, Filocn<o is a nonnegative supermartingale converging almost 
surely to be a random variable Y with E{Y} < 1. Furthermore, E(S,/S*) x 1 
for all n, and 


S, 
(39) im supn"og| =] <0 as. 


n 


Proor. The log-optimum investor and his competitor start with equal 
fortunes, so that Sj/Sy = 1. The ratio S,/S* = IIo .,., (b, X,)/(0*, X,) is 
5 „measurable, and the conditional log-optimality of b given Z, is equivalent 
to the Kuhn- Tucker condition 


b,, 
E (bas Xn) 
(5, X,) 
It follows that 


Seal) uf S, (Ons Xa) CRAME 


i St (br, 
So {S,/S,*, Flo <n< 1S a nonnegative supermartingale. Any nonnegative su- 
permartingale converges almost surely, and the expectations decrease monotoni- 
cally to a limit no smaller than the expectation of the limit, by Fatou’s lemma. 
Thus S,/S,* converges almost surely to a nonnegative random variable Y and 


1 = E(S/S*) = E{S,/S*} > lim |E{S,/S*} > E{Y}. 
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Since E{S,/S,*} < 1, it follows from the Markov inequality that, for r, > 0, 
P(S/S*2r)zr, !'E(S/S*)zr,! 
If r, increases sufficiently fast so that Y, r, ! < oo, then 


YP(S/S: 25) < Erz? < o, 


and hence S,/S,* < r, eventually for large n, by the Borel-Cantelli lemma. In 
particular, choosing r, = exp(ne) with e > 0 proves that 


P(n'!log(S,/S*) = e infinitely often} = 0. 


Since e > 0 was arbitrary we may conclude that limsup,n 'log(S,/S,*) < 0 as. 
[This fact can be proved also by observing that S,/S* converges to a random 
variable Y with E(Y) < 1 and hence 0 x Y < oo as. Indeed, S,/S,* < (1+ Y) 
for large n and hence limsup,n !log(S,/S*) x lim,n !log(1-* Y) = 0 a.s.] O 


Theorem 5 asserts that any alternative is dominated in the long run by the 
log-optimum strategy. Indeed, E(S,/S,*) x 1 for all n, and the Borel-Cantelli 
lemma implies that S,/S* < r, eventually for any sequence (r,) such that 
Ln, | < 0 (e.g, r, = nl** or r, = e"*). The maximal inequality for nonnegative 
supermartingales [cf. Neveu (1972), Proposition II-2-7, page 23] asserts that 


(40) P{ sup S,/S,* > Aj <1/A. 


Thus with probability at least 1 — 1/A, a competing investor will never outper- 
form S, by a factor greater than A. The random variable sup,,S,/S,* is finite 
almost surely, although its expectation is generally infinite. A game-theoretic 
sense in which S,* dominates S, for games with payoff E(9(S(9/S(?)) with @ 
increasing is given in Bell and Cover (1980, 1986). 

The conclusions of Theorem 5 hold if {¥,} is an increasing sequence of 
information fields with o(U,,...,U, ,)) C F, for all t. Indeed, S,/S* = 
IIo << nlb U,)/( 6%, U,) is completely determined by the history of the scaled 
outcomes U, = u(X,). 


6. The asymptotic equiparitition property. Breiman (1960, 1961) consid- 
ered a market with outcomes {X,} that are independent and identically distrib- 
uted according to an atomic measure, and he argued that repeated choice of the 
log-optimum portfolio b* is optimum according to various criteria. In particular, 
the capital S* =TIy.,.,(6*, X,) will grow exponentially fast almost surely 
with limiting rate equal to the maximum expected log return w* = 
sup, E(log(b, X )}, by the strong law of large numbers, 


41) n lg S* =n! log( b*, X,) > w* = Ef{log(b*, X a.s. 
n t 
O<t<n 


We prove an asymptotic equipartition property for log-optimum investment 
in a market that is stationary ergodic. The successive outcomes X,(w) = X(T w) 
are defined in terms of a random vector X(w) € 47* and an invertible measure- 
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preserving and metrically transitive transformation J defined on a perfect 
probability space (€, F , P). Since T is invertible, the returns can be embedded 
in a two-sided sequence ( X,) -<< 

Let bf be a log-optimum portfolio for period ¢ based on the t-past F, = 
0( Xo, ..., X,.,), and let b* be log-optimum for period 0 based on the shifted 
information field F, = T'2, = o(X_,,..., X_,). Portfolios b* and b attain the 
maximum conditional expected log returns w = sup; s E(log(b, X,)|.7,) and 
W,* = suppe z E{log(b, X,).7,). We denote by W,* = E(log(b*, X,)} and 
W,* = E(log(b*, X,)) the maximum expected log returns. Then W,* = 
W*(XjX ,,..., X_,) equals W^ = W*(X,|X,_,,..., Xo) by stationarity. If 
bz is a log-optimum portfolio for period 0 based on the limiting o-field E = 
o(X_,, X_n...) then W,* = W,* increases monotonically to W* = 
E{log(b2, X_)}. This limiting expectation is equal to the maximum expected log 
return given the infinite past, and is denoted by W.* = W*(X,|X_,, X s...) 
It may be noted that W*(X,|X_,,..., X_,) is the maximum expected log 
return given the infinite past under the stationary kth-order Markov process 
having the same (k + 1)st-order marginal distribution as {X,}. 

Let S* = II, .,., (bt, X,) denote the capital growth over n periods of 
log-optimum investment. The AEP asserts that the time-averaged growth rate 
n log S* and its expectation n^! E(log S*) =n 'W *(X,,..., X,.,) converge 
to the same limit. 


THEOREM 6 (Asymptotic equipartition property). If the sequence of stock 
return vectors ( X,) is stationary ergodic, then capital will grow exponentially 
fast under the log-optimum investment strategy, almost surely with constant 
asymptotic rate equal to the maximum expected log return given the infinite past 
(42) n log S* > W* = W*(X,|X_,,X_9,...) a.s. 


where 


W*(XjX 4, X_9,--.) = im W*(X,|X_,,.-., X_,) 
t 


(43) lim TW *(XAX,. ,,..-, Xo) 


= lim tn^!W*(X,,..., X, j). 


PRoor. One potential approach to establish the AEP for log-optimum in- 
vestment is to invoke the extended ergodic theorem that was used by Breiman 
(1957/1960) to prove the AEP of information theory. This extension of the 
ergodic theorem asserts that 
(44) n-loggS*-n^ Y, w*(Tw) > We = E(w} as.andin L, 

Ost<n 
if w,* = log(b*, Xo) converges to W,* = log(b*, X) and (w,*)o.,.,, is L- 
dominated. Theorem 4 asserts that w,* > w,* a.s., but it seems hard to check 
the integrability condition E{sup,|w,*|} < oo. We shall instead reduce the AEP 
to direct applications of the ergodic theorem, using a sandwich argument. 
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The information field F, = o( X,,..., X, ,) is approximated by a more re- 
fined o-field F, and by less refined o-fields .7,?, defined for 0 < k < oo as 
follows: 

e( Xos- X,.,), if0<t<k, 
o(X,_4,---,X,_-1), ifk<t<o, 


(46) Fi) TUE =0(..., X 4, Xoo X, ). 


(45) FAT Gry | 


Let bí? and b{® denote log-optimum portfolios for period £ based on the 
approximating o-fields 4, and 9), and let the corresponding capital 
growths over n periods be denoted by 
(47) Se= Y, (bo), X,) and o= [I (8, X,). 
O<t<n Oxt<n 

Thus S(P, S* and S‘*°) denote the capital growth over n periods of log-optimum 
investment when the investor is allowed to look back at each stage, respectively, 
at the k-past (but not beyond period 0), up to time 0 and into the infinitely 
distant past. 

Observe that b(9(o) = bE(T* ^o) if t > k. Given the expansion 


(48) n^ logS(?-n-MdogSt- n^ Y, log(b(^, X,), 
R<t<n 
it follows from the ergodic theorem that 
(49) n log S\ > W,* = E(log(bf, X,)) as. 
The sequence (log( 59), X,)} is stationary ergodic and bf = b*, so that again 
by the ergodic theorem, 
(50) nog Si) =n Y, log(b(9, X,) > Wx = E(log(bz, X,)) as. 
O<t<n 


The log-optimum F,‘*)-measurable portfolio b(^ is #,-measurable since 
F,” c £, and the log-optimum #,-measurable portfolio b* is ¥;,‘~)-measurable 
since F, c F. It follows from the asymptotic optimality principle that 

go» 


( S 
(51) imsupn tog S <0 and lim supn'log| 55 


<0 as. 


Thus we obtain the chain of asymptotic inequalities 
W = limn log S(? < liminfn~'log S* 
n 


(52) 


< limsupn^!log S* < limn^ log Si = W* as. 
n n 


The AEP follows since W,* = W,* 7 W,* with no gap as k > oo. O 


'The sandwich proof of the AEP remains valid if the log-optimum portfolios 
b* are based on information fields F, other than the history of past outcomes 
o( X,, ..., X, ,). However, (.7,), ., .,, must be monotonically increasing and the 
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history of the scaled return vectors o(Up,..., U, ,) must be contained in .£,, so 
that the asymptotic optimality principle can be invoked. Monotonicity of (.7,) 
means that information available about the past should never be erased from 
memory. In addition, one must assume that the shifted fields F, = T'Z, are 
monotonically increasing to a limiting o-field #,, so that W* 7 wt by Theo- 
rem 4. Monotonicity of {F, } means that later investors have an advantage in 
information when compared on common grounds, after shifting back to the 
reference period 0, where all face the same decision problem of selecting by. 

Suppose in particular that side information Y,(w) = Y(T'o) is revealed to- 
gether with the return vector X, at the end of period ¢. Then ¥,= 
o( Xy, Yy,..., X, Yı) and F, = o( X_,, Y_,,..., X ,, Y) are monotonically 
increasing, and n !loggS > W,* almost surely where W* = W*(X,|X_,, 
Y_,, X. ,, Y. ,,...)is the maximum expected log return given the infinite past. 
The proof is identical to that of Theorem 6, except that b* — b*(Q,) and 
bë = b*(Q, ) are now defined by applying a measurable selector of log-optimum 
portfolios b*(-) to regular conditional probability distributions Q, and Q, of 
U, = u(X,) given F, and of U, = u( Xo) given F, 

The true log return log(b*, X,) will generally differ from the conditional 
expected log return w,* = E{log(b*, X,)|¥,}. If conditional expected log returns 
were always exactly realized then the capital growth over n periods would be not 
S,* but rather 
(53) S*- [I exp[E{log(d*, X)17;)]. 

O<t<n 
If S, = [Io .,.,exp[E(log(5, X,)|27,)] denotes the corresponding quantity un- 
der the competing strategy (5,), then S, < S* for all n, and hence 


(54) limsupn^!log(S/S$*) <0 as. 


This may be called an asymptotic optimality principle for the hypothetical 
growth rate S*. * If the market is stationary ergodic, then an asymptotic equipar- 
tition property for S* can be proved as well, under certain integrability condi- 
tions. Let Llog L designate the class of random variables g(w) such that 


E(|gllog|g]) < oo. 


THEOREM 7. If the market is stationary ergodic and E{log(B, X,)|.Z..) 
belongs to L log L, then 


(55) n^!lgS* > W,* a.s. andin L. 


ProoF. Breiman's (1957/1960) extension of the ergodic theorem asserts that 
n Y, .,-,8(T'o) > E(g) a.s. and in L! if g, > g as. and E{sup,|g,|} < oo. In 
particular, if (g, F,}9 .,.,, is a martingale or a nonnegative submartingale with 
limit g in L log L, then the integrability condition E{sup,|g,|} < oo is satisfied. 
Indeed, Wiener's dominated ergodic theorem [cf. Chung (1974), example 7, page 
355] asserts that 


E(sup lel) s 


e 
= | + supEE (Ig,log^le,) 
t 
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Consider the decomposition w, = r, + w*(Q,), where r, = E(log( B, X37). 
Since (7, Z,); <: <œ isa martingale with limit 7, = E(log(B, X)| Zo} in L log L, 
Breiman's extended ergodic theorem implies that 

n™ , E(log(B, XDF} > E(log(B, X,)) as.andin L'. 
O<t<n 
Since (w*(Q,)) is bounded and w*(Q,) > w*(Q,,) a.s., we also have 


n È E(log(b*,U)|Z,) > E{w*(Q@,,)} as.andin L. 


O<t<n 
By summation we may conclude that 
n logS$*-n' Y, w* > E{w*} = We as.andin L. d 


O<t<n 


7. Stationary markets. We shall prove the AEP for markets that are 
stationary but not necessarily ergodic. A stationary market is a mixture of 
stationary ergodic modes [cf. Maitra (1977)], but no finite number of observa- 
tions may suffice to exactly identify the (random) ergodic mode of {X,}. How- 
ever, log-optimum portfolios based on the f-past are better and better suited to 
the ergodic mode as t increases, and the log-optimum portfolio given the infinite 
past will be perfectly tailored because the ergodic mode is uniquely determined 
by the infinite past. It is therefore not surprising that S* will grow with the 
same asymptotic rate as if the ergodic mode were known to begin with. 

The AEP may hold even if the market is stationary in an asymptotic sense 
only. A dynamical system (Q, #,P, T) asymptotically mean stationary (a.m.s.) 
if the Cesàro averages n^'3,.,.,P(T 'F) converge for any event F € 7. 
Setting the limit equal to P(F) then defines a stationary (T-invariant) probabil- 
ity distribution P on (Q, Z), and P is perfect whenever P is. P the stationary 
mean of P, and expectations with respect to P are denoted by E{-}. The 
measures P and P have the same restriction to the invariant o-field = {F EF: 
TF = F}, so that E{-|%} = E{-|#}. See Gray and Kieffer (1980) for further 
discussion of asymptotically mean stationary measures, and Section 34.2 in 
Loéve (1978) for a proof that the following strong law of large numbers holds for 
nonnegative measurable g(w): 

(56) n^ Y, g(Tw) E(g.7) » E(g|/) as. (P) and as. (P). 
O<t<n 

A market asymptotically mean stationary if the underlying dynamical system 
(Q, #,P, T) is a.m.s. As before we assume that T is invertible, P is perfect, and 
X (w) = X(T'o) for some random vector X(w) € 47. The AEP holds for an 
asymptotically mean stationary market, unless the investor goes broke after a 
few rounds and remains trapped in a state that is infinitely worse than any 
other. The investor should not be completely ruined by the time he reaches the 
asymptotic regime, so that he can recover from transient losses. 


THEOREM 8. Suppose the market is stationary, and bi is a log-optimum 
portfolio for period 0 given the infinite past ¥,,. Then 


(57) nog St > E{log( bx, X,))4) a.s. 


Asymptotic Optimality and Asymptotic Equipartition Properties of Log-Optimum Investment 175 


894 P. H. ALGOET AND T. M. COVER 


The same conclusion holds if the market is asymptotically mean stationary and 
b* is log-optimum under the stationary mean, at least if n^ ‘log Sg > 0 a.s. for 
some sequence {k,,} such that k, 7 oo and k,/n > 0. 


PRoor. We consider the asymptotically mean stationary case. Recall that 
Sž = [lo <,< (5, X,), where b* is a log-optimum portfolio for period £ based 
on the t-past F, = o( X,,..., X, 4). Portfolio b; is log-optimum with respect to 
the true distribution P. Let b7 and b* designate portfolios for period 0 that are 
log-optimum with respect to the stationary mean P, based on the shifted 
information field F, — o(X ,,..., X ,) and the limiting o-field #, = 
o(..., X 5$, X_,). If an investor selects log-optimum portfolios b during the 
first k periods 0 < £ < k, and in later periods £ > k switches to suboptimum 
portfolios b*(T ‘w) (i.e., portfolios based on the k-past that are log-optimum with 
respect to the stationary mean P), then capital growth over n periods will be 
given by 


S*, if0<n<k, 
k) — — 
S? -isr [] (X(T), Xj), ifk<n<o. 
kes 


If the investor always selects the portfolio 6*(Tw) that is log-optimum based on 
the infinite past with respect to the stationary mean P, then capital growth is 
given by 
s= [I (oz(T'o), X,). 
O<t<n 
It is clear that E(S(?/S*) « 1 and E{S*/S‘} « 1, so that by Markov's 
inequality and the Borel-Cantelli lemma (cf. the proof of Theorem 5), 


* 
n 


Sc» 


5 FI <0 as.(P) and lim sup nog 


n 


(k) 
imsupn og - | <0 as.(P). 


The ergodic theorem for a.m.s. measures implies that 
n^ log S\ > E(log( bz (T^o), X,)|%} = E(log(bz, X)|} as. (P) 
and 
n^'loggS(? > E{log( 5x, X)|.7]) as. (P). 
Combining the previous results yields 
E log( bz (T^^), X, 1%} = E{log( bz, Xo) 4} < lim inf n~ ‘log S% a.s. (P) 
and 


limsupn !log S* < E{log( 6%, X,)|.) a.s. (P). 


The last inequality also holds a.s. (P) since both sides are invariant (the 
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left-hand side by assumption). We obtain the chain of asymptotic inequalities 
E[log( bz, X,)|.4) x liminfn"!log S* 


< limsupn log S* < E(log(bt, X,)|.7) as. (P). 


We claim that E(log( bz, X,)|.%} is increasing in k. Indeed, if k < l, then the 
event where E (log( b, X$)|-£) exceeds E(log(b?, X,)|%} must have zero prob- 
ability, since conditioning on this invariant event and taking expectations would 
otherwise contradict the inequality W,* < W,*. The expectations W,* = 
E(log(bt, X,)) increase to W,* = E(log(b*, X,)} as k > oo, so that by the 
monotone convergence theorem, 


E(log(bz, X J4} 7 E(log(b*, X,).4) as. (P). 


Convergence also holds a.s. (P) since we are dealing with invariant random 
variables, and Theorem 8 follows since E(log( b*, X,)|./) = E(log(b*, X,)|.4). 


8. Gambling as investment. We consider a market in which exactly one 
stock will yield a nonzero return, the jth stock with probability g/. The random 
outcome X is then oriented along one of the coordinate axes of #”, and the 
scaled return U is an extreme point of the simplex Y= {u € 47:(B,u) = 1). As 
observed by Kelly (1956), investing in such a market is like gambling on the 
outcome of a horse race in which horse j has win probability g/. Since one unit 
bet on horse j yields U/ = 1/5/ if horse j wins, we have 


w(b,Q) = Eg(log(b,U)) = $, q/log(b//B7) = D(qlB) — D(ql|b). 


l<j<m 


The information divergence D(q||b) = Y, < j <m@/log(q//b’) is nonnegative, and 
equal to zero iff b = q. It follows that the bet vector b = q = (q’), < jem is the 
unique log-optimum portfolio. Thus the gambler should ignore the odds 1/87 
and place an amount on each horse j proportional to its win probability q^. The 
maximum expected log of the scaled return w*(Q) is precisely the Kullback- 
Leibler divergence between the probability vector q and the reference portfolio 8 
that defines the odds, i.e., 


(58) w*(Q) = D(gqlB) - 5, a!log(q?/8?). 
Uxjxm 

Gambling on a set of m stocks out of which exactly one will yield a nonzero 
return is a most risky type of investment game. Least risky is a market whose 
return vector has a fixed direction, so that the stock(s) with highest return can 
be predicted with certainty. In general, we say that a distribution Q on Z is less 
risky than another distribution Q’, and write Q < Q’, if there exists a dilation 
T(duļu) of Y such that Q’ = TQ, i.e., Q'(-) = f4T(-|u)Q(du). [A dilation of X is 
a transition probability l'(du|u) from 4 to Y such that u is equal to the 
barycenter of I(-|u) for all y € 4.] See Alfsen (1971) for more discussion of this 
so-called dilation or Choquet order on the space 2 of probability measures on X. 
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If Q x Q’, then Q is less risky and more attractive than Q’, in terms of 
expected log return. Indeed, Q < Q’ iff fap(wQ(du) x fae(u)Q'(du) for all 
lower semicontinuous convex q: Y— (—0o, 00]. Choosing g(u)- —log(b, u) 
proves that w(b, Q) = Ec(log( b, U)) is increasing in Choquet order, and taking 
suprema proves 


THEOREM 9. The maximum expected log return w*(Q) = sup, E(log( b, U)} 
is monotonically decreasing in Choquet order on 2, i.e., 


(59) if Q x Q' in 2, then w*(Q) > w*(Q’). 

% is a Choquet simplex, so every distribution Q on Y admits a barycenter 
u(Q) € & and for every point p € Y there exists a unique probability measure 
7, on the set of extreme points of Y that admits u as barycenter. Two measures 
that are comparable in Choquet order have the same barycenter. The point mass 
ô, that is concentrated at u is minimal and the measure 7, on the extreme points 
of 4/ is maximal with respect to Choquet order on 2, among all distributions 
that admit the point u € as barycenter. Notice that mọ) = IIQ, where 
II(du|u) = 7,(du) is the maximal dilation that sweeps all mass to the extreme 
points of 4; the minimal dilation is the identity kernel A(du|u) = &,(du) that 
leaves all mass put. 

Among all distributions Q on 4 with a given barycenter u(Q) = u, the most 
concentrated measure 6, is best and the most dilated measure m, is worst in 
terms of expected log return. Indeed, let u = u(Q) and let q denote the 
probability vector proportional to u [with components q} = u//(X; ji )]. Since 
5, < Q x 7,, Theorem 9 implies that 


(60) max log p’ = w*(8,) > w*(Q), 
j 


(61) w*(Q) = w*(v,) = D(ql8) = 2:a71og q’. 
J 

The most natural choice for f is the uniform portfolio (1/m), <<, which 
allocates an equal amount to each of the m stocks and whose yield (8, X) is the 
arithmetical average retum m !(X!-- :-- +X”). Then D(q||B) = log m — 
X (q), where X(q) = ~X,q/log q is the Shannon entropy of the probability 
vector g. In general, one may interpret h*(Q) = log m — w*(Q) as the mini- 
mum loss of expected log return relative to the ideal reference level r( P) + log m. 
When the chain of inequalities 0 < w*(6,) x w*(Q) x w*(m,) x log m is rewrit- 
ten in terms of h*(Q), one obtains 


(62) 0 x min(—log gq’) = h*(8,) < h*(Q) = logm - w*(Q), 
j 


(63) h*(Q) < h*(m,) = (gq) = Za (log q”). 

J 
If one starts with a point mass ô, located at p € 4/ and repeatedly dilates mass, 
then Q traces out a linearly < -ordered chain of distributions all having 
barycenter p(Q) = u in X. Ultimately, one ends up with a measure m,, when all 
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mass is swept to the corners (extreme points) of the simplex 4. Initially (when 
Q = 5,), one can place all bets on the stock(s) j for which the minimum 
information loss (— log q7) is minimum, but in the end (when Q = 7 ,) one has to 
place proportional bets and concede an average loss equal to the Shannon 
entropy J(q) = Ly <j <mq(— log q”). 

Gambling on the next outcome of a horse race is a special type of investment 
game. Proportional betting is log-optimum, and the asymptotic optimality 
principle and asymptotic equipartition property can be formulated in a way that 
does not seem to involve a maximization, since the log-optimum strategy is 
explicitly known. The same is true for proportional betting on the next outcome 
of a random process with values in a Polish space. Indeed, let p(xj,..., x, 1) 
denote the marginal density with respect to some dominating measure of 
the first n outcomes of a random process {X,}, and let g(x,,..., x, ,) de 
note the density under some alternative distribution. The likelihood ratio 
Xo- X n) PX, ..., 1) is then a nonnegative supermartingale con- 
verging almost surely to a ando. variable Y with E{Y} < 1, and 


q( X... t3 X, y 
(64) lim sup n~ log} ——— — ——,, < 0 as. 
p(Xo,.. Xn-1) 
If, moreover, ( X,) is stationary ergodic and densities are taken with respect to a 
Markovian reference measure, then p( X,..., X, ,) will grow exponentially fast 
with constant limiting rate almost surely equal to the relative entropy rate of 
the true distribution with respect to the reference measure, 1.e., 


(65) n !log p( X,..., X, ,) > supE {log p(Xo,..., X, .,)) as. 


n 


See Barron (1985) and Orey (1985) for a proof of this generalized Shannon- 
MeMillan-Breiman theorem using Breiman's extension of the ergodic theorem 
and Algoet and Cover (1988) for a sandwich proof. 
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UNIVERSAL PORTFOLIOS 


THOMAS M. COVER! 


Departments of Statistics and Electrical Engineering, Stanford University, 
Stanford, CA 


We exhibit an algorithm for portfolio selection that asymptotically outperforms the best 


stock in the market. Let x; = (xj, Xj, .. . , Xim)! denote the performance of the stock 
market on day j, where xij is the factor by which the jth stock increases on day i. 
Let b; = (bi, bi, ... bis bj = 0, Ejbj = l, denote the proportion b;; of wealth 


invested in the jth stock on day i. Then $, = tin b!x, is the factor by which wealth 
is increased in 7 trading days. Consider as a goal the wealth S* = max, II7. , b'x; that can 
be achieved by the best constant rebalanced portfolio chosen after the stock outcomes are 
revealed. It can be shown that $7 exceeds the best stock, the Dow Jones average, and the 
value line index at time n. In fact, S7 usually exceeds these quantities by an exponential 
factor. Let x,,X2,..., be an arbitrary sequence of market vectors. It will be shown 
that the nonanticipating sequence of portfolios b, = (b IL £7! b'x; db/SII A=)! b'x; db yields 
wealth $, = Mk=; bx, such that (1/7) In(S? FS, )20, for every bounded. sequence 
Xi, X, .. ., and, under mild conditions, achieves 
^A Shum — 1)! (2r/n) 12 

where J, is an (m — 1) x (m — 1) sensitivity matrix. Thus this portfolio strategy has the 
same exponential rate of growth as the apparently unachievable S7. 


KEYWORDS: portfolio selection, robust trading strategies, performance weighting, 
rebalancing 


1. INTRODUCTION 


We consider a sequential portfolio selection procedure for investing in the stock 
market with the goal of performing as well asif we knew the empirical distribution 
of future market performance. Throughout the paper we are unwilling to make 
any statistical assumption about the behavior of the market. In particular, we 
allow for the possibility of market crashes such as those occurring in 1929 and 
1987. We seek a robust procedure with respect to the arbitrary market sequences 
that occur in the real world. 

We first investigate what a natural goal might be for the growth of wealth for 
arbitrary market sequences. For example, a natural goal might be to outperform 
the best buy-and-hold strategy, thus beating an investor who is given a look at a 
newspaper n days in the future. 

We propose a more ambitious goal. To motivate this goal let us consider all 
constant rebalanced portfolio strategies. Let x = (X1, X2, ... , Xm)" = 0 denote a 
stock market vector for one investment period, where x; is the price relative for 


! T wish to thank Hal Stern for his invaluable contributions in obtaining and analyzing the stock 
market data. I also wish to thank the referee for helpful comments. This work was partially supported 
by NSF Grant NCR 89-14538. 

Manuscript received January 1990; final revision received July 1990. 
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the ith stock — i.e., the ratio of closing to opening price for stock i. A portfolio 
b = (b, b, ...,b,)', b; = 0, Eb; = 1, is the proportion of the current wealth 
invested in each of the m stocks. Thus S = b'x = E b,x; where b and x are 
considered to be column vectors, is the factor by which wealth increases in one 
investment period using portfolio b. 

Consider an arbitrary (nonrandom) sequence of stock vectors X4, X2, .. . , X, 
e R7. Here x; is the price relative of stock j on day i. A constant rebalanced 
portfolio strategy b achieves wealth 


n 


(1.1) S,(b) = [[ hx. 


i=l 
where the initial wealth Sp(b) = 1 is normalized to 1. Let 


(1.2) Si = max S, (b) 


denote the maximum wealth achievable on the given stock sequence maximized 
over all constant rebalanced portfolios. Our goal is to achieve Sž. 

We will be able to show that there is a “universal” portfolio strategy Dis where 
b, is based only on the past xj, Xo, .. . , X4. ;, that will perform asymptotically as 
well as the best constant rebalanced portfolio based on foreknowledge of the 
sequence of price relatives. At first it may seem surprising that the portfolio b. 
should depend on the past, because the future has no relationship to the past. 
Indeed the stock sequence is arbitrary, and a malicious nature can structure future 
X,'s to take advantage of past beliefs as expressed in the portfolio b,. Nonetheless 
the resulting wealth can be made to track S7. 

The proposed universal adaptive portfolio strategy is the performance weighted 
strategy specified by 


a {1 1 1 a  _ SbS,(b) db 
ay a temen 
where 

k 
(1.4) S,(b) = [] b'x;, 


i=l 


and the integration is over the set of (m — 1)-dimensional portfolios 
(1.5) B= beR™: b= 0, Y bid . 
iz 
The wealth S, resulting from the universal portfolio is given by 
(1.6) $, = II bix,. 
Kz1 


Thus the initial universal portfolio b is uniform over the stocks, and the port- 
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folio b. at time k is the performance weighted average of all portfolios b e B. An 
approximate computation will be given in Section 8, and a generalization of this 
algorithm will be given in Section 9. 

We will show that 


(1.7) (1/n)In $, — (1/n)In S* > 0, 


for arbitrary bounded stock sequences x,, X2, . ... Thus $, and S7 have the same 
exponent to first order. A more refined analysis for two stocks shows 


(1.8) Sn Calor STs 


in a sense that will be made precise. It is difficult to summarize the behavior of 
$, relative to S,* because of the arbitrariness of the sequence and the fact that we 
cannot assume a limiting distribution. For example, even the limit of (1/n)ln S* 
cannot be assumed to exist. 

The goal of uniformly achieving S¥ (X1, x;, .. . , X), as specified in (1.7), was 
partially achieved by Cover and Gluss (1986) for discrete-valued stock markets 
by using the theory of compound sequential Bayes decision rules developed 
in Robbins (1951), Hannan and Robbins (1955), and the game-theoretic 
approachability-excludability theory of Blackwell (1956a, b). Work on natural 
investment goals can be found in Samuelson (1967) and Arrow (1974). The vast 
theory of undominated portfolios in the mean-variance plane is exemplified in 
Markowitz (1952) and Sharpe (1963), while the theory of rebalanced portfolios for 
known underlying distributions is developed in Kelly (1956), Mossin (1968), Thorp 
(1971), Markowitz (1976), Hakansson (1979), Bell and Cover (1980, 1988), Cover 
and King (1978), Cover (1984), Barron and Cover (1988), and Algoet and Cover 
(1988). A spirited defense of utility theory and the incompatibility of utility theory 
with the asymptotic growth rate approach is made in Samuelson (1967, 1969, 1979) 
and Merton and Samuelson (1974). 

We see the present work as a departure from the above model-based investment 
theories, whether they be based on utility theory or growth rate optimality. Here 
the goal S; = max,l;.., b'x; depends solely on the data and does not depend 
upon underlying statistical assumptions. Moreover, Theorem 5.1, for example, 
provides a finite sample lower bound for the performance S, of the universal 
portfolio with respect to S,*. Therefore the case for success rests almost entirely 
on the acceptance of S7 as a natural investment goal. 

The performance of the universal portfolio is exhibited in Section 8, where 
numerous examples are given of S, (b), S7, and S, for various pairs of stocks. In 
general, volatile uncorrelated stocks lead to great gains of S7 and S, over the best 
buy-and-hold strategy. However, ponderous stocks like IBM and Coca-Cola show 
only modest improvements. 
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2. ELEMENTARY PROPERTIES 


We wish to show that the wealth $, generated by the universal portfolio strategy 
5, exceeds the value line index and that S, is invariant under permutations of the 
stock sequence x), X,...,X,. We will use the notation 


Q.1) W(b, F) = l In b'x dF(x), 
Q.2) W*(F) = max W(b, F), 


and we will denote by F, the empirical distribution associated with xj, x2, . .., 
x,, where 7, places mass 1/7 at each x;. In particular, we note that 


n 
(2.3) S* = max S,(b) = max J] b'x; = e". 
b b jl 


For purposes of comparison, we pay special attention to buy-and-hold strategies 
b =e; = (0,0,...,0,1,0,...,0), where e; is the jth basis vector. Note that 


n n 
(2.4) S, (ej) = Il ejX, = Il Xkj 
k=) k=1 


is the factor by which the jth stock increases in n investment periods. Thus S, (e;) 
is the result of the buy-and-hold strategy associated with the /th stock. 
We now note some properties of the target wealth Sj: 


PROPOSITION 2.1 (Target Exceeds Best Stock). 
(2.5) Sr = max S,(e;). 
1,2 m 


Proof. Sj is a maximization of S,(b) over the simplex, while the right-hand 
side is a maximization over the vertices of the simplex. c 


PROPOSITION 2.2 (Target Exceeds Value Line). 


m l/m 
(2.6) S* > (I sace) . 
j=l 


Proof. Each S,(e;) is € Sj. Oo 
The next proposition shows that the target exceeds the DJIA. 


PROPOSITION 2.3 (Target Exceeds Arithmetic Mean). If œ; 20, La;=1, 
then 


m 


(2.7) S; = >) ojS, (ej). 


iz 
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Proof. 
(2.8) S,(e)) € Sy, j212,...,m. L1 
Thus S% exceeds the arithmetic mean, the geometric mean, and the maximum 
of the component stocks. Finally, it follows by inspection that S7 does not 


depend on the order in which x;, x2, .. . , X, occur. 


PROPOSITION 2.4 Sz (x4, X3, .. ., X4) is invariant under permutations of the 
Sequence X}, Xz, .. . , Xs. 


Now recall the proposed portfolio algorithm in (1.3) with the resulting wealth 
^ n ^ 
(2.9) Sna = Il b,X,. 


It will be useful to recharacterize $, in the following way. 


LEMMA 2.5. 
n 
(2.10) $= TI Bix, = |S, (b) av | fab 
k=1 
where 
n 
(2.11) S, (b) = [[ b'x;. 
i=] 


Thus the wealth S, resulting from the universal portfolio is the average of 
S, (b) over the simplex. 


Proof. Note from (1.3) and (1.4) that 
^ k~-1 k—l 
(2.12) bx, = [vx, I] b'x; db | | I] bx: ab 
i=l i=l 
k k-l 
(2.13) =| II bx: av || [[ »x db. 
i=l j=] 


i= 


Thus the product in (2.9) telescopes into 
n n 
(2.14) $, = [[ bxi = | IT ^x av || av =} 5,0) av Tav. o 
kzl i=} 


We observe two properties of the wealth $, achieved by the universal portfolio. 
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PROPOSITION 2.5 (Universal Portfolio Exceeds Value Line Index). 


n m 1/m 

(2.15) S, 2 P Sen) . 
j=l 
Proof. Let F, be the empirical cumulative distribution function induced by 

X|,X2,...,X,. By two applications of Jensen’s inequality and writing 
(2.16) | S(b) av [| av = E,S,(b), 
we have 
(2.17) 


S, = EpS,(b) = Ey exp(nW(b, F,)} 


= exp(n£,W(b, F,)} = exp| nb In b'x dF, w) 
= exp| nb inl ` bier) ar) =e fne, X 2, b; j | In(ejx) dF, e9| 
j=l 


= ofa (=, Xi men arw) = "E «e. o 


Thus the wealth induced by the proposed portfolio dominates the value line index 
for any stock sequence X4, X2, . . . , Xp, for all n. 

Next, we observe that although b, depends on the order of the sequence 
Xi, Xy... X, the resulting wealth $, = II bi x, does not. 


PROPOSITION 2.6. S, is invariant under permutations of the sequence 
Xi, X2... , Xs. 


Proof. Since the integrand in 


n n 
S. n f _ ty 
(2.18) §, -II bi x, =| S) av || av =|, II b'x, av [| ev 


is invariant under permutations, so is S. 0 

This observation guarantees that the crash of 1929 will have no worse conse- 
quences for wealth S, than if the bad days of that time had been sprinkled out 
among the good. 


3. THE REASON THE PORTFOLIO WORKS 


The main idea of the portfolio algorithm is quite simple. The idea is to give an 
amount db/fp db to each portfolio manager indexed by rebalancing strategy b, 
let him or her make S,(b) = e"" F» db at exponential rate W(b, F,), and 
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pool the wealth at the end. Of course, all dividing and repooling is done *on paper" 
at time k, resulting in by. Since the average of exponentials has, under suitable 
smoothness conditions, the same asymptotic exponential growth rate as the 
maximum, one achieves almost as much as the wealth S,* achieved by the best 
constant rebalanced portfolio. The trap to be avoided is to put a mass distribution 
on the market distributions F(x). It seems that this cannot be done in a satisfactory 
way. 


4. PRELIMINARIES 


We now introduce definitions and conditions that will allow characterization of 
the behavior of S /S£. Let F,,(x) denote the empirical probability mass function 
putting mass 1/n on each of the points xj, Xj, ..., X, € RF. Let the portfolio 
b* = b*(F,) achieve the maximum of S,(b) = IU_,b'x; Equivalently, since 
S,(b) = e"” F2, the portfolio b*(F,) achieves the maximum of W(b, F, ). 
Thus, 


(4.1) S* = max S,(b) = e" (P2, 
beB 


DEFINITION. We shall say all stocks are active (at time n) if (b*(F,)); > 0, 
i>1,2,...,m, for some b* achieving W*(F,,). All stocks are strictly active if 
inequality is strict for all / and all b* achieving W*(F,). 


DEFINITION. We shall say xj, X;, .. . , x, € R” are of full rank if xj, X2, ... , X, 
spans R”. 


The condition of full rank is usually true for observed stock market sequences 
if n is somewhat larger than m, but the condition that all stocks be active often 
fails when certain stocks are dominated. The next definition measures the 
curvature of S,(b) about its maximum and accounts for the second-order 
behavior of §, with respect to S. 


DEFINITION. The sensitivity matrix function J(b) of a market with respect to 
distribution F(x), x e R?, is the (m — 1) x (m — 1) matrix defined by 


(xj 7 Xm) (x; — Xm) 
(b'x)? 
The sensitivity matrix J* is J(b*), where b* = b*(F) maximizes W(b, F). 
We note that 


(4.2) Jj(b) =| dF(x, 1<i,j<m-1. 


| O'W((bf, bf... bes 1 — Er bf), P) 


(4.3) J= ab; ðb, 
iO Uj 


LEMMA 4.1. J* is nonnegative definite. It is positive definite if all stocks are 
strictly active. 
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5. ANALYSIS FOR TWO ASSETS 


We now wish to show that §,/S* ~ J2x/nJ,, where J, is the curvature or 
volatility index. We show in detail that J27/nJ, is an asymptotic lower bound on 
§,/S*, and we develop explicit lower bounds on §,,/S* for all n and any market 
sequence X,,...,X,. We develop an upper bound by invoking strong conditions 
on the market sequence. Section 6 outlines the proof for m assets. 

We investigate the behavior of $, for m — 2 stocks. Consider the arbitrary 
stock vector sequence 


(5.1) x; = (Xp X9)eRA,  i-1,2,.... 


We now proceed to recast this two-variable problem in terms of a single variable. 
Since the portfolio choice requires the specification of one parameter, we write 


(5.2) b = (b, 1 — b), 0Ozbz<l, 


and rewrite S, (b) as 


n 


(5.3) S,(b) = [] (bx + (1 — 5)x2,  0sbzt. 
iz] 
Let 
(5.4) S; = max S,(b), 
O<sb<] 


and let b denote the value of b achieving this maximum. Section 8 contains 
examples. 
The universal portfolio 


(5.5) b, = (by, 1 — by) 
is defined by 


^ 1 1 
(5.6) b, =| bS,(b) av [| S,(b) db 
0 0 
and achieves wealth 
n 
(5.7) $, = [[ bixa + (1 — boxi). 
i=] 
Let 
1 
(5.8) W,(b) =~ In $, (b) 
1 n 
(5.9) = n In (bx; + (1 — b)x;2) 
i=l 


(5.10) = | In(bx, + (1 — b)xj) dF,(x), 
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where F,(x) is the empirical cdf of (xi]/-;. By Lemma 4.1, the wealth $, 
achieved by the universal portfolio b, is given by 


^ 1 

(5.11) $, =| e^t av. 
In order to characterize the behavior of S, we define the following functions 
of the sequence x,,X2,...,x,. Define the relative range r, of the sequence 
Xb X5, ..., X, to be 

maxíx;; 
(5.12) n = 20( matt J 

miní(x;) 


where the minimum and maximum are taken over i = 1,2,...,; /= 1,2. Let 


n 


l (xi — Xa)? 
5.13 J,=— > 7 
(5.13) " n (bax, + (1 — bz)xo)?' 


i=l 
where bz? maximizes W,(b). Let 


(5.14) Wr = max W,(b) = W,,(b¥). 

zbzl 
Thus 7, corresponds to the relative range of the price relatives and J, denotes the 
curvature of In S,(b) at the maximum. 


THEOREM 5.1. Let Xj, X», ..., be an arbitrary sequence of stock vectors in 
RÀ, , and let a, = min{b*, 1 — b*, 3J,/71). Then for any 0 < & < 1, and for any 


S, 2r _ 2 ex —&^(1 + £)a,J4n 
S*#~nJ,(1+6) el oaJ, P 2 ' 


REMARKS. This theorem says roughly that A > V2r/nJ„,. So the 
universal wealth is within a factor of C/4/n of the (presumably) exponentially large 
S7. It will turn out that every additional stock in the universal portfolio costs an 
additional factor of 1/Jn. But these factors become negligible to first order in 
exponent. It is important to mention that this theorem is a bound for each n. The 
bound holds for any stock sequence with bound a, and volatility J,. 


Proof. We wish to bound $, — f e". qb. We expand W,(b) about the 
maximizing portfolio bř, noting that W,,(b) has different local properties for each 
n and indeed a different maximizing b*. We have 


pt 
(5.6) Wy(b) = W, (b3) + (b = b) Wz Co) + CP” wb) 


b — b»? 
"n ) 


s WO, 
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where b, lies between b and b*. 
We now examine the terms. 
(i) The first term is 


1 
(5.17) W,(bj) = W*(F,) = z 08 Sn 
where S7 is the target wealth at time n. 


(ii) The second term is 


Xx) — X2 


(5.18) W (bx) = bx 


dF,(x) 


= 0, if 0 < b% <1, 


by the optimality of b*. 
(iii) The third term is 


(x; — x? 


(b¥x)? dF,(X) = — Jt. 
n 


(5.19) Wy (bn) -| 
Thus, W, (bž) = 0, with strict inequality if 0 < b; < 1 and xj # xj for some 
time 7. This term provides the constant in the second-order behavior of S,. 

(iv) The fourth term is 


(x, — x2)? 


(5.20) wir(d,) = 2f tins - Bx 


dF, (x). 


We have the bound 


met 2(X| — x3) 
5.21 W;(b,)| = - - F 
(5.21) | Wa" (Br) | f (Guo do Bay an) 
(5.22) <z, forall b,e[0, 1]. 
Thus 


_ K*]3 23 
(5.23) S,(b) = exp( ni = S (b — b*)?J, — "emm 


for 0 <b < 1, where 


(x, — x) 


(bx, + (1 — b¥)x2)? dF, (x). 


(5.24) J, = | 


We now make the change of variable 
(5.25) u = |n (b — b), 
where the new range of integration is 


(5.26) — nbi sus n (1 — bi). 
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Then, noting e"? = S5, we have 
^ l 
(5.27) $,- [, Sn(b) db 
Yn (1—ba) 1 1 
.28 > — >J, — —= |u? |r] du. 
(5.28) E h zi "LL apy islet) a 


We wish to approximate this by the normal integral. To do so let 0 < & < 1 and 
note that 


1 ul?r3 1 
(5.29) -3 u^, — nos EI uL, (1 + 8) 
for 
(5.30) us 3ejnJ,/7. 
Let ® denote the cdf of the standard normal 

1 

(5.31) $(x) = — [. e^"? du, 
and let 
(5.32) a, = min(b*, 1 — b, 3J,/ri). 


Thus a, is a measure of the degree to which S,(b) has a maximum of reasonable 
curvature within the unit interval. Then from (5.28), for any 0 « & < 1, 


Yn(1-bi l l 
(5.33) in, = | À Pep[ -za — — Jule JE: 


(SE Ying 2°" 6 Ín 
5.34 > [77 exp -lpgnasla 
( ' ) = Xp 2" n è H 
œ ] 2 — vn ane 1 2 
(5.35) =| exp} —5 WJ, (1 +e) du - 2| exp} - 5! J, (1 +8)} du 


2T 8x 
(5.36) = Ta + — Feu * Cei FS). 


We use the inequality 


5.37 l e MEA b(—-x)< Le 2] 
-~|{l-s - xp|- =>], 
( ) Jax? *P 2 x? 2zx? P 2 


for x > 0, to obtain the bound 


1 e anJ, (1 + £) 
(5.38) &(—sa, JNJ (1 + 6)) < err or | . 
NUT A28 a2 nJ,(1 + 8) 2 


Hence, 
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{ns 2T 2 & a?nJ (1 + 8) 
(5.39) "= — exp, — ————————— 
S; VJ, (1 + 6) £a, J, (1 + €) Jn 2 
for any O < e x 1, for all n, and all xj, X2, ..., which proves the theorem. O 


The explicit bounds in Theorem 5.1 may be useful in practice, but a cleaner 
summary of performance is given in the following weaker theorem. 


THEOREM 5.2. Let x;, X;,... be a sequence of stock vectors in RÀ. and 
supposed xb; < 1 — ô, T, X v < oo, and J, = J > 0, for a subsequence of times 
nynj,.... Then 

SS; 
(5.40) lim inf — 


— ml 
n>œ [2x/nJ, = 


along this subsequence. 


Proof. The conditions of the theorem, together with Theorem 5.1 imply 
§,/S* ] 2lJ, 


MM — 
[2s/nJ, Dt & e, |2znJ min(ó, 3J/7?] 


where 7 is the bound ratio, and where we are free to choose £, € [0, 1] at each n. 
Noting that J, « 7? < œ and letting &, = n^ !/^ proves the theorem. a 

We have just shown that S/S is as good as J2m/nJ,. We now show that it is 
no better. For this we consider a subsequence of times such that W,(b) is 
approximately equal to some function W(b), and we argue that upper bounds on 
f) e^"? db suffice to limit the performance of the wealth §,. Toward that end, 
let us consider functions W such that 


(5.41) 


(i) W(b) is strictly concave on [0, 1]. 
(5.42) (ii) W "(b) is bounded on [0, 1]. 
(ii) W(b) achieves its maximum at b*e (0, 1). 


We plan to pick out a subsequence of times such that W,(b) = (1/n): 
E;.,ln b'x; approaches W(b). We can expect such limit points from Arzelà's 
theorem on the compactness of equicontinuous functions on compact sets. 
Let b? maximize W,(b). Let {n;} be a subsequence of times such that for 


N=Ny,M,..., 
(i) W,(b) < Wb), Oxzbczl. 
(5.43) 
(i) Wb) > W"(b*). 


Recall the notation J, = —W;(b;). The following theorem establishes the 
tightness of the lower bound in Theorem 5.2. 


THEOREM 5.3. For any X,,X2,... c RÀ and for any subsequence of times 
ni, m, ... such that W,(b) satisfies conditions (5.43) for some W(b) satisfying 
(5.42), we have 
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(5.44) SS? ~ [2x/nJ, 


along the subsequence. 


Proof. The lower bound follows from Theorem 5.2. From Laplace’s method 
of integration we have 


l . 2a 
5.45 me) dy ~ ere) | — 
( ) fe e n|g"(u*)| 


if g is three times differentiable with bounded third derivative, strictly concave, 
and the u* maximizing g(-) is in the open interval (0, 1). Consequently, 


^ l 1 
646 $, =[ e" qb < [ e" db 


0 0 
* 2T 2T 2T 
~ nW(b") ee S* o o S* 2 
e aawo awon ^ Na 


and the theorem is proved. L1 


6. MAIN THEOREM 


Here we prove the result for m assets under the assumption that all stocks are 
active and of full rank and b; (F,) > b*eint(B). We discuss removing the 
conditions in Section 9. For example, lack of full rank reduces the dimension from 
m to m’, as does the existence of inactive stocks. Finally, b2(F,) need not have 
a limit, in which case we can describe the behavior of $, for convergent 
subsequences of b*(F,,), as well as develop explicit bounds for all n. 

From Lemma 2.1, we have 


$,- [, S, (b) al, db, 
where 


k 
S(b) = lI b'x;, 


i=] 


Dei = | bS,(b) db Ji S, (b) db, 
^ n ^^ 
Sn = Il b, Xg. 
k=1 
A summary of the performance of b, is given by the following theorem. 


THEOREM 6.1. Suppose x\,%,...€[a,c]",0O<asc< om, and at a 
subsequence of times nj, n3, ... , W,(b) 7W(b) for beB, JX > J*, b; > b*, 
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where W (b) is strictly concave, the third partial derivatives of W are bounded on 
B, and W(b) achieves its maximum at b* in the interior of B. Then 


$, - | ert (m — 1)! 


S EN |J*|12 


n 
in the sense that the ratio of the right- and left-hand sides converges to 1 along 
the subsequence. 


(6.1) 


Proof. (Outline) We define 


(6.2) C= [t 0s... 04:62 0, ers 1] 
and 
n 
(6.3) S,(c) = II b'(c)x;, cec, 
i-i 
where 
m-1 
(6.4) b(c) — [esto cse 1- M a). 
i=l 
Note that 
(6.5) Vol(C) ={ à 
. = e = ——,, . 
° f. (m — 1)! 


We shall prove only the lower bound associated with (6.1). From Lemma 2.1, the 
universal portfolio algorithm yields 


(6.6) $,- | S, (b) av [| db = ES, (b), 


where b is uniformly distributed over the simplex B. Since a uniform distribution 
over B induces a uniform distribution over C, we have 


(6.7) $,- (m-1) [5.0 dc. 
We now expand S, (c) in a Taylor series about €* = (bf, ..., bj), where b* 


maximizes W(b, F,). We drop the dependence of b* on n for notational 
convenience. By assumption, b? > 0, for all ij. We have 


(6.8) S,(c) = e", 
where 
1 n 
(6.9) W,(e) =- 9) In b'x; = | In b'x dF, (x) 


i=] 


$ Ep, Inb'X 
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and 


(6.10) b- (s 1 — DD . 
Expanding W,(c), we have 


(6.11) W,(c) = W,(c*) + (c — c*)' VW, (c*) -5e — e*)'J; (c — c*) 


1 
ur 2, (ci — c) (cj — Cf) (cy — c Er, 
i Jk 


S? (č) 
where é = Ac* + (1 — A)e, for some 0 < ^ < 1, where ^ may depend on c, and 
m-1 m-l 
(6.12) S(c) = 5, cjX; + ( -5 cı) X,. 
i=] i=l 
Here 
n 
(6.13) S,(e) = [[ b'(e)x;, 
izi 
m-l! 
bo = [aio a), 
i=l 
m-l! m-i 
W(e) = | in >) oxi € -5 2 dF,(x), 
izl 1 
OW, Xj — Xm 
= dF, ; 
ac; | se) hn) 
Ww, (Xi — Xm) (Xj — Xm) 
-—-|————5—————— dF, ; 
ðc; ðc; | S?(c) a(x) 
9?W,(c*) 
eo [——h5 s 
(6.14) Ji = | óc, àc; | 


The condition that all stocks be strictly active implies by Lemma 4.1 that 
|J7 | > 0, where |:| denotes determinant. We treat the terms one by one. 


(i) By definition of b*, 
(6.15) W(c*) = W(b*, F,) = W*(F,). 


(ii) The second term is 0 because b* is in the interior of B, W,(b) is differenti- 
able, and b* maximizes W,. Thus, 
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àW,(c*)  . Xi;— Xm 
(6.16) à; | ^ b"X 


= 0, ij-1,2,...,m — l. 


(iii) The third term is a positive definite quadratic form, where 
Jj = J*(b*(F,)). 
(iv) For the fourth term, 


(6.17) 
A i (ci = Ye = «fX, = of Ep AMAL t A 

we examine 

6.18 En ME 

We note 

(6.19) S3(é) = (bx) = » z > qj, 


since X; = a for all į. Also since X; — Xm x 2c, we have 


3 3 
7 (X; — Xm) (X; — Xm) (X, — Xm) _ 8c 
(6.20) TE ao 


We now make the change of variable u = Vn (c — c*), where we note the new 
range of integration u € U = Jn(C — c*). Thus 


n n 
(6.2) | $,()- epfaw te) — 7 (eTe e e") t1 X 


1 1 
— LEN 
= exo( ns REN ) J . 


where 
n5 X; — Xm) (X; — Xm) (X, — X, 
(6.22) 5 = 5 sui Er, ( i m)( L m) (Xx 2). 
3 AKI S (č) 
Note that 
m-1 3 8c 
^ 0 [Gus 
i= 
Observing 


1/2 
(6.24) lvl = (X n?) Jm 
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yields 
1 3/2 3g 3 
(6.25) S,(c) = exo[ nz - LuJ?u - ” luli 8c 
2 3 
3Ín a 


The lower bound on $, becomes 


(6.26) $, = (m — D , Sa (€) de 


32 3 m-1 
> —] ts* 1 m llull 8c l 
(m ) n fevel- u'Jfu 3n PE Ja du, 
which can now be bounded using the techniques in the two-stock proof. The upper 
bound follows from Laplace’s method of integration, as in Theorem 5.3, from 
which the theorem follows. L] 


7. STOCHASTIC MARKETS 


Another way to see the naturalness of the goal S* = e" (P (Fr). Fs) is to consider 


random investment opportunities. Let X,, X5,... be independent identically 
distributed (i.i.d.) random vectors drawn according to F(x), x e R", where F is 
some known distribution function. Let S,(b) — IIj.,b'X; denote the wealth at 
time n resulting from an initial wealth Sy = 1 and a reinvestment of assets 
according to portfolio b at each investment opportunity. Then 


II b'X; = exp| Y In vx, 
i=] j 


i=] 


(7.1) S,(b) 


exp{n(Elnb'X + 0,(1))} = exp(n(W(b, F) + 0,(1))} 


by the strong law of large numbers, where the random variable 0,(1) > 0, a.e. 
We observe from the above that, to first order in the exponent, the growth rate 
of wealth S, (b) is determined by the expected log wealth 


(7.2) W(b, F) = | In b'x dF(x) 


for portfolio b and stock distribution F(x). 

It follows for X,,X;,,...,ii.d. ~ F that b*(F) achieves an exponential 
growth rate of wealth with exponent W*(F). Moreover Breiman (1961) establishes 
for i.i.d. stock vectors for any nonanticipating time-varying portfolio strategy 
with associated wealth sequence S, that 


— 1 
(7.3) lim " In S, x W*{F}, a.e. 


Finally, it follows from Breiman (1961), Finkelstein and Whitley (1981), Barron 
and Cover (1988), and Algoet and Cover (1988), in increasing levels of generality 
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on the stochastic process, that lim,.,4, n7! In S,/S* x 0, a.e., for every sequential 
portfolio. Thus b*(F) is asymptotically optimal in this sense, and W*(F) is the 
highest possible exponent for the growth rate of wealth. Thus Sž is asympto- 
tically optimal. 

We omit the proof of the following. 


THEOREM 7.1. Let X; be i.i.d. ~ F(x). Let b*(F) be unique and lie in the 
interior of B. Then the universal portfolio b, yields a wealth sequence S, 


satisfying 
1 ^ 
(7.4) zin $, > WF), ae. 


Thus, in the special case where the stocks are independent and identically 
distributed according to some unknown distribution F, the universal portfolio 
essentially learns F in the sense that the associated growth rate of wealth is equal 
to that achievable when F is known. 


8. EXAMPLES 


We now test the portfolio algorithm on real data. Consider, for example, Iroquois 
Brands Ltd. and Kin Ark Corp., two stocks chosen for their volatility listed on 
the New York Stock Exchange. During the 22-year period ending in 1985, 
Iroquois Brands Ltd. increased in price (adjusted in the usual manner for 
dividends) by a factor of 8.9151, while Kin Ark increased in price by a factor of 
4.1276, as shown in Figure 8.1. 

Prior knowledge (in 1963) of this information would have enabled an investor 
to buy and hold the best stock (Iroquois) and earn a 791% profit. However, a 
closer look at the time series reveals some cause for regret. Table 8.1 lists the 
performance of the constant rebalanced portfolios b — (b, 1 — b). The graph of 
S,(b) is given in Figure 8.2. For example, reinvesting current wealth in the 
proportions b — (0.8, 0.2) at the start of each trading day would have resulted in 
an increase by a factor of 37.5. In fact, the best rebalanced portfolio for this 
22-year period is b* = (0.55, 0.45), yielding a factor S* = 73.619. Here S; is the 
target wealth (with respect to the coarse quantization. of B= [0,1] we have 
chosen). The universal portfolio b. achieves a factor of $, — 38.6727. While $, is 
short of the target, as it must be, $, dominates the 8.9 and 4.1 factors of the 
constituent stocks. The daily performance of both stocks, the universal portfolio, 
and the target wealth are exhibited in Figure 8.3. The portfolio choice b, as a 
function of time & is given in Figure 8.4. 

To be explicit in the above analysis, we have quantized all integrals, resulting 
in the replacements of 


(8.1) S; = max S,(b) by S= max S,(i/20) 
b i=0, t,...,20 


and 
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TABLE 8.1 
Iroquois Brands Ltd versus Kin Ark Corp 


b S, (b) 
1.00 8.9151 
0.95 13.7712 
0.90 20.2276 
0.85 28.2560 
0.80 37.5429 
0.75 47.4513 
0.70 57.0581 
0.65 65.2793 
0.60 71.0652 
0.55 73.6190 
0.50 72.5766 
0.45 68.0915 
0.40 60.7981 
0.35 51.6645 
0.30 41.7831 
0.25 32.1593 
0.20 23.5559 
0.15 16.4196 
0.10 10.8910 
0.05 6.8737 
0.00 4.1276 


Target wealth: S* — 73.619 

Best rebalanced portfolio: bf = 0.55 
Best constituent stock: 8.915 
Universal wealth: S, = 38.6727 


(8.2) 


^ 1 1 ^ j 
De = [, bS, (b) av ||; S,(b) db by by, = » — Sx 
i=0 


The resulting wealth factor 


n 
(8.3) $, = [[ &x. 


k=l 


is calculated using 


(8.4) b= D 55 5) [> 2 E 


Telescoping still takes place under this quantization and it can be verified 


that $, in (8.3) can be expressed in the equivalent form 


^ 1 2 i 
(8.5) S, = n» S) 


Thus $, is the arithmetic average of the wealths associated with the constant 


rebalanced portfolios. 
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Finally, note the calculation of the portfolio Prat = (Brats 1 — bad in this 
example. Merely compute the inner product of the b and S,(b) columns in Table 
8.1 and divide by the sum of the S,(b) column to obtain b. ;. Note in particular 
that the universal portfolio b,,, is not equal to the log optimal portfolio 
b*(F,) = (0.55, 0.45) with respect to the empirical distribution of the past. 

A similar analysis can be performed on Commercial Metals and Kin Ark over 
the same period. Here Commercial Metals increased by the factor 52.0203 and Kin 
Ark by the factor 4.1276 (Figure 8.5). It seems that an investor would not want 
any part of Kin Ark with an alternative like Commercial Metals available. Not so. 
The optimal constant rebalanced portfolio is b* — (0.65, 0.35), and the universal 
portfolio achieves $, = 78.4742, outperforming each stock. See Table 8.2. 

Next we put Commercial Metals (52.0203) up against Mei Corp (22.9160). Here 
Sž = 102.95 and $, — 72.6289, as shown in Figure 8.6 and Table 8.3. However, 
IBM and Coca-Cola show a lockstep performance, and, indeed, $, barely out- 
performs them, as shown in Figure 8.7. 

A final example crudely models buying on 50% margin. Suppose we have four 
investment choices each day: Commercial Metals, Kin Ark, and these same two 
stocks on 50% margin. Margin loans are settled daily at a6% annual interest rate. 
The stock vector on the ith day is 


(8.6) xX; = (x,2xj—l—r,y42yj—1-—r), 


TABLE 8.2 


Commercial Metals versus Kin Ark 


b $,(b) 
1.00 52.0203 
0.95 68.2890 
0.90 85.9255 
0.85 103.6415 
0.80 119.8472 
0.75 132.8752 
0.70 141.2588 
0.65 144.0035 
0.60 140.7803 
0.55 131.9910 
0.50 118.6854 
0.45 102.3564 
0.40 84.6655 
0.35 67.1703 
0.30 51.1127 
0.25 37.3042 
0.20 26.1131 
0.15 17.5315 
0.10 11.2883 
0.05 6.9704 
0.00 4.1276 


Target wealth: S* = 144.0035 

Best rebalanced portfolio: b} = 0.65 
Best constituent stock: 52.0203 
Universal wealth: § = 78.4742 
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TABLE 8.3 
Commercial Metals versus Mei Corp 


b S, (b) 
1.00 52.0203 
0.95 61.0165 
0.90 70.0625 
0.85 78.7602 
0.80 86.6815 
0.75 93.4026 
0.70 98.5414 
0.65 101.7927 
0.60 102.9589 
0.55 101.9691 
0.50 98.8869 
0.45 93.9033 
0.40 87.3172 
0.35 79.5057 
0.30 70.8890 
0.25 61.8932 
0.20 52.9162 
0.15 44.3012 
0.10 36.3178 
0.05 29.1538 
0.00 22.9160 


Target wealth: S* — 102.9589 

Best rebalanced portfolio: b = 0.60 
Best constituent stock: 52.0203 
Universal wealth: $, = 72.6289 


(8.7) r = 0.000233, 


where x; and y; are the respective price relatives for Commercial Metals and 
Kin Ark on day į. Plunging on margin into Commercial Metals yields a factor 
19.73, plunging into Kin Ark a factor of 0.0 (to four significant digits). Good as 
these stocks are, they cannot survive the down factors induced by the leverage. 
But a random sample of the simplex of portfolios listed in Table 8.4 reveals 
$, = 98.4240, while the optimal rebalanced portfolio b* = (0.2, 0.5, 0.1, 0.2) 
results in a factor $* — 262.4021. Clearly 98.4 beats the factor of 78 achieved 
when margin is unavailable. Both factors exceed the performance 52.02 of the best 
stock. . 

We observe that 5, — 98.4 exceeds the factor $, — 78.47 obtained for these 
stocks when margin is unavailable. This is borne out by the fact that b* is positive 
in each component, calling for a small amount of leverage in the a posteriori 
optimal rebalanced portfolio. 


9. THE GENERAL UNIVERSAL PORTFOLIO 


If the best rebalanced portfolio b, lies in the interior of a boundary k-face, then 
only & stocks are active in the best rebalanced portfolio. Thus we expect to obtain 
the previous bounds on S,/S* with m replaced by k. This is achieved if we start 
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TABLE 8.4 
Two Stocks with Margin 


Commercial metals 52.0203 = II; x; 

Commercial metals on margin 19.7335 = M; x; 1-7), 

Kin Ark 4.1276 = F yj 

Kin Ark on margin 0.0000 = N; (2y;j-1—7) 

r = 0.000233/day = 6%/year 

$t = 262.4021 bt = (0.2, 0.5, 0.1, 0.2) 

Best constituent stock 52.0203 

Wealth achieved by universal portfolio $, = 98.4240 

b S, (b) 

(0.8, 0.2, 0.0, 0.0) 57.0535 
(0.8, 0.1, 0.0, 90.1) 148.9951 
(0.6, 0.1, 0.1, 02) 207.1143 
(0.6, 0.0, 0.4, 0.0) 140.7803 
(0.5, 0.0, 0.2, 0.3) 60.8358 
(0.4, 0.0, 0.4, 0.2) 47.6074 
(0.3, 0.5, 0.1, 0.1) 212.8928 
(0.3, 0.4, 0.1, 02) 261.0452 
(0.3, 0.2, 0.2, 0.3) 89.0330 
(0.3, 0.1, 0.2, 0.4) 19.4840 
(0.3, 0.0, 0.1, 0.6) 0.7700 
(0.2, 0.7, 0.0, 0.1) 121.0142 
(0.2, 0.2, 0.3, 0.3) 45.2562 
(0.1, 0.8, 0.1, 0.0) 67.5882 
(0.1, 0.5, 0.2, 0.2) 233.6328 
(0.1, 0.4, 0.2, 0.3) 112.6695 
(0.4, 0.3, 0.1, 0.5) 12.7702 
(0.1, 0.2, 0.4, 0.3) 19.4840 
(0.1, 0.1, 0.2, 0.6) 0.2354 
(0.00 0.5, 0.4, 0.1) 225.2524 
(0.00 0.4, 0.2, 0.4) 31.8076 
(0.2, 0.5, 0.1, 0.2) 262.4021 


with some mass on each face. To accomplish this, we let u, be the measure 
corresponding to the uniform distribution on B(S) = (be R":Y b; = 1, b; = 0, 
ieS^J, where SS (1,2,..., m]. Thus y, puts unit mass on the | S|-dimensional 
face of the portfolio simplex. 

Let » be the mixture of these measures given by 


I 
(9.1) BT omy labs 


where the sum is over all S + Ø, Sc (1,2,...,m)]. The generalized 
universal portfolio now becomes 
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(9.2) [ES l bS, (b) p (db) Ji S,(b) u (db) 
with 

(9.3) S,(b) = II b/x;, So(b) = 1. 


i=l 


To state the results we define JP’ (F,) to be the k x k sensitivity matrix with 
respect to the active stocks S, |S| = k, where S is the smallest set of stocks such 
that all optimal rebalanced portfolios b*(F) are in the interior of B(S). Then 


$, (k-1) 2r 
(9.4) Sp Nd; e VETT 
n 


will be the asymptotic behavior of S /Sg. 


10. CONCLUDING REMARKS 


We now try to be sensible and ask how the universal portfolio works in practice. 
Of course, the examples are encouraging, as the universal portfolio outperforms 
the constituent stocks. However, we have ignored trading costs. In practice we 
would not trade daily, but only when the current empirical holdings were far 
enough from the recommended b,. (À rule of thumb might be to trade only if the 
increase in W is greater than the logarithm of the normalized transaction costs.) 

We are really interested in whether $, will *take off," leaving the stocks behind. 
We first discuss the target wealth S*. The best rebalanced portfolio b*(F,) based 
on prior knowledge of the stock sequence x;, X2, . .. , X, yields wealth S = en. 
Now S7 grows exponentially fast to infinity under mild conditions. For example, 
if one of the constituent stocks is a risk-free asset with interest rate r > 0, then 
W* = In(1 + r) > 0, for all n, and S* > (1 + r)" > œ. Since the universal port- 
folio yields 


(10.1) $,- [n [ws - o(R5)). 


it follows that S, will tend to infinity, and $, will have the same exponent as $5 
differing only in terms of order (In n)/n. 

What state of affairs do we expect in the real world? Certainly we expect the 
stock sequence to be of full dimension m for n slightly greater than m. However, 
we do not expect all stocks to be active. But we do expect that two or more stocks 
will be active. This 1s important because it guarantees that the target growth rate 
WF will be strictly greater than the growth rate of the constituent stocks. Conse- 
quently, we believe that the universal portfolio will achieve 


S /S,(e;) > œ, i=1,2,...,m, 


exponentially fast, where S,(ej) is the wealth relative of the ith stock at time n. 
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However, n may need to be quite large before this exponential dominance 
manifests itself. In particular, we need n large enough that the difference in 
exponents between Sf and the stocks overcomes the O((In n)/n) penalties 
incurred by universality. We conclude that $, will leave the constituent stocks 
exponentially behind if there are at least two strictly active stocks in the best 
rebalanced portfolio. 
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THE COST OF ACHIEVING THE BEST PORTFOLIO 
IN HINDSIGHT 


ERIK ORDENTLICH anp THOMAS M. COVER 


For a market with m assets consider the minimum, over all possible sequences of asset prices 
through time n, of the ratio of the final wealth of a nonanticipating investment strategy to the 
wealth obtained by the best constant rebalanced portfolio computed in hindsight for that price 
sequence. We show that the maximum value of this ratio over all nonanticipating investment 
strategies is V, = [X 2 "mp MC T n, 9))]7*, where H(-) is the Shannon entropy, 
and we specify a strategy achieving it. The optimal ratio V,, is shown to decrease only polynomially 
in n, indicating that the rate of return of the optimal strategy converges uniformly to that of the 
best constant rebalanced portfolio determined with full hindsight. We also relate this result to the 
pricing of a new derivative security which might be called the hindsight allocation option. 


1. Introduction. Hindsight is not available when it is most useful. This is true in 
investing where hindsight into market performance makes obvious how one should 
have invested all along. In this paper we investigate the extent to which a nonantici- 
pating investment strategy can achieve the performance of the best strategy determined 
in hindsight. 

Obviously, with hindsight, the best investment strategy is to shift one's wealtb daily 
into the asset with the largest percentage increase in price. Unfortunately, it is hopeless 
to match the performance of this strategy in any meaningful way, and therefore we must 
restrict the class of investment strategies over which the hindsight optimization is per- 
formed. Here we focus on the class of investment strategies called the constant rebalanced 
portfolios. A constant rebalanced portfolio rebalances the allocation of wealth among the 
available assets to the same proportions each day. Using all wealth to buy and hold a 
single asset is a special case. Therefore the best constant rebalanced portfolio, at the very 
least, outperforms the best asset. 

In practice, one would expect the wealth achieved by the best constant rebalanced 
portfolio computed in hindsight to grow exponentially with a rate determined by asset 
price drift and volatility. Even if the prices of individual assets are going nowhere in the 
long run, short-term fluctuations in conjunction with constant rebalancing may lead to 
substantial profits. Furthermore, the best constant rebalanced portfolio will in all likelihood 
exponentially outperform any fixed constant rebalanced portfolio which includes buying 
and holding the best asset in hindsight. 

The intuition that the best constant rebalanced portfolio is a good performance target 
is motivated by the well-known fact that if market returns are independent and identically 
distributed from one day io the zext, the expected utility, for a wide range of utility 
functions including the log utility, is maximized by a constant rebalanced portfolio strat- 
egy. Additionally, ‘‘turnpike’’ theory (see Huberman and Ross 1983, Cox and Huang 
1992, and references therein) finds an even broader class of utility functions for which, 
by virtue of their behavior at large weaiths, constant rebalancing becomes optimal as the 
investment horizon tends to infinity. In all these settings, the optimal constant rebalanced 
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portfolio depends on the underlying distribution, which is unknown in practice. Targeting 
the best constant rebalanced portfolio computed in hindsight for the actual market se- 
quence is one way of dealing with this lack of information. 

The question is: To what extent can a nonanticipating investment strategy perform as 
well as the best constant rebalanced portfolio determined in hindsight? We address this 
question from a distribution-free , worst-sequence perspective with no restrictions on asset 
price behavior. Asset prices can increase or decrease arbitrarily, even drop to zero. We 
assume no underlying randomness or probability distribution on asset price changes. 

The analysis is best expressed in terms of a contest between an investor and nature. 
After the investor has selected a nonanticipating investment strategy, nature, with full 
knowledge of the investor's strategy (and its dependence on the past), selects that se- 
quence of asset price changes which minimizes the ratio of the wealth achieved by the 
investor to the wealth achieved by the best constant rebalanced portfolio computed in 
hindsight for the selected sequence. The investor selects an investment strategy that max- 
imizes the minimum ratio. Ín the main part of the paper we determine the optimum 
investment strategy and compute the max-min value of the ratio of wealths. 

It may seem that such an analysis is overly pessimistic and risk averse since in reality 
there is no deliberate force trying to minimize investment returns. What is striking, how- 
ever, is that if investment performance is measured in terms of rate of return or exponential 
growth rate per investment period, even this pessimistic point of view yields a favorable 
result. More specifically, the main result of this paper is the identification of an investment 
algorithm that achieves wealth $, at time n that satisfies 


A n 
(1) S, > s: / Y ( pom ees Hun) — SEY,, 
My eee Mn 


Zaun 


for every market sequence, where S7 is the wealth achieved by the best constant rebal- 
anced portfolio in hindsight, and H(p,,..., Pa) = ~ È p; log p; is the Shannon entropy 
function. l 

Since it can be shown that V, ~ y2/(z) (for m = 2 assets), this factor, the price of 
universality, will not affect the exponential growth rate of wealth of $, relative to Sx, 
i.e, lim inf( 1/n) log( $,/55) = 0. In other words, the rate of return achieved by the 
optimal strategy converges over time to that of the best constant rebalanced portfolio 
computed in hindsight, uniformly for every sequence of asset price changes. The bound 
(1) is the best possible; there are sequences of price changes that hold $,/5; to this bound 
for any nonanticipating investment strategy. 

The problem of achieving the best portfolio in hindsight leads naturally to the consid- 
eration of a new derivative security which might be called the hindsight allocation option. 
The hindsight allocation option has a payoff at time n equal to S7, the wealth earned by 
investing one dollar according to the best constant rebalanced portfolio (the best constant 
allocation of wealth) computed in hindsight for the observed stock and bond performance. 
This option might, for example, interest investors who are uncertain about how to allocate 
their wealth between stocks and bonds. By purchasing a hindsight allocation option, an 
investor achieves the performance of the best constant allocation of wealth determined 
with full knowledge of the actual market performance. 

In $4 we argue that the max-min ratio computed above yields a tight upper bound on 
the price of this option. Specifically, Equation (1) suggests that $, is an arbitrage oppor- 
tunity if the option price is more than !/V,. We compare this bound to the no-arbitrage 
option price for two well-known models of market behavior, the discrete time binomial 
lattice model and the continuous time geometric Wiener model. We consider only the 
simple case of a volatile stock and a bond with a constant rate of return. It is shown that 
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the no-arbitrage prices for these restricted market models have essentially the same as- 
ymptotic cvn behavior as the upper bound 1/V,. Different model parameter choices ( vol- 
atility, interest rate) can yield more favorable constants c. 

The pricing of the hindsight allocation option in the binomial and geometric Wiener 
models can also be thought of in terms of the max-min framework. The models can be 
viewed as constraints on nature's choice of asset price changes. The underlying distri- 
bution in the geometric Wiener model serves as a technical device for constraining the 
set of continuous asset price paths from which nature can choose. Because these markets 
are complete for the special case of one stock and one bond, the best constant rebalanced 
portfolio computed in hindsight can be hedged perfectly given a unique initial wealth. 
This wealth corresponds to the no-arbitrage price of the hindsight allocation option. Fur- 
thermore, the max-min ratio of wealths obtained by the investor and nature, when nature 
is constrained by these models, must be the reciprocal of this unique initial wealth. 

Early work on universal portfolios ( portfolio strategies performing uniformly well with 
respect to constant rebalanced portfolios) can be found in Cover and Gluss (1986), Larson 
(1986), Cover (1991), Merhav and Feder (1993), and Cover and Ordentlich (1996). 

Cover and Gluss (1986) restrict daily returns to a finite set and provide an algorithm, 
based on the approachability-excludability theorem of Blackwell (1956a, 1956b), that 
achieves a wealth ratio $,/S* = e *", for m = 2 stocks, where c is a positive constant. 
Larson (1986), also restricting daily returns to a finite set, uses a compound Bayes ap- 
proach to achieve Ŝ,IS* = e7", for arbitrarily small 6 > 0. Cover (1991) defines a family 
of u-weighted universal portfolios and uses Laplace’s method of integration to show, for 
a bounded ratio of maximum to minimum daily asset returns, that $,/S* = c,/n"- ? 
for m stocks, where c, is the determinant of a certain sensitivity matrix measuring the 
empirical volatility of the price sequence. Merhav and Feder (1993) establish polynomial 
bounds on S,,/S* under the same constraints. 

The first individual sequence (worst-case) analysis of the universal portfolio of Cover 
(1991) is given in Cover and Ordentlich (1996), where it is shown that a Dirichlet(4) 
weighted universal portfolio achieves a worst case performance of $,/57 = c/n'"- !!?, 
This analysis is also extended to investment with side information, with similar results. 
Jamshidian (1992) applies the universal portfolio of Cover (1991) (with x uniform) to a 
geometric Wiener market, establishing the asymptotic behavior of §(t)/S*(t), and show- 
ing (1/t) log §(t)/S*(t) > 0, for such markets. 

The paper is organized as follows. Section 2 establishes notation and some basic defi- 
nitions. The individual-sequence performance and game-theoretic analysis are established 
in §3. Section 4 contains the hindsight allocation option pricing analysis. 


2. Notation and definitions. We represent the behavior of a market of m assets for 
n trading periods by a sequence of nonnegative, nonzero (at least one nonzero component) 
price-relative vectors xi, ..., x, € RY. We refer to x” = x,,..., x, as the market 
sequence. The jth component of the ith vector denotes the ratio of closing to opening 
price of the jth asset for the ith trading period. Thus an investment in asset j on day i 
increases by a factor of xj. 

Investment in the market is specified by a portfolio vector b = (bi, ..., ba) with 
nonnegative entries summing to one. That is, b € B, where 


s-[»isenniEn-i]. 
j=l 


A portfolio vector b denotes the fraction of wealth invested in each of the 7 assets. ^n 
investment according to portfolio b; on day i multiplies wealth by a factor of 
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m 
t — 
bix; = Y, byx;. 
j=l 
A sequence of n investments according to portfolio choices b;, ..., b, changes wealth 


by a factor of 


A constant rebalanced portfolio investment strategy uses the same portfolio b for each 
trading day. Assuming normalized initial wealth Sọ = 1, the final wealth will be 


S, (x", b) = [[bx. 


i=) 


For a sequence of price-relatives x” it is possible to compute the best constant rebalanced 
portfolio b * as 


* = arg max S,(x", b), 
be? 


which achieves a wealth factor of 


S*(x") = max S,(x", b). 
bez 


The best constant rebalanced portfolio b * depends on knowledge of market performance 
for time 1, 2,..., n; it is not a nonanticipating investment strategy. 
This brings up the definition of a nonanticipating investment strategy. 


DEFINITION ]. A nonanticipating investment strategy is a sequence of maps 
b RI >B, i-L2,:-— 
where 


b; = bi(Xi,..., Xii) 


i—i 


is the portfolio used on day ¿į given past market outcomes x^! = Xj, ..., Xii. 

3. Worst-case analysis. We now present the main result, a theorem characterizing 
the extent to which the best constant rebalanced portfolio computed in hindsight can be 
tracked in the worst case. Our analysis is best expressed in terms of a contest between an 
investor, who announces a nonanticipating investment strategy b; (^), and nature, who, 
with full knowledge of the investor's strategy, selects a market sequence x" = xi, X2,..., 
x, to minimize the ratio of wealths $,(x")/$* (x"), where $,(x^) is the investor's wealth 
against sequence x" and is given by 
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Thus, nature attempts to induce poor performance on the part of the investor relative to 
the best constant rebalanced portfolio b * computed with complete knowledge of x". The 
investor, wishing to protect himself from this worst case, selects that nonanticipating 
investment strategy b; (-) which maximizes the worst-case ratio of wealths. 


THEOREM | (Max-min ratio). Form assets and all n, 


max min $00) 
1 = ay 
b e SEED 
where 
n n 

(2) V, = | x ( ) 2 —naHQnufu... e» ; 

nHO HRR Misses Am 
and 

H(pi,....p,)7 È p log p; 


j=l 


is the Shannon entropy function. 


REMARK. Form = 2, the value V, is simply 


z "EOD )- 


and it is shown in 83.2 that 2/vn + 1 = V, z 1/(2yn + 1) for all n. Thus V, behaves 
essentially like 1/Vn . For m > 2, V, ~ c(1/vn)" . 


REMARK. [Itis noted in $3.3 that 


Li m 
(2) (377 


in the sense that 


n =| 


lim 


er 
"RUPES 


For m = 2 this reduces to V, ~ /2/z (1/dn). 


REMARK. ‘The max-min optimal strategy for m = 2 will be specified in Equations 
(8)- (13). These equations are followed by an alternative definition of the optimal strat- 
egy in terms of extremal strategies. 

We note that the negative logarithm of the max-min ratio of wealths given by Equation 
(2) also corresponds to the solution of a min-max pointwise redundancy problem in 
universal data compression theory. The pointwise redundancy problem was studied and 
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solved by Shtarkov (1987) and earlier works referenced therein. A principal result of the 
present work, which is developed in greater information theoretic detail in Cover and 
Ordentlich (1996) and Ordentlich (1996), is that worst sequence market performance is 
bounded by worst sequence data compression. 

The strategy achieving the maximum in Theorem 1, as developed in the proof below, 
depends on the horizon n. We note, however, that Cover and Ordentlich (1996) exhibits 
an infinite horizon investment strategy, the Dirichlet- weighted universal portfolio, denoted 
by b?(-), which for m = 2 assets achieves a wealth ratio $2 (x^)/S (x^) satisfying 


ED n 
$90 ly 


ae Ue SEK") an 


At time i, the Dirichlet-weighted universal portfolio investment strategy uses the portfolio 


; "— J,bS-.(b,x^)du(b) — » 
( ) ; =D; (x )= f, Sa, x^ )dp(b) >» €= 1,2, 


where 


S;(b, x‘) = S(b)-[[b*x, and Sy(b, x°) = 1. 


j=! 


The measure p on the portfolio simplex Bis the Dirichlet (1/2, ..., 1/2) prior with density 


r(3) 


mol —]/2 m-l 
E X s) TI b; "db, 


j=l 


mot 
XbsL bz0 jolh..m-l 


j=! 


where [(:) denotes the Gamma function. The running wealth factor achieved by the 
Dirichlet-weighted universal portfolio through each time n is 


$2(x") -Í S, (b, x')du(b). 


Thus the max-min ratio can be achieved to within a factor of V2 for all n by a single 
infinite horizon strategy. The bound (4) generalizes to m > 2 so that for each m, the 
worst-case wealth achieved by the Dirichlet-weighted universal portfolio is within a con- 
stant factor (independent of n) of V,. 

The significance of Theorem | can be appreciated by considering some naive choices 
for the optimum investor strategy b. Suppose, for m = 2 assets, that b corresponds to 
investing half of the initial wealth in a buy-and-hold of asset 1 and the other half in a 
buy-and-hold of asset 2. In this case the first two portfolio choices are 
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f, | 1\' j xu X12 ‘ 
b = nam d 2 = - . 
(6) (5) and b (= +X Xu e) 


Since we are allowing nature to select arbitrary price-relative vector sequences, nature 
could set x, = (0, 2)' and x; = (2, 0)', in which case the investor using the split buy- 
and-hold strategy (6) goes broke after two days. On the other hand, for this two-day 
sequence, the best constant rebalanced portfolio is b* = (1/2, 1/2)! and yields a wealth 
factor S7 (x,, x;) of 1. 

Suppose the investor instead opts to rebalance his wealth daily to the initial (1/2, 1/2) 
proportions. Here b, is the constant rebalanced portfolio b — (1/2, 1/2)'. If nature then 
chooses the sequence of price-relative vectors x" = (2, 0)', (2, 0), ..., (2, OY the 
investor earns a wealth factor $,(x") = 1 while the best constant rebalanced portfolio b * 
—(l,0)'eams $*(x”) = 2". The ratio §,/S* of these two wealths decreases exponentially 
in n while the max-min ratio V, decreases only polynomially. In particular, the wealth 
achieved by the max-min optimal strategy is at least 2"/(2vn + 1) for this sequence. 

These two investment strategies are particularly naive. A more sophisticated scheme 
might start off with b, = (1/2, 1/2)' and then use the best constant rebalanced portfolio 
for the observed past. This scheme, however, is also flawed, since if nature chooses x, 
= (1,0), the investor would use b, = (1, 0)' the following day and then would go broke 
if nature set x» = (0, 1)'. One might think of fixing this scheme by using a time varying 
mixture of the (1/2, 1/2) portfolio and the best constant rebalanced portfolio for the past. 
However, this class of strategies also fails to achieve V,,. 

We now proceed with the proof of Theorem |. The following lemma is used. In the 
sequel we adopt the conventions that a/0 = œ if a > 0, and that 0/0 = 0. 


LEMMA |. f&n... a € 0,8, ..., B, = 0, then 
D’ a; a 
(7) - = min. 
i-i Bi i B; 
PROOF OF LEMMa I. Let 


. ay 
J= arg min #. 
j j 


The lemma is trivially true if a, = O since the right side of (7) is zero. So assume a, 
> 0. Then, if B, = O the lemma is true since both the left and right sides of (7) are 
infinity. Therefore assume a, 0 and B, > 0. Then 


a: 
Zr + E.A) 
an a; ay ay 


PE i JJ E 
/ / J l Nav 
e( tug, 


because 


M 
Sk 


DR 


which implies 
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aj. B 
a, B 


for all j. 


PROOF OF THEOREM |. For ease of exposition we prove the theorem for m = 2. The 
generalization of the argument to m > 2 is straightforward. 
Thus, for the case of m — 2 we must show that 


$a) 
max min " 
- S*(x ny 


where 


We prove that 


= (x^) 
v St) 


n? 


by explicitly specifying the max-min optimal strategy b. We define the strategy by keeping 
track of the indices of the terms in the product []'., bix,. For sequences j” € (1, 2)" let 
ni (j") and n;(j") denote, respectively, the number of |’s and the number of 2's in j”. 
That is, if j^ = (4, .... Jr), 


(8) n") =E =r), 


i=l 


where /(-) is the indicator function. Let 


jf") = mG") mon nig") nj") 
(9) wgn = v : ( ) | 


n 


Then, since je 15,» wCj") = 1, w(j") is a probability measure on the set of sequences 
j€í(1,2y.Forl <n, let 


(10) wg)- X WEF iets i) 


be the marginal probability mass of ji, ..., jj. This marginal probability may also be 
denoted by w(j'"', jj). Finally, define the nonanticipating investment strategy b, = bn, 
br)’ 


Eje aay! wG^ |, 1) IZ xy, 
Byte aye! w(j™') Hi Xij ' 


(11) by (xt) = 


and 
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12) bx!) = ABER: x s 

pt Ze ay w(j^!) Wiz: Xij 

with 

(13) b,-—w(1) and n= w(2). 


An alternative characterization of the max-min optimal strategy, which turns out to be 
equivalent to the above, is as follows. Break the initial wealth into 2” piles, one corre- 
sponding to each sequence j”, where the fraction of initial wealth assigned to pile j” is 
precisely w(j") as given in (9). Now invest all the wealth in pile j” in asset j, on day I. 
From then on, for each day i, shift the entirety of the running wealth for this pile into 
asset j;. Do this in parallel for each of the 2" piles j". We refer to the strategy used to 
manage pile j” as the extremal strategy corresponding to the sequence j”. 
The wealth factor achieved by the investor using (11) and (12) is 


S. = IT bix, 


i-i 
En wG ^. Van Het = | 

n + Sjea wG, 2)xn IEZI x; 
Djeg) w(j^!) Wii Xij 


7 Zieuay w(j^) Ilt Xij 
Dime (2y! w(j^ ') Izi Xij; 


f=| 


(14) = X wG»IIx 
j"e (1.2)" isl 
k k _ pN\n-k 
iter 
kao \” n 
where 


X(k) & py IH x 


PMG") Sk i21 


and (14) follows from a telescoping of the product. 

It is apparent from Equation (14) that the extremal strategy formulation of the max- 
min optimal strategy is equivalent to the portfolio formulation (8)—(13). The extremal 
strategies simply “‘pick off ° the product of the price relatives corresponding to the se- 
quence of assets with indices j”. Equation (14) represents the sum of the wealths obtained 
by the extremal strategies operating in parallel. 

Note that forO =k x n, 


k k EM k n—k 
(15) ( ) (* ) = max b^(1 — b)'7*, 
n n Oxbzxl 


Also note that S (x^) can be rewritten as 
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Six") = [Ib*x 


i=l 


»X [LF x, 


j'e(u 2)" i=l 


(16) 


= > b**(1- b*)"*X(k), 


k=0 


where b* = (b*, I — b*)' achieves the maximum in (16). 
Therefore, for any market sequence x”, Lemma 1 and the above imply that 


k E n—k 
. vzo (2) (5 *) X(k) 
S,(x") n n 


SH") — ZXLob**(1— b*)" X (k) 


k k n—k nck 
n n 
17 = V, min —— 
(17) "anas DEL — p*y'^ 


(18) 


N 


Vis 
where (17) follows from a combination of Lemma | and the cancellation of the sums of 
products of x;,, and (18) follows from (15). Since the above holds for all sequences x”, 
we have shown that 


OS) 
19 t mV. 
(19) max min S¥(x") n 


To show equality in (19) we consider the following possibilities for x". For each j” 
€ (1, 2)" define x"(j") = Xi), -9 Xa Ja)» as 


(1,0) if f= 1, 
(20) xi Ui) = "E 
(0,1) if j = 2. 


Let 
K = (x" (j^) Vu € (1, 2) } 
be the set of such extremal sequences x". 
An important property shared by all nonanticipating investment strategies b(-) on the 


sequences (20) is that 


(21) Y S,(x") = 1. 


Also note that, for x"(j") € K, the best constant rebalanced portfolio is easily verified to 
be 
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1 
b*(x"(j")) = s. (m G"), ni G^)» 


so that 
p dr» cm A ( j") mG") n ( j") nij 
sse = (8) (E 
n n 
_ wg") 
V, 
Therefore 
X S*( ") 1 
f(x")—-——-. 
xE V, 


Since the minimum is less than any average, we obtain equality in (19) from 


"100 Sr) Y SG) 
(22) BD ay Ge en 
(23) - 25 V 


de Seg SR 


l 


24 = - 
ve Ser SFA") 


(25) =V 
which holds for any b. Thus 


$,(x") 
max min cycn = V,. 


S3 Èx") 


Combining this with (19) completes the proof of the theorem. O 


Complexity. It appears from (11) and (12) that computing the max-min optimal port- 
folio requires keeping track of the products [Tiz1 x;; for each sequence j^ '. This quickly 
becomes prohibitively complex, since the number of such sequences is exponentially 
increasing in /. Fortunately, a simplification can be made. 

This follows from the observation that w( j”) defined in (9) depends on j” only through 
its type (m, (j^), n3(j")), the number of I's and 2's. This implies that w(j"^', ja), for 
fixed j,, is 2 function of j”~' only through (n, (j^^), m2(j""')). The same applies to 
w(j"!) = E, w(j^^', ja). Thus, by induction w(j^ !, jj) and w(j^ '), for all J, are 
constant Gar ^i with the same type. 

Using this fact, the numerator and denominator of (11) and (12) can be evaluated by 
grouping the products []/2| xj, according to the type of j^ '. More specifically, the nu- 
merator of (11), for example, can be written as 
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1-1 i-t 1-1 
Eo wi Oa, = Dwi E lx 
i=l k=0 


Je (ny! Poi) =k ind 


~ 
i 


Wt, (k, 1) Xi, (k), 


li 


x- 
I 
o 


where w; ,(k, 1) equals wCj^ !, 1) when n; (j^!) = k and 


im] 
Xo()-2 X [] xy. 


Solin FO Sk i=l 


The denominator can be rewritten in a similar way. 

It is now clear that only the quantities X,_,(k) need be computed and stored instead of 
the exponentially many products [i21 x,,. The complexity of this is linear in /, since there 
are only / such quantities. The simple recursions 


X((k) = xi Xi i(k — 1) + xo Xa (k) 
X0) = xi X.,(0) 
XC) = xu Xi — 1) 
suffice to update the X, , (k). 

The above generalizes in the obvious way to m > 2 assets resulting in a computational 
complexity growing like /”~'. Therefore, the max-min optimal portfolio is, in fact, com- 
putationally feasible for moderate m. 

3.1. Game-theoretic analysis. A full game-theoretic result can also be proved. Spe- 


cifically, we imagine the same contest as above, except that mixed strategies are allowed. 
The payoff function is 


As before, the investor and nature respectively try to maximize and minimize the payoff. 
Let §, denote the game when played with this payoff function. 

A mixed strategy for the investor is a probability distribution ?(b) on the space of 
nonanticipating investment strategies, b - (b, b. (xi),... xr ')). Similarly, nature's 
mixed strategies are probability distributions on the space of price-relative sequences and 
will be denoted by 2(x"). The following theorem can then be proved. 


THEOREM 2. The value of the game &, is 


max min EA(b, x") = min max EA(b, x") = V,, 
Ab) 40x") ax") AD) 


where V, is given by (2). Further, the investor's optimum strategy P* is the pure strategy 
specified by (8) - (13). 


Proor. We prove this for m = 2, the generalization being obvious. The pure strategy 
?* is precisely the max-min optimal strategy (8) - (13) achieving the maximum in The- 
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orem 1. Nature's optimum mixed strategy 2* (for m — 2) consists of choosing sequences 
from 


K= "PI P| {1,2}"} 
according to the probability distribution w( j”) given by (9). The proof of 


(26) min max EA(b, x") = V,, 


x") AB) 


follows from Equations (22) through (25). The theorem follows from (26) and (19). 
The full game-theoretic analysis brings out a nice symmetry between the optimal in- 
vesiment strategy and nature's optimal strategy. The optimal investment strategy ?* is a 
pure strategy constructed from the distribution w(j") on binary strings given by (9). 
Nature's optimal strategy, on the other hand, is to choose 0-1 price-relative vectors at 
random according to this same probability distribution. 
This analysis generalizes to games with payoff 


z $, (x" 
Ag(b, x") = (S) ; 


for which the following holds. 


THEOREM 3. For concave nondecreasing $, the game §,(¢) with payoff Agh, x") 
has a value V($,($)) given by 


V($,(5)) = PV), 
where V, is given by (2) and the optimal strategies are the same as those for §,. 


3.2. Bounds on V,. We prove the following lemma for m = 2. 


LEMMA 2. Foralln, 


< V, = 7 
Wn +1 vn +1 
PROor. We first prove the lower bound. In Cover and Ordentlich (1996), a sequential 
portfolio selection strategy called the Dirichlet( 1/2, ..., 1/2) weighted universal port- 


folio was shown to achieve a wealth $P(x") satisfying 


CRY l 
mi = 


n $ ny [ : 
v SX) na I 


Therefore 
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oS. X7) 
V, = max min C. — 
b x" 57 (x ) 
S» x^) 


z min F 
x SE(x") 


I 


Wnt 


= 


proving the lower bound on V,. 
We now establish the upper bound. Write 1/V, as 


wr BGS) 


T(in+1) 2 k'(n — ky" 


n" È Tks Din key’ 


k=0 


(27) 


where I(x) = á t" l'edristhe Gamma function. If x is an integer, then P(x + 1) = x!. 
In Marshall and Olkin (1979), it is shown that (xi, x3)  (xtx3)/(F(x, + 1) (xo + 
1)) is Schur convex. This implies that under the constraint x, + x» = n, it is minimized 
by setting x, = x, = n/2. Therefore, each term in the summation (27) can be bounded 
from below by 


ni2? u/2 
I ctt 
k*(n — ky" u 2 2 
I(& - ID)T(n— k + 1) Pa r+] 
2 2 
to obtain 
ni2 ni2 
m on 
l T(n+ 1 2 2 
J Roe o 15 
V, BIS P iom l 
2 2 


= (n+ IMa +1) 
w + J 


The identity (see Rudin (1976)) 


2" n+l n 
(28) I(n+1)= r( y +1), 
in 


can now be applied to obtain 
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(5 -) 
MEE EMT 2 


V dn (241) 
2 


(29) 
The log convexity of (x) (see Rudin 1976) now implies that 
1/2 1/2 
p qe ep tu ree: 
2 202 2.32 


n l n+} 
3 -Dz-z ; 
(393 (5 ;) 2 


where we have used the identity "(x + 1) = xI'(x). Combining (29) and (30) we obtain 


Ii n+l S 
V, n dim 
2 
| f2(n + 1) 
E 7 
thereby proving that 
V, - z 


foralln. O 
This bound can be generalized to m > 2 with the help of 


'(" + J 
m m 
=] 


i 
m 


3.3. Asymptotics of V,. The following lemma characterizes the asymptotic behavior 
of V, for m stocks. 


I(n+ 1) =m" 


an extension of (28) to general m. 


LEMMA 3. For all m, V, satisfies 


(31) V, 


(3) ey" 


in the sense that 
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li : =I 
im . 
om n 2 (n-1y/2 

2 n 


The quantity V, arises in a variety of settings including the max-min data compression 
problem (see Shtarkov 1987), the distribution of the longest common subsequence be- 
tween two random sequences (Karlin 1996), and bounds on the probability of undetected 
errors by linear codes (see Klove 1995, Massey 1978, and Szpankowski 1995). Lemma 
3 1s proved in Shtarkov, Tjalkens, and Willems (1995) and an asymptotic expansion of 
V,, to arbitrary order is given in Szpankowski (1995, 1996). A direct proof of Lemma 3 
based on a Riemann sum approximation is given in Ordentlich (1996). 
In addition, Shtarkov (1987) obtains the bound 


v, = È C) a HMT 


implying one half of the asymptotic behavior in Equation (31). 


4. The hindsight allocation option. The results of the previous section motivate the 
analysis of the hindsight allocation option, a derivative security which pays S7 (x^), the 
result of investing one dollar according to the best constant rebalanced portfolio computed 
in hindsight for the observed market behavior x”. Let 


Certainly the price of the hindsight allocation option should be no higher than H,. This 
follows because H, dollars invested in the nonanticipating strategy described in the proof 
of Theorem 1 is guaranteed to result in wealth at time n no less than S; (x") for all market 
sequences x”. If the price of the hindsight allocation option were more than H,, selling 
the option and investing only H, of the proceeds in the above strategy would be an 
arbitrage. Note that this argument assumes the existence of a riskless asset for investing 
the surplus. 

Therefore, H, is an upper bound on the price of the hindsight allocation option valid 
for any market model (with a risk free asset). Furthermore, while the return of the best 
constant rebalanced portfolio is expected to grow exponentially with 2, the upper bound 
on the price of the hindsight allocation option H, behaves like Vn. This polynomial factor 
is exponentially negligible relative to S#. 

Is H, a reasonable price for the hindsight allocation option? Probably not; the price 
should be lower. Pricing the option at H, may be appropriate if no assumptions about 
market behavior can be made. This is the case in $3, where no restrictions are placed on 
nature's choice for the market behavior. Returns on assets can be arbitrarily high or low, 
even zero. Actual markets, however, are typically less volatile. We gain more insight into 
this issue by using established derivative security pricing theory to determine the no- 
arbitrage price of the hindsight allocation option for two much studied models of market 
behavior, the binomial lattice and continuous time geometric Brownian motion models. 


4.1. Binomial lattice price. We consider a risky stock and a riskless bond. Accord- 
ingly, the price-relatives x, are assumed to take on one of two values 
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x E€{(l+u,l4+r)',(1+d,1+4+nr)} 


with r = 0, u > r > d. The first component of x; reflects the change in the price of the 
stock as measured by the ratio of closing to opening price. The second component indicates 
that the riskless bond compounds at an interest rate of r for each investment period. The 
parameters of the model are thus u, d, and r. If the stock price changes by a factor of | 
+ u it has gone ‘‘(u)p’’; if it changes by a factor of 1 + d it has gone ‘‘(d)own.”’ 

We will find that the no-arbitrage price H, of the hindsight allocation option for this 
model is closely related to H,, the upper bound obtained in the previous sections. It will 
be apparent that for certain choices of d, u, and r, the upper bound H, is essentially 
attained. 

For a sequence of n price-relatives x” = x,,..., x,, the wealth acquired by a constant 
rebalanced portfolio b = (b, 1 — b)' can be written as 


$,(b) = [1 € rt b(u — r)' TI + r+ b(d r)", 


where k is the number of vectors x; for which x;, = | + u. Since log S,(b) is concave 
in b, the best constant rebalanced portfolio b* = (b*, 1 — b*)' is easily determined using 
calculus. For 0 « k « n, define b* as the solution to 


dlog S,(b) _ 


db 0. 


It is given by 


pe 0n k —) 


n r-d u-r 
For k = 0, set b* = 0, and for k = n, set b* = 1. Then b* is given by 
b* = max(0, min( 1, 6*)). 


We then obtain the wealth achieved by the best constant rebalanced portfolio as 


S*(x") = [1 +r + b*(u — r) + r+ b*(d — r)]"* 


(1 +r)" if b* - 0, 
= 4 (1 t u)*(1 + dy" if b* =1, 
[Lr b*Qu — nlt re b*(a-rn]"7* if O<b* <1. 


If 0 < b* < 1, the wealth achieved can be written more explicitly as 


k n—k k 
S¥*(x") (1 trt+(1¢4 [x vE n) 


k n—k n=k 
(1 rAd ^ tana | n) ` 


which simplifies to 
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. kN (n—kN*/u —dVfíu— dV 
sse -ae»(2)( n (4-4 (i ` 


It is well known that for this model the no-arbitrage price P, of any derivative security 
with payoff S, at time n is given by 


i 


=— ge 
ü ry (Si), 


P, 


where the expectation is taken with respect to Q, the so called equivalent martingale 
measure on asset price changes. The unique equivalent martingale measure for this market 
is a Bernoulli distribution on the sequence of ‘‘up’’ and ‘‘down’’ moves of the asset price 
with the probability of an **up'' equal to p, = (r — d)/(u — d) and the ''down" probability 
equal to p, = 1 — (r — dY(u — d) = (u — rY(u — d). 

We note that for the case of 0 < b* < ] 


k k EN k n—k 
S*(x") = (1 f »( (* p.poU 
n n 


A S¥(k). 
Therefore, 
ES?) 
"no (1 4 r)” 
-—— y seo (7 pirr pt X da (T pio 
CL D poche kp C1 +7)" peo k 
+ | Y Gal ear pipi“ 
CL n uc kj ort 


which simplifies to 


n k k n— k n-k n 
uU X Y | ( Jeter 
kOcb*«l k n A P k i 
n 1+u\\4 14d\\rr4 
+ . 
x eye) 


The range of k such that 0 < b* < I is 


eke ut | 
Pu n P EE] i 


Thus 
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nV f kN f n — kNI7* n i 
A, = 2 G J( ) EÈ ( Joti 
pamkn puit 1 (r1) n n kInzep, k 
n | -uNM lL+da\\r* 
ae be) rz) 5) - 
ki nz ufu Vy d) 


It 1s useful to note that 


Pu 


This implies that 


k n-k 
nif Kk n-k 
Pu kingpin t Yr 1) k n n 


and 


nV kN fn — kV 
A, E B ( ( ( | 3 
Pukin gpu rti) k n n 


Notice the similarities between these bounds and the expression for V, — 1/H, given by 
(3). It is possible to choose r, u, and d so that p, < 1/n and p,((u + 1)/(r € 1)) » (n 
— 1)/n, in which case the value of the hindsight allocation option is at least H, — 2. 

In summary, the no-arbitrage price H, of the hindsight allocation is given by 


n k k n— k n-k a 
n - ( n Y E oÈ ( Jip" 
pyzkinpyu MG 1) k/ Nn n kins py k 
n {+u\\* ltd\\rk 
+ 2 WC )) (edi )) 
KInzepyuct Vf Gr M) +r Em 


where the first summation comprises the bulk of the price for reasonable parameter values. 
The terms appearing in this sum are identical to those in the expression (3) for V, — 1/ 
H,. The number of such terms appearing in the sum depends on the parameter values. A 
Reimann sum approximation argument shows that for fixed parameter values H, 
~ cvn, where the constant c depends only on the parameters. 


4.2. Geometric Brownian motion price. In this section, we give the price of the 
hindsight allocation option for the classical continuous time Black-Scholes market model 
with one stock and one bond. The stock price X, follows a geometric Brownian motion 
and evolves according to the stochastic differential equation 


dX, = uX,dt + oX,dB,, 


where u and o are constant, and B is a standard Brownian motion. Note that here X, 
denotes a price, not a price-relative. The bond price 6, obeys 
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df, = Brat 
where r is constant and therefore 
B, = e". 
Let S,(b) be the wealth obtained by investing one dollar at t = 0 in the constant rebalanced 


portfolio b = (b, | — b)', where b is the proportion of wealth invested in the stock. Then 
S, (b) satisfies the stochastic differential equation 


dS, (b) dX, d, 
32 = t(l—-b ; 
(32) S(b) X, ) B, 
which can be solved to give 
b?c?t X, ct 
- - + + + . 
(33) S(b) ap 2 biog X; 2 ) (1 bre) 


That this solves (32) can be verified directly using Ito's lemma (see Duffie 1996, Karatzas 
and Shreve 1991). Notice that, for fixed c? and r, the wealth S, (b) depends on the stock 
price path only through the final price X,. 

The best constant rebalanced portfolio in hindsight at time 7 is obtained by maximizing 
the exponent of (33) for £ = T under the constraint that O = b = |. This results in 


o? 


3 l 1/T » E 
(34) bł = max (0 min( 1.5 + ( ) log Xo/ Xo) =)) , 


The wealth achieved by the best constant rebalanced portfolio is then obtained by eval- 
uating (33) at b = b# resulting in 


e” if bž=0 
SŽ =S/(bF) = l eTO if Oxbt =) 
X ' 
x if be =]. 
0 


From the martingale approach to options pricing, the no-arbitrage price at ¢ = 0 of the 
hindsight allocation option with duration T is given by 


Sr(b7) 
Br 


Hor = BoEg 


(35) | 
= e^" EoS, (bf), 


where Q is the equivalent martingale measure or the unique (in this case) probability 
measure under which X,/£, is a martingale, and assuming that S1 (b7) is integrable under 
Q, which it is. 

It is well known (see Duffie 1996) that under the equivalent martingale measure Q the 
stock price X, obeys 


dX, = rX,dt + o X,dB,. 
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This and Ito’s lemma imply that under Q, the expression log( X7/Xo) appearing in the 
exponent of (33) is normally distributed with mean (r — (1/2)o*)T and variance c?T. 
Therefore, the random variable 


log(X7/X,) — (r — (1/2)07)T 


yo?T 


YA 


is standard normal. It can be rewritten as 


c? 


(36) y= dar} a GP esa 7) 


so that, by equation (34), 


Yo ?T(b?) = max(0, min(Vo?T, Y)). 


Equation (36) can be solved for X7/X; resulting in 


Xr = ghe Tercer 


Xo 
The expectation (35) is then easily evaluated as 
e" Eg S; (bX)] = (Eol = 0)] + Eole”? (Y € (0, V o?7])] 
em + Eole TTRY > VPT). 


The first expectation is clearly equal to 3, since Y is standard normal. The middle expec- 
tation is 
Eje! "^ (Y € (0, V o?T])] = i 


] (ver 

2m * 0 
oF 
2m ` 


F 


335. a, 
e? 26 y 2 dy 


Finally, the third expectation is 


y 2 1 2 o? T-To^2. —y2, 
Eole” Ta RY > [e?ry] -=Í e T-T 2, Y gy 
EET. 
1 [* |. DoT 
=— € : dy 
TEES. 


T 
> 


232 E. Ordentlich and T. M. Cover 


COST OF ACHIEVING THE BEST PORTFOLIO IN HINDSIGHT 981 


Thus (37) reduces to a surprisingly simple form. The no-arbitrage price Ho; of the hind- 
sight allocation option is 


Hy 244A. 


The price is affinely increasing in the volatility ø and increases like the square root of the 
duration T. The dependence on duration matches the dn growth of the discrete-time upper 
bound H, and the binomial lattice price H,. If the hindsight allocation option payoff is 
redefined to be Sz (x") — e’ (the excess return of the best constant rebalanced portfolio 
beyond the return of the bond) then the price is simply ¥o?T/(27). This can be thought 
of as a premium for volatility. 


5. Conclusion. The worst sequence approach to the problem of achieving the best 
portfolio in hindsight leads to a favorable result: the max-min optimal portfolio strategy 
for m assets loses only ((m — 1)/2) (log n)/n in the rate of return in the worst case. This 
yields an asymptotically negligible difference in growth rate as the number of investment 
periods n grows to infinity. In practice we would expect even better performance, since 
real markets are less volatile than the max-min market identified here. This intuition 1s 
partially validated by the hindsight allocation pricing analysis for the binomial and geo- 
metric Wiener market models which indicates that the cost of achieving the best portfolio 
in hindsight depends monotonically on market volatility. 
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OPTIMAL STRATEGIES FOR REPEATED GAMES 


MARK FINKELSTEIN* AND 
ROBERT WHITLEY,* University of California, Irvine 


Abstract 


We extend the optimal strategy results of Kelly and Breiman and extend the 
class of random variables to which they apply from discrete to arbitrary 
random variables with expectations. Let F, be the fortune obtained at the nth 
time period by using any given strategy and let F* be the fortune obtained by 
using the Kelly-Breiman strategy. We show (Theorem 1(i)) that E,/F* is a 
supermartingale with E(F./F*) 1 and, consequently, E(lim F,/F*) S1. This 
establishes one sense in which the Kelly-Breiman strategy is optimal. How- 
ever, this criterion for ‘optimality’ is blunted by our result (Theorem 1(ii)) that 
E(F,/F*)=1 for many strategies differing from the Kelly-Breiman strategy. 
This ambiguity is resolved, to some extent, by our result (Theorem 2) that 
F*JF, is a submartingale with E(F*/F,)z1 and E(lim Ff/F,)z1; and 
E(F*/F,)=1 if and only if at each time period j,1Sj 3n, the strategies 
leading to F, and F* are ‘the same’. 


KELLY CRITERION; OPTIMAL STRATEGY; FAVORABLE GAME; OPTIMAL GAMBLING 
SYSTEM; PORTFOLIO SELECTION; CAPITAL GROWTH MODEL 


1. Introduction 


Suppose a gambler is given the opportunity to bet a fixed fraction y of his 
(infinitely divisible) capita! on successive flips of a biased coin: on each flip, 
with probability p 3 he wins an amount equal to his bet and with probability 
q=1-—p he loses his bet. What is a good choice for y and why is it good? 

This question is subtle because the obvious answer has an obvious flaw. The 
obvious answer is for the gambler to choose y —1 to maximize the expected 
value of his fortune. The obvious flaw is that he is then broke in n or fewer 
trials with probability 1— p", which tends to 1 as n tends to o. 

A germinal answer was given by Kelly [10]: a gambler should choose 
Y 7p-q so as to maximize the expected value of the log of his fortune. He 
shows that a gambler who chooses y = p —q will ‘with probability 1 eventually 
get ahead and stay ahead of one using any other value of y’ ([10], p. 920). 

In an important paper Breiman [4] generalizes and considers strategies other 
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than fixed-fraction strategies and generalizes the random variable as follows: 
Let X be a random variable taking values in (1,2, :::, s) - I, € be a class 
(A, Az 7^, A,} of subsets of I whose union is I, and 0,, 02, * ' * , 0, be positive 
numbers (odds). If for one round of betting a gambler bets fractional amounts 
Bi, B5, , B, Of his capital on the events {X € Aj], =: -, (X € A,], then when 
X =i he gets a payoff of } B,o, summed over those j with i in A, In this setting 
Breiman discusses several ‘optimal’ properties of the fixed-fraction strategy 
which chooses B,, 85, - - -, B, so as to maximize the expected value of the log of 
the fortune and then bets these fractions on each trial, leading to the fortune 
F* at the conclusion of the nth trial. He shows that if F, is a fortune resulting 
from the use of any strategy, then lim F,/F* almost surely exists and 
E(lim F,/F*) 1. In what follows we shall be concerned solely with magnitude 
results, like this asymptotic magnitude result of Breiman's, but the reader 
should be aware that under additional hypotheses Breiman also shows that 
T(x), the time required to have a fortune exceeding x, has an expectation 
which is asymptotically minimized by the above fixed-fraction strategy. 

The problem of how to apportion capital between various random variables 
is exactly the problem of portfolio selection, and so it is correct to suppose that 
these results on optimal allocation of capital are of considerable interest to 
economists, as Kelly recognized ([10], p. 926). He also prophetically realized 
that economists, familar with logarithmic utility, could easily misunderstand his 
result and think, incorrectly, that the choice of maximizing the expected value 
of the log of the fortune depended upon using logarithmic utility for money. 
For discussion see [15], p. 216 and [17]. An interesting concise discussion of 
the ‘capital growth model of Kelly [10], Breiman [4], and Latané [11]’ from an 
economic point of view can be found in [3]. 

A brief discussion of Kelly’s proof will motivate his criterion and allow us to 
make an important conceptual distinction between his results and Breiman’s. 
Suppose a gambler bets the fixed fraction y of his capital at each toss of the 
p-coin. Kelly considers the exponential growth rate 


G =lim log [(F,/Fo)””"]. 
If our gambler has W wins and L losses in the first m trials, F, = 
(1+ y)" (1— y)-Fo, so 
. (W L 
G - lim (tog (1+) +—log 1.9) ) = p log 19) +4 log 19) 


by the law of large numbers. The growth rate G is maximized by y = p — q, and 
if he uses another y his G will be less and therefore eventually so will his 
fortune. A complication enters when we consider, as Kelly did not, strategies 
which are not fixed-fraction strategies. In that case we can have different 
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strategies with the same G, e.g., use y= 1 for the first 1000 trials and then use 
y 7 p — q. This complication is intrinsic in the use of G and it has consequences 
which are quite serious for any application. For example, two strategies which 
at trial n give fortunes, respectively, of 1 and exp (Vn), both have G = 1! It is 
obviously unsatisfactory to regard these two strategies with the same G as 'the 
same', but it is done because using G makes it easy to extend the Kelly results 
to more general situations which involve more general random variables; using 
G the argument is a simple one employing either the law of large numbers or 
techniques which 'rely heavily on those used to generalize the law of large 
numbers' ([15], p. 218). Breiman understood the problems created by using G 
and so he considered F,/F*, not (F,/F*)'". This is mathematically more 
difficult, but the results are more useful. 


2. Definitions and lemmas 


We shall consider situations with the property that at each time period a 
gambler can lose no more than the amount he invests, e.g., buying stock or 
betting on Las Vegas table games. Since there is a real limit to a gambler's 
liability, based on his total fortune, a broad interpretation of the phrase ‘the 
amount he invests’ will allow the inclusion of such situations as selling stock 
short or entering commodity futures contracts. 

We suppose that there are a finite number of situations 1,2, «+ +, N on which 
a gambler can bet various fractions of his (infinitely divisible) capital. The 
random variables X,, X5, -:-, X, represent, respectively, the outcome of a 
unit bet on situations 1,2,**:, N. Because the loss can be no more than the 
investment, X, 2-1 for 1 £k £N. (Breiman considers the amount returned to 
the gambler after he has given up his bet in order to play, a real example of this 
sequence of events being betting on the horses. Here the amount the gambler 
gets back is 20, which corresponds to the amount he wins being z —1). We 
further suppose, with no loss of applicability, that in all of what follows each X, 
has an expectation, i.e., that E(|X,) is finite. These will be the only restrictions 
on the random variables, and so we are considering a substantially larger class 
than those discrete random variables Breiman considers. 

We also suppose that the gambler can repeatedly reinvest and change the 
proportion of the capital bet on the situations. The outcome at time j corres- 
ponds to the random variables X9, XU, ---, XË. For each k, 15k £N, the 
results of repeated betting of one unit on the kth situation is a sequence 
XW, XQ,---, X$,--- of independent random variables, each having the 
same distribution as X,. In contrast to this independence, it is quite important 
for applications that X,, X5, --*, Xy be allowed to be dependent. 
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A strategy for the game will be a sequence y, - - - , 49, ++ of vectors, 
y - (Q7, 489, +--+, 48?) giving the fractional amount y("? of the capital 
which at the mth bet is bet on the kth situation. Thus y(" z0, 15 k € N, and 
ENa yg s1. We allow the possibility that y can depend, as a Borel- 
measurable function, on the past outcomes Q ll XO, 

Qi MOD oe XI, e, KY, (Breiman includes the sure-thing bet 
X921, so that betting yo on X, is the same thing as putting yo aside; in this 
way his y's always sum to 1. We shall not do this.) 

Letting E, be the fortune which is the result of m bets using 


y, yD, 4", and F, be the initial fortune, 


, r-n fi $ we] 

j=1 k=1 
To simplify the notation, let X” —(X9,-.-, X9) and denote the scalar 
product with y? 2 (y^, yP,- +, YP) by y? XP, obtaining 


(2) F,=Fo] | ny. xo). 
jul 

A. fixed-fraction strategy is a strategy y"? = (yi, yo, *** , Yn) which bets the 
same amount y, on situation k for all m. The result of using y = y?, 1S js m, 
for m bets is F,, = Fo [IN (1 y : X9). We shall be particularly interested in 
‘the’ fixed-fraction strategy y*=(y*, y¥,---,y%) which maximizes 
E(log (F,,)). In Lemma 3 we shall show that y* exists, and Lemma 1 shows in 
what sense it is unique. The strategy y* maximizes E(log (F,,)) if and only if it 
maximizes the function 


(3) oly) = 6Qi, +, Yn) = E(log (1+ È yv,X,)) = E(log (1+-y + X) 
over the domain 
(4) D-(vy,: YN): «EO 1SksN, È w S1). 
Lemma 1. 
(i) The function $ of (3) is concave. 
(ii) If d(aa+(1—a)B) = aé(a) - (1 a)6(B) for 0a«1 with $(a) and 


(B) finite, a: X - B- X almost surely. In particular, if a =(a,, a2,°°*, ay) 
and B =(B,, B2, ***, By) both maximize œ over its domain D, then Y a, X, = 
¥ BX, a.s. 

(iii) At y=(y1,°°°, Yn) in D with Yy,<1, the partial derivative 44/dy; 
exists and equals E(X,/(1+y- X), 1SiSN. 

Proof. 

(i) The function f(x)=log (1- x) is strictly concave on (—1, œ), and so for 

x =(x,,°°', Xx) a value of X, a and B in D, and 0<a<1, 


(5) f(aa : x «(1-a)8 : x) &af(a- x) c (1- a)f(8 - x), 
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an inequality which also holds if either a > x or 8 : x is —1. Integrating (5) with 
respect to the probability measure P of the space on which X is defined, 


(aa +(1-a)8)= [f(aa + X «(1— 8 + X) dP 


z [lafla X)+(1-a)f(@ + X) dP = a6(2)* 1 6 (9) 


(ii) Since $ is concave the set where it attains its max is convex. If 
$(a)-4(8) is a max then for O<a<1, dé(aa-*t(1-a)gp)- 
a$(a) * (1— a)$(B), or 


(6 [(f(a «x +(1-a)B - 20-af(a - 20-1 - a)f(8 : X) dP =0. 


From (5) and (6), 
(7) f(aa - X*(1-a)B: X) - af(a: X) *(1-a)f(8: X) as. 
Because f is strictly concave, aa : X+(1-a)B :X=a:X=8:X at all values 
of X where f is finite. Both sides of (7) are —e only at values of X where 
a: X-g-X--1.Inany case, a: X— 8 * X almost surely. 

(iii) Choose e >0 so that Y y, « 1— €. Then 1/|1 +y: X| 3 1/e. The difference 
quotient for àd/à-y is 


(8) log (14 Y. wXe + (41 +Ay)X,)—log (1+5 X) 
[omae 
Ay; 
Let x —(x,,:::,xy) be a value of X and consider the function g(y,)= 


log (1+ X,4; VX, + yx). By the mean value theorem, 


[sacro £00 Ixl 
Ay, LHE yox, + &x| ' 


0« & <Ay,. So the integrand in (8) is dominated by the L’ function |X;|/ for 
Ay, small. Result (iii) follows from the Lebesgue dominated convergence 
theorem. 


Here is a simple example which conceptually illustrates a practical use of the 
Kelly-Breiman criterion: maximize E(log F,,). 


Example 1. Define two random variables X, and X, by flipping a fair coin: 
if heads, then X,=100 and X,=-10, if tails, then X,— —1 and X,=1. The 
payoff from X, is far superior to the payoff from X5, but because X; and X, 
are (completely) correlated and have payoffs with opposite signs, the criterion 
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will mix both in order to smooth out the rate of capital growth. A simple 
calculation shows that $(y) = E(log (1+ y, X, + y;X;)) is maximized over D on 
the face y, ^ y; — 1 at y¥=0.54 and ys =0.46. (Lemma 3 will discuss the basic 
problem created by maxima occurring at non-interior points of D.) The extent 
to which the criterion will sacrifice expectation is surprising: E(0.54X, + 
0.46X3) 224.7 vs. E(X,) - 49.5. 

A fascinating example of the use of this criterion, in which the underlying 
idea is the same as this example, is in hedging a warrant against its stock as 
described in [15], pp. 220-222. 


Example 2. For A0, let X have density Ae^^**? for x z—1, and 0 for 
x «—1; an exponential shifted to allow losses. We shall show that there is a 
unique y*, 0s y* « 1, which maximizes (y) = E(log (1+ yX)), OS y 81; y*= 
0 iff Az 1. 

By Lemma 1, 


°” x 
' = Ae ^ 9*9? . 
(y) l I+% e dx 


Further, 


$"(y) = [ TEN qiie ae <0, 


so œ is strictly concave on [0,1). Since $'(0)2 E(X)-A !-1, the strict 
concavity of @ shows that y* —0 iff $'(0) 0. 

It remains to show that for 0A «1 there is a unique maximizing point y* 
with 0 « y* « 1. By a change of variable, $'(y) = (A/y)e*[g(a) — E1(a)], where 
a-A((1/y)-1), g(a)=e-/(a+A), and the exponential integral El(a)- 
fae tdt. Since E1(0) 2o and g(0)=1/A, $'(y) «O0 for y close to 1; as $ is 
strictly convex, there is thus a unique point y*,0« y* «1, at which 4 is 
maximized. 

For future reference we note that it is not obvious that is continuous at 1; 
part of the computation involves an integration by parts and a change of 
variable to obtain o(y)=log(1—-y)+e°E1(a), a as above. The expansion 
E1(a)-e *(-loga- yo*-0(a)) ([1], p. 229), where y9-0.577- -- is Euler's 
constant, shows that lim, ,, $(y) = —log (A) — yo. 

In Example 2 y*=0 iff E(X)S0, i.e., a gambler bets on X only if it has 
positive expectation. This is a special case of a more general result. Breiman 
[4], p. 65, calls a game favorable if there is a strategy such that the associated 
fortune F, tends almost surely to © with n, and he shows that this condition is 
equivalent to $(y") being positive ([4], Proposition 3, p. 68). Lemma 2 
establishes the equivalence with the intuitive Condition (iv). 
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Lemma 2. The following are equivalent: 
(i) There is a strategy with the associated fortune 


F,—9 as. as no 


(ii) F* >œ as. 

(iii) 6(y*) » 0. 

(iv) E(X,)>0 for at least one i, 18i £N. 

Proof. 

(i) implies (ii). In Theorem 1 we show that F,/F* tends almost surely to 
a finite limit. 

(ii) implies (iii). If oy") = 0, then F* = F, for all n. 

(iii) implies (iv). If y* - X=0, then $(y*) 2 0. So y* >0 for some i with X, 
not identically 0. Fix all variables in 4 but y and set W(y,)= 
olyf, y3, ys yo Yf’ tte, 7%), a concave function which has a positive 
max at y*. If E(X,) €0, then V'(0) £0, and Y has a local max at 0 and so has a 
global max there because it is concave. Hence E(X;) » 0. 

(iv) implies (i). Define Y by setting all the variables but y; equal to 0 in 
$: Y(0,0,:::,0, 4,0,:::,0). As E(X,)>0, V'(0)—0 and so V(y,)>0 for y; 
close to 0. Thus $(y 9-0. Since log (F*/Fy) =Y} log(1+y*+ X9?) and 
E((log F*/F)/n) = $(y*) »0, the strong law of large numbers shows that 
F*/F,— œ almost surely. 


Example 3. Let X, and X, be the coordinates of a point distributed 
uniformly on [-1, b] x(-1, b]. Then é(y,, yj) = E(log (1+ y,X, + y2X2)) has a 
maximum at y* = (2,3) if b zlog (16) 1. 

If (y,, Y2) is a point where ¢ attains its max, then so is (y2, yi) by symmetry. 
Since ¢ is concave, (ó(ys, Y2) + 6 (yo, ¥1)) 8 GY + Y2), (yi + Y2), and we 
may look for the maximum of ¢ along the diagonal (y, y), OS y x1. Then 


b X, +X, 


d 1 b 
a, YY) = gi J. J. (19 05 x) 


The second derivative is «0, and 4' is decreasing. A direct but tedious 
integration and calculation shows that 


i denen (269) 


dx, dx». 


Hence ó(v, y), which can be shown to be continuous on [0, H increases up to 
its max at (3, 3) as long as b zlog(16)-1- 1.77 - 

It is interesting to compare this situation with betting on only one variable, 
say Xj. Then ¢,(y,) = E(log (1- y, X,)) is continuous on [0, 1]. Continuity on 
[0, 1) follows from Lemma 1 or inspection. Because of the singularity at 
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yi X; —-—1 in the integrand, continuity at 1 is not by inspection; one must 
compute $,(1) «log (1-- b) - 1 and show it is equal to the limit. (The situation 
in two variables, which we dismissed with a word, is not easier.) 

The function $, is differentiable on [0, 1) by inspection or Lemma 1. It is 
only differentiable at 1 in an extended sense; we can show that $(1) = —o» and 
that lim, $1(y) =—®. 

The function $, has a unique maximum at a point yT, 0« yf « 1. From 
Lemma 2, y* —0 iff E(X) =(b—1)/2S0. The existence of yf «1 follows from 
¢$4(0) =(b—1)/2 and $(1) = —, the uniqueness from strict concavity, . 

The surprising fact is that for one variable X, a gambler does not bet all his 
fortune no matter what b is, but he does bet all his fortune on two 
independent copies of X, for b large enough. 

The reader who has carried out the calculations of Examples 2 and 3 knows 
that, because of the possible singularity on Y, y, =1, it is not clear that 4 
attains a maximum, and the differentiability of @ on the boundary Y y, = 1 is 
even less clear. 

Think of a continuous strictly increasing concave function f on [0,1] and 
redefine it at 1 so that its value there is less than f(0). If this redefined function 
were E(log(1-yX)) then X would be a most interesting game with no 
Kelly-Breiman optimal strategy: with unit fortune, if a gambler bet an amount 
less than 1 he could always do better by betting slightly more, but betting all 
would be worst. 

One result of Lemma 3 is that there is an optimal y* so no game can have 
the property discussed in the paragraph above. Another result of Lemma 3 is 
that $ is continuous, when finite. This is important because when we compute 
y*, a numerical calculation which will generally give y* to a certain number of 
decimals, we want to know that using this approximation to the exact y* will 
give close to optimal performance. 

The other result is a substitute for differentiation when y* has Y yf=1, 
which allows us to derive the basic inequalities (9) and (10). Note that if all the 
random variables X,, X5, -- - , Xy are discrete, with a finite number of values, 
as they are in [4], then we can differentiate $ at y*: for then if y* - X equals 
—1 it does so with positive probability and $(y*) = —« $(0) —0, contrary to 
$(y*) a maximum; thus ¢ is actually defined on a neighborhood of y* (which 
may extend outside D) and is differentiable as in Lemma 1. The problems 
which Lemma 3 resolves are those which arise from more general random 
variables. 


Lemma 3. 
(i) The function $ is continuous where finite, and attains a maximum at a 
point y* in D. 
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(ii) Let K={k: yt #0}. For k in K, E(X,J(1 y* - X)) is finite and non- 
negative. If k belongs to K, then for all m 


(9) E(X,/(1+ y* - X) Z EX,J( * y* - X)). 
If both k and m belong to K, 
(10) E(X A+ y* - X) = E(X,/(1+ y* - X)). 


Proof. Suppose that y™ converges to y® with $(y9) finite. Denote the 
positive and negative parts of log by Jog* and log": log*(x)=logx if x21, 
log* (x) - 0 if x 81 and log (x) - -log (x) if x &1, log (x) -0 if x21. Since 
log* (1- y - X) Sly: X| S max |X], $ is infinite only if it is —. By the Lebes- 
gue dominated convergence theorem lim flog* (14 y? - X) dP= 
flog* (1 y9- X) dP, by Fatous lemma liminfflog (1+y™ - X) dPz 
flog" (1+ y99 - X) dP, and putting these two facts together, lim sup ó(y) =£ 
$ (319). Thus $ is upper semicontinuous and therefore attains its maximum on 
the compact set {y in D: $(y)z 0). 

For O«a«1, d(ay*(1i-a)y9)za$(y)-(1—a)ó$(y9), and so 
lim inf, „o (ay - (1—a)y(9)z o(y™) if (y) is finite, i.e., @ is continuous 
along lines directed towards y from points where 4 is finite. 

Suppose that y” and y? are in D with $(y?) finite, and let a e[0, 1). Now 


1+ay + X+(1-a)y> X 214+a¥ yO(-1)+(1-a)y -X 
2(1-a)+(1-a)y@- X 
=(1—a)(1+y - X). 


Since log (1-2) is a decreasing function of z, log (1tay-X+ 
(1-a)y™ - X) Slog (1—-a)(1-- y™ - X) which equals —log (1—a)— 
log (1-- 4 - X) when (1~a)(1+ 49 - X) 1, and equals 0 otherwise. Hence 
flog: (1+ ay? - X -- (1—a)y'?- X) dP S~log (1—a) * flog" (1+ y? - X) dP «c, 
and thus $ (ay? t (1— a)49?) is finite for a 1. 

Let a point y be a given at which 6 is finite. For 1S k S N, let e“ be the 
vector in D whose kth coordinate is 1,e(?928,,e(9 20, and set y= 
ae? c(1—a)y,a in (0,1), for OSkKSN. Note that D is the convex hull 
cole”, e(D, -.., e"). As we have seen above, $(y?) is finite and so ¢ is 
continuous on the line joining y“ to y. Given e>0, by choosing a small 
enough we have |$(y/?) - $(y)| Se for OX kx N. For any vector v in the 
convex hull U=co(y, 49,..-, Y), v=} ay, a, z0, Y a, — 1, we have 
o(v)2¥ a. (y?) z (y) ^e. The convex hull U is easily seen to have an 
interior (relative to D). Since @ is upper semicontinuous, V= 
{a: b(a)<o(y)+e} is open. Therefore for v in the neighborhood U?n 
V, |d(v)-—(y)\Se and ¢ is continuous at y. 
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For y in D with È y, <1, and OS a £1, define V(a) - $(ay *- (1— a)*). We 
have seen that ¥ is continuous on [0,1] Given 0<e<1, for a in 
[e, 1], 12 (ay - (1-2 3) y ')  Xz&(1-Y; y)» 0. As in Lemma 1, 
loy — y) + X/(1+ (ay -(1—a)y') - X)| is bounded by the L! function |(y— 
y: X/e(1-Y y)| and we may use the Lebesgue dominated convergence 
theorem to justify differentiating under the integral to obtain 


(y-y9: X dP 
1+(ay+(1-—a)y*)-xX 
Since £ was arbitrary, this holds on (0, 1]. 

If Y yf «1, then Lemma 1 establishes the fact that the expectations in (10) 
are 0. As in Lemma 2, if m is not in K, then fixing all the variables in $ but Ym 
and considering that concave function with a max at 0 shows that d¢/d7,,(y*) = 
0 and (9) follows. 

Now suppose that Y yf=1 and let x 2 (x,, ***, xy) be some value of the 
random variable X in which not all the x, = —1 for k in K. Note that the event 
X, 7—1 for all k in K has probability 0 since the integrand in the integral 
defining $ is —œ there and ¢(y*) is finite. 

For 0x a «1 and y in D°, define 


f(a) =log (1+ (ay * (1 a)y*) : x). 


The function f is finite because x was so chosen, and is differentiable with 


Y'(a)7 | 


(y- y): x 


Fa)-1tGy-ü-aw9x 


Because f" $0, f'(a) increases to (y — y*) : x/(1+* : x) as a decreases to 0. For 
a5 0, (y— y): XI(1- (ay - (1—a)y') * X) in L? we may apply the B. Levi 
theorem to obtain 
| [Qv-y9: X 
lim v) = | Tea an 
By the mean value theorem, (W(a) — ¥(0))/a = Y'(b), 0 « b <a. Then, because Y 
is increasing, lim, o V'(a) = limao ((¥(a) — ¥(0))/a). Since $(y*) is a maximum 
the right-hand side is 0 and we obtain the basic 
(y-y9- X p 

(11) | IL dP sO. 

For 0<e<1 and k in K, the choice y, = y* for j#k and y, = yz(1— 6) in 
(11) gives -f X,/(1 2 y* - X) dP $0, and f X,/(14 y* - X) dP is non-negative 


and therefore finite. 
For k in K and any m, and 0 «e « y£, the choice y; = yf for j neither k nor 
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m, Yn = y*—e, Ym = y& +e in (11) gives (9). Finally, (10) follows from (9) by 
symmetry. 


3. Theorems 


If F, is a gambler's fortune at the nth time period, obtained by using some 
strategy y, y™,---, y, ---, and F* is the fortune obtained by using the 
fixed fraction strategy y*, y*, :::, y*,:--, then Breiman concludes ([4], 
Theorem 2, p. 72) that lim F,/F* almost surely exists and E(lim F,/F*)S1. We 
extend these results in Theorem 1 below. Two comments are in order. 

First, the advantage in using y*, as indicated by the fact that E(lim FJF*) £ 
1, does not require passage to the limit. This was noted by Durham, for the 
case of two branching processes, in the proof of Theorem 1 of [6], p. 571. In 
fact, given that the limiting result is true and given a finite strategy 
yP,>++, y™, extend it by setting y“ = y* for j>m; then FJF* = E,JF* for 
n zm and E(F,/F*) S1. This should reassure the careful investor who won- 
ders whether a strategy good in the long run may not be inferior in any 
practical number of trials—disregard of this point leads to an overevaluation of 
games of the type which produces the St. Petersburg paradox. 

Second, by our analysis of $, we are able to show in Theorem 1 that the 
presence of the expectation in E(lim F,/F*)=1 raises serious problems in any 
superficial attempt to use this as an indication of the superiority of y*. 


Theorem 1. Let F, be the fortune obtained by using a strategy y= 
yi) +++, y™ for n repeated investment periods, and let F* be the fortune 
obtained by using the fixed fraction strategy y*. Then 

(i) EJF* is a supermartingale with E(F,/F*)S1. Consequently, lim F./F* 
exists almost surely as a finite number and E(lim F,/F*) x 1. 

(ii) Suppose that y bets only on those X, with y¥>0, i.e., that y? =0 if l¢ K, 
1X lx N and all j. If Y, y¥=1, then further suppose that Y; yË — 1 for all j. Then 
F,/F* is a martingale with E(F,/F*) = 1. 


Proof. Let m be given and let Em be the sigma-algebra generated by XË, 
1SKSN, 1Sjsm. Then 


F +1 ) = . Xo*0 F,, | ) 
m -EÍ—————-—— .—-|«.1. 
E * au Cm E 1+y* . Xm» Ft m 


Since E,/F* is €,,-measurable, this equals 


E, 14oyy0m*D . Xo*D 
F* ( 1-y*- X 


«) 


Because y/"*" is a strategy depending on the past values of the X^, it too is 
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€,,-measurable. That, together with the independence of X'"*P and the 
previous X®, shows this equals 


ra Ehra) ior EM 
E (m+1) (A) 
F* Mey*- X È Yk 1+y*. X 


Since X+? has the same distribution as X, this can be written 


E x) 3x) 
2m — |+ (m+1) — 2k = 
(12) F* E yX p Ye ^E lay X A, say. 

By Lemma 3, E(X,/(1+ y* - X)) equals a constant c for k in K and is Sc for 
all k; this constant c =0 if Y yf « 1. Hence 


1-y*:. X| F, 

E C s * ix eat |= Fe E iesus x)- FÉ 
We have shown that F,/F* is a positive supermartingale, with E(F,/F*)S 
E(F,_,/F*_,) S++ : SE(F,[F,) =1, and so by the supermartingale convergence 
theorem it converges almost surely to a finite limit. Using Fatou's lemma, 
E(lim F,/F*) Slim inf E(F,/F*) 1. 

An examination of (12) shows that under the conditions of (ii), 
E(Fasi/F* 41 | Em) = F,/Ft, and (ii) follows. 


The requirement in Theorem 1(ii) that if Y y*=1 then we must have 
Y. yi? — 1, for all j, in order to be sure to get a martingale, is made clear by a 
one-variable example, Let X =2. Then $(y) —log (1+2+) is maximal at y* = 
1. The gambler will do worse betting any amount less than 1, even though he 
still bets on the same random variable as y* does, the key observation being 
$'(1) 7» 0. In general, the ‘partial derivatives’ E(X,/(1 + y* - X)), k in K, may be 
positive if Y y* — 1, whereas they are all 0 if Y yf «1 

The surprising result of Theorem 1 is the broad conditions in (ii) under which 
E(F,/F*)=1. To see what the surprise is, we shall superficially interpret 
Theorem 1(i): since ‘on the average’, and ‘for large n’, F,/F* £1, the gambler 
‘does better’ with F* than with F,. But then Theorem 1(ii) tells us that if the 
gambler simply bets on the same variables as y* does, but in any proportions at 
all, and if y* bets all so does he, then E(F,/F*) = 1. So with the same intuitive 
interpretation as above, ‘on the average’ the gambler does the same with F, as 
with F*, so it really does not matter which strategy he uses! But we know that 
it does matter. For example, in a repeated biased-coin toss, if he plays a fixed 
fraction strategy betting an amount y £ y* - p — q, then almost surely E/F* > 
0. Yet we have E(F,JF7) — 1. In general it will not help to look at lim F,/F*. 
For example, if on the first flip of the coin he bets all his fortune, and from then 
on he bets p-q, F,/F*>1 with probability p. 


AS 
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Theorem 2 will help our understanding of this situation by showing that F* 
is the only denominator with E(F, FP) 31 for all F,; in fact E(F*/F,)>1 if F, 
does not come from a strategy equivalent to using y* repeatedly. The suspi- 
cious reader will note that this characterization of the sense in which y* 
optimal contains an expectation. Anyone attempting to state intuitively the 
result of Theorem 2(i) in the form ‘F* is better than F, because, on the 
average, F*/F, 21’, should also be willing to apply the same interpretation to 
Theorem 1(ii) and conclude that often, ‘F,/F*=1 on the average and so F, 
and F* are often the same after all'. 


Theorem 2. 
(i) F*/F, is a submartingale with E(F*/F,)21. Lim F*/F, almost surely 
exists as an extended real number and E(lim F*/F,) z 1. 
(ii) E(FR]F,) = 1 iff y O, y, --, 4"? are all equivalent to y*, ie, iff 
y9 - X= y* - X almost surely for almost all values of X,-++,X® (of which 
yo is a function) for 1X jn. 


Proof. By Theorem 1, the non-negative lim F,/F* almost surely exists, and 
so lim F*/F, almost surely exists as an extended real number. 
As in Theorem 1, 
Ft. le.) -En ( Tty* Xim*n | n) 
eg Cm EF 10 ym». X5 € 


Suppose that (XP, --, X?) takes on the value w in R™Y, at which point 
y'™* takes on the value y“"*?(w). Then 


ER EP. | )- (C us)-B 
(13) se m+1 QXD om XO") ey E 1+ y" * (o) OX , say, 


because X“"* is independent of the values of (X“, +--+, X") ([5], Corollary 
4.38, p. 80). By Jensen's inequality, 


(14) Bzexp(E(log (1 y* - X"*))— E(log (1+ y"*"(o) - X"*?)) 
=C, say. 


By Lemma 1, C>1 unless y* - X= y™*'(w) - X almost surely, in which case 
C « 1. Thus 


F* ) F* 
imti >L 
(15) Eà Cm Fin 


with equality holding iff y* - X = y(5'? - X almost surely for almost all values w 
in the range of (X®,---,X™). If E(Ff+ı/Fm+1)=1, then 1= 
E(E(Fm+i/Fn+1 | €, )EECSEJE- SEE(FJF)-1. By (15) equality 
holds in (15) and therefore y* * X = y" *P(c) - X for almost all w in the range 
of (XD, +--+, X™). Continue for m, m—1,::*,1, to obtain (ii). 


248 M. Finkelstein and R. Whitley 


428 MARK FINKELSTEIN AND ROBERT WHITLEY 


Let Y=lim F*/F,. By Theorem 1, P(Y =0)=0 and E(1/Y) x 1. The func- 
tion g(x) 7 1/x is convex in (0,) and Jensen's inequality applies, to obtain 
12 E(1/Y)21/E(Y) which completes the proof. 
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18 
The Effect of Errors in Means, 
Variances, and Covariances on Optimal 
Portfolio Choice* 
Vijay K. Chopra and William T. Ziemba 


Good mean forecasts are critical to the mean-variance framework 


There is considerable literature on the strengths and limitations of mean- 
variance analysis. The basic theory and extensions of MV analysis are dis- 
cussed in Markowitz [1987] and Ziemba & Vickson [1975]. Bawa, Brown & 
Klein [1979] and Michaud [1989] review some of its problems. 

MV optimization is very sensitive to errors in the estimates of the inputs. 
Chopra [1993] shows that small changes in the input parameters can result in 
large changes in composition of the optimal portfolio. Best & Grauer [1991] 
present some empirical and theoretical results on the sensitivity of optimal 
portfolios to changes in means. This article examines the relative impact of 
estimation errors in means, variances, and covariances. 

Kallberg & Ziemba [1984] examine the question of mis-specification in 
normally distributed portfolio selection problems. They discuss three areas 
of misspecification: the investor's utility function, the mean vector, and the 
covariance matrix of the return distribution. 


They find that utility functions with similar levels of Arrow-Pratt absolute 
risk aversion result in similar optimal portfolios irrespective of the functional 
form of the utility’; Thus, mis-specification of the utility function is not a 
major concern because several different utility functions (quadratic, nega- 
tive exponential, logarithmic, power) result in similar portfolio allocations 
for similar levels of risk aversion. 


Misspecification of the parameters of the return distribution, however, does 
make a significant difference. Specifically, errors in means are at least ten 
times as important as errors in variances and covariances. 


We show that it is important to distinguish between errors in variances 
and covariances. The relative impact of errors in means, variances, and co- 
variances also depends on the investor's risk tolerance. For a risk tolerance of 


*Reprinted, with permission, from Journal of Portfolio Management, 1993. Copyright 
1993 Institutional Investor Journals. 
1For an investor with utility function U and wealth W, the Arrow-Pratt absolute risk 
aversion is ARA = —-U"(W)/U'(W). Friend and Blume [1975] show that investor behavior 
is consistent with decreasing ARA; that is, as investors’ wealth increases, their aversion to 
a given risk decreases. 
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50, errors in means are about eleven times as important as errors in variances, 
a result similar to that of Kallberg & Ziemba.? Errors in variances are about 
twice as important as errors in covariances. 


At higher risk tolerances, errors in means are even more important relative 
to errors in variances and covariances. At lower risk tolerances, the relative 
impact of errors in means, variances, and covariances is closer. Even though 
errors in means are more important than those in variances and covariances, 
the difference in importance diminishes with a decline in risk tolerance. 


These results have an implication for allocation of resources according to 
the MV framework. The primary emphasis should be on obtaining superior 
estimates of means, followed by good estimates of variances. Estimates of 
covariances are the least important in terms of their influence on the optimal 
portfolio. 


Theory 


For a utility function U and gross returns r — i (or return relatives) for assets 
i = 1,2,...,N, an investor's optimal portfolio is the solution to: 


N 
maximize Z(z) = E(U (Wo Y "(ri)z:)] 
i=l 
N 
such that z; > 0, > =1, 


i=] 


where Z(z) is the investor's expected utility of wealth, Wọ is the investor's 
initial wealth, the returns r; have a distribution F(r), and z; are the portfolio 
weights that sum to one. 


Assuming a negative exponential utility function U(W) = — exp(—aW) 
and a joint normal distribution of returns, the expected utility maximization 
problem is equivalent to the MV-optimization problem: 


N INN 
maximize Z (z) = 3^ Ef[r:]z; — i $5 Y az; Eloi] 
t=1 


i=1j=1 


N 
such that z; > 0, EF =1, 
i=] 


?The risk tolerance reflects the investor’s desired trade-off between extra return and 
extra risk (variance). It is the inverse slope of the investor’s indifference curve in mean- 
variance space. The greater the risk tolerance, the more risk an investor is willing to 
take for a little extra return. Under fairly general input assumptions, a risk tolerance of 50 
describes the typical portfolio allocations of large US pensions funds and other institutional 
investors. Risk tolerances of 25 and 75 characterize extremely conservative and aggressive 
investors, respectively. 
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EXHIBIT 1: 

List of Ten Randomly Chosen DJIA Securities 
Aluminum Co. of America 
American Express Co. 
Boeing Co. 
Chevron Co. 
Coca Cola Co. 
E.I. Du Pont De Nemours & Co. 
Minnesota Mining and Manufacturing Co. 
Procter & Gamble Co. 
Sears, Roebuck & Co. 
United Technologies Co. 


SOPNOABRWN YD 


ren 


where E|r;] is the expected return for asset i, t is the risk tolerance of the 
investor, and E[cij] is the covariance between the returns on assets i and j.? 


A natural question arises: How much worse off is the investor if the distri- 
bution of returns is estimated with an error? This is an important considera- 
tion because the future distribution of returns is unknown. Investors rely on 
limited data to estimate the parameters of the distribution, and estimation 
errors are unavoidable. Our investigation assumes that the distribution of 
returns is stationary over the sample period. If it is time-varying or non- 
stationary, the estimated parameters will be erroneous. 


To measure how close one portfolio is to another, we compare the cash 
equivalent (CE) values of the two portfolios. The cash equivalent of a risky 
portfolio is the certain amount of cash that provides the same utility as the 
risky portfolio, that is, U(CE) = Z(z) or CE = U^![Z(z)] where, as defined 
before, Z(z) is the expected utility of the risky portfolio.* The cash equivalent 
is an appropriate measure because it takes into account the investor's risk 
tolerance and the inherent uncertainty in returns, and it is independent of 
utility units. For a risk-free portfolio, the cash equivalent is equal to the 
certain return. 


Given a set of asset parameters and the investors risk tolerance, a MV- 
optimal portfolio has the largest CE value of any portfolio of those assets. The 


3Although the exponential utility function is convenient for deriving the MV problem 
with normally distributed returns, the MV framework is consistent with expected utility 
maximization for any concave utility function, assuming normality. 

*For negative exponential utility, Freund [1956] shows that the expected utility of port- 
folio z is Z(z) = 1— exp(—aE[z] + (a?/2)Var[z]), where E[X] and Var[z] are the expected 
return and variance of the portfolio. The cash equivalent is CE, = (1/a)log(1 — Z(z)). 
If returns are assumed to have a multivariate normal distribution, this is also the cash 
equivalent of an MV-optimal portfolio. See Dexter, Yu & Ziemba [1980] for more details. 
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percentage cash equivalent loss (CEL) from holding an arbitrary portfolio, x 
instead of an optimal portfolio o is 
CE, — CE, 


CEL = CE, 


where CE, and CE, are the cash equivalents of portfolio o and portfolio x 
respectively. 


Data and Methodology 


The data consist of monthly observations from January 1980 through Decem- 
ber 1989 on ten randomly selected Dow Jones Industrial Average (DJIA) se- 
curities. We use the Center for Research in Security Prices (CRSP) database, 
having deleted one security (Allied-Signal, Inc.) because of lack of data prior 
to 1985. Each of the remaining twenty-nine securities had an equal probabil- 
ity of being chosen. The securities are listed in Exhibit 1. 

MV optimization requires as inputs forecasts for: mean returns, variances, 
and covariances. We computed historical means (7;), variances (cj), and 
covariances (c;;), and assumed that these are the ‘true’ values of these pa- 
rameters. Thus, we assumed that E[r;] = 7;, Elou] = don, and Eloi;] = vij. A 
base optimal portfolio allocation is computed on the basis of these parameters 
for a risk tolerance of 50 (equivalent to the parameter a = 0.04). 

Our results are independent of the source of the inputs. Whether we use 
historical inputs or those based on a complete forecasting scheme, the results 
continue to hold as long as the inputs have errors. 

Exhibit 2 gives the input parameters and the optimal base portfolio re- 
sulting from these inputs. To examine the influence of errors in parameter 
estimates, we change the true parameters slightly and compute the resulting 
optimal portfolio. This portfolio will be suboptimal for the investor because 
it is not based on the true input parameters. 

Next we compute the cash equivalent values of the base portfolio and the 
new optimal portfolio. The percentage cash equivalent loss from holding the 
suboptimal portfolio instead of the true optimal portfolio measures the impact 
of errors in input parameters on investor utility. 

To evaluate the impact of errors in means, we replaced the assumed true 
mean 7; for asset i by the approximation 7;(/ + kz;) where z; has a standard 
normal distribution. The parameter k is varied from 0.05 through 0.20 in 
steps of 0.05 to examine the impact of errors of different sizes. Larger values 
of k represent larger errors in the estimates. The variances and covariances 
are left unchanged in this case to isolate the influence of errors in means. 

The percentage cash equivalent loss from holding a portfolio that is optimal 
for approximate means 7;(1+ kz;) but is suboptimal for the true means r, is 
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EXHIBIT 2: 
Inputs to the Optimization and the Resulting Optimal Portfolio for a 
Risk Tolerance of 50 (January 1980-December 1989) 
Alcoa Amex Boeing Chev. Coke Du Pont MMM P&G Sears U Tech 

Means (% 

per month) 1.5617 1.9477 1.907 1.5801 2.1643 1.6010 1.4892 1.6248 1.4075 1.1537 
Std. Dev. (% 

per month) 8.8308 8.4585 10.040 8.6215 5.988 6.8767 5.8162 5.6385 8.0047 8.212 


Correlations 


Alcoa 1.0000 

Amex 0.3660 1.0000 

Boeing 0.3457 0.5379 1.0000 

Chev. 0.1606 0.2165 0.2218 1.0000 

Coke 0.2279 0.4986 0.4283 0.0569 1.0000 


Du Pont 0.5133 0.5823 0.4051 0.3609 0.3619 1.0000 

MMM 0.5203 0.5569 0.4492 0.2325 0.4811 0.6167 1.0000 

P&G 0.2176 0.4760 0.3867 0.2289 0.5952 0.4996 0.6037 1.0000 

Sears 0.3267 0.6517 0.4883 0.1726 0.4378 0.5811 0.5671 0.5012 1.0000 

U Tech 0.5101 0.5853 0.6569 0.3814 0.4368 0.5644 0.6032 0.4772 0.6039 1.0000 


Optimal Port. 
Weights 0.0350 0.0082 0.0 0.1626 0.7940 0.0 0.0 0.0 0.00 0.00 


then computed. This procedure is repeated with a new set of z values for a 
total of 100 iterations for each value of k. 


To investigate the impact of errors in variances each variance forecast oj; 
was replaced by o,;(1 + kZ;). To isolate the influence of variance errors, the 
means and covariances are left unchanged. 


Finally, the influence of errors in covariances is examined by replacing 
each covariance gi; (i # j) by dij + kzi; where zi; has a standard normal 
distribution, while retaining the original means and variances. The procedure 
is repeated 100 times for each value of k, each time with & new set of z values, 
and the cash equivalent loss computed. The entire procedure is repeated for 
risk tolerances of 25 and 75 to examine how the results vary with investors' 
risk tolerance. 


Results 


Exhibit 3 shows the mean, minimum, and maximum cash equivalent loss over 
the 100 iterations for a risk tolerance of 50. Exhibit 4 plots the average CEL 
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EXHIBIT 3: 
Cash Equivalent Loss (CEL) for Errors of Different Sizes 
k (size of Parameter Mean Min. Max. 
error) with Error CEL CEL CEL 
0.05 Means 0.66 0.01 5.05 
0.05 Variances 0.05 0.00 0.34 
0.05 Covariances 0.02 0.00 0.25 
0.10 Means 2.45 0.01 15.61 
0.05 Variances 0.22 0.00 1.39 
0.10 Covariances 0.11 0.00 0.66 
0.15 Means 5.12 0.15 24.35 
0.15 Variances 0.55 0.00 3.35 
0.15 Covariances 0.27 0.00 1.11 
0.20 Means 10.16 0.17 36 09 
0.20 Variances 0.90 0.01 4.16 
0.20 Covariances 0.47 0.00 1.94 


as a function of k. The CEL for errors in means is approximately eleven 
times that for errors in variances and over twenty times that for errors in 
covariances. Thus, it is important to distinguish between errors in variances 
and errors in covariances.? For example, for k = 0.10, the CEL is 2.45 for 
errors in means, 0.22 for errors in variances, and 0.11 for errors in covariances. 


Our results on the relative importance of errors in means and variances 
are similar to those of Kallberg & Ziemba [1984]. They find that errors in 
means are approximately ten times as important as errors in variances and 
covariances considered together (they do not distinguish between variances 
and covariances). 


Our results show that for a risk tolerance of 50 the importance of errors 
in covariances is only half as much as previously believed. Furthermore, the 
relative importance of errors in means, variances, and covariances depends 
upon the investor’s risk tolerance. 


Exhibit 5 shows the average ratio (averaged over errors of different sizes, k) 
of the CELs for errors in means, variances, and covariances. An investor with 
a high risk tolerance focuses on raising the expected return of the portfolio 

5The result for covariances also applies to correlation coefficients, as the correlations 


differ from the covariances only by a scale factor equal to the product of two standard 
deviations. 


The Effect of Errors in Means, Variances, and Covariances on Optimal Portfolio Choice 255 


The Effect of Errors 99 


*& Cash 
Equivalent Loss 


11 


10 
9 
8 
7 
6 
5 
4 
3 
2 
1 . Variances 
0 _ Covariances 
0 0.05 0.10 0.15 0.20 
Magnitude of error (k) 
EXHIBIT 4 


Mean percentage cash equivalent loss due to errors in inputs 


EXHIBIT 5 
Average Ratio of CELs for Errors in Means, Variances, and Covariances 
Risk Errors in Means Errors in Means Errors in Variances 
Tolerance versus Variances versus Covariances versus Covariances 
25 3.22 5.38 1.67 
50 1.98 22.50 2.05 
75 21.42 56.84 2.68 


and discounts the variance more relative to the expected return. To this 
investor, errors in expected returns are considerably more important than 
errors in variances and covariances. For an investor with a risk tolerance of 
75, the average CEL for errors in means is over twenty-one times that for 
errors in variances and over fifty-six times that for errors in covariances. 


Minimizing the variance of the portfolio is more important to an investor 
with a low risk tolerance than raising the expected return. To this investor, 
errors in means are somewhat less important than errors in variances and 
covariances. For an investor with a risk tolerance of 25, the average CEL for 
errors in expected returns is about three times that for errors in variances 
and about five times that for errors in covariances. 

Most large institutional investors have a risk tolerance in the 40 to 60 range. 
Over that range, there is considerable difference in the relative importance 
of errors in means, variances, and covariances. Irrespective of the level of 
risk tolerance, errors in means are the most important, followed by errors 
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in variances. Errors in covariances are the least important in terms of their 
influence on portfolio optimality. 


Implications and Conclusions 


Investors have limited resources available to spend on obtaining estimates of 
necessarily unknowable future parameters of risk and reward. This analysis 
indicates that the bulk of these resources should be spent on obtaining the 
best estimates of expected returns of the asset classes under consideration. 


Sometimes, investors using the MV framework to allocate wealth among 
individual stocks set all the expected returns to zero (or & non-zero constant). 
This can lead to a better portfolio allocation because it is often very difficult 
to obtain good forecasts for expected returns. Using forecasts that do not 
accurately reflect the relative expected returns of different securities can sub- 
stantially degrade MV performance. 


In some cases it may be preferable to set all forecasts equal. The opti- 
mization then focuses on minimizing portfolio variance and does not suffer 
from the error-in-means problem. In such cases it is important to have good 
estimates of variances and covariances for the securities, as MV optimizes 
only with respect to these characteristics. 


Of course, if investors truly believe that they have superior estimates of 
the means, they should use them. In this case it may be acceptable to use 
historical values for variances and covariances. 


For investors with moderate to high risk tolerance, the cash equivalent loss 
for errors in means is an order of magnitude greater than that for errors in 
variances or covariances. As variances and covariances do not much influence 
the optimal MV allocation (relative to the means), investors with moderate- 
to-high risk tolerance need not expend considerable resources to obtain better 
estimates of these parameters. 
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This paper considers the problem of investment of capital in risky assets in a dynamic capital 
market in continuous time. The model controls risk, and in particular the risk associated with 
errors in the estimation of asset returns. The framework for investment risk is a geometric 
Brownian molion model for asset prices, with random rates of return. The information 
filtration process and the capital allocation decisions are considered separately. The filtration 
is based on a Bayesian model for asset prices, and an (empirical) Bayes estimator for current 
price dynamics is developed from the price history. Given the conditional price dynamics, 
investors allocate wealth to achieve their financial goals efficiently over time. The price updat- 
ing and wealth reallocations occur when control limits on the wealth process are attained. 
A Bayesian fractional Kelly strategy is optimal at each rebalaneing, assuming that the risky 
assets are jointly lognormal distributed. The strategy minimizes the expected time to the upper 
wealth limit while maintaining a high probability of reaching that goal before falling to a 
lower wealth limit. The fractional Kelly strategy is a blend of the log-optimal portfolio and 
cash and is equivalently represented by a negative power utility function, under the multi- 
variate lognormal distribution assumption. By rebalancing when control limits are reached, 
the wealth goals approach provides greater control over downside risk and upside growth. The 
wealth goals approach with random rebalancing times is compared to the expected utility 
approach with fixed rebalancing times in an asset allocation problem involving stocks, bonds, 
and cash. 
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1. Introduction 


In capital accumulation. under uncertainty, a decision- 
maker must determine how much capital to invest in 
riskless and risky investment opportunities at each point 
in time. The investment strategy yields a stream of capital 
over time, with investment decisions made so that 
the distribution of wealth has desirable properties. 
An investment strategy which has generated considerable 
interest is the growth optimal or Kelly strategy, where the 
expected logarithm of wealth is maximized (Kelly 1956). 
The wealth distribution of this strategy has many attrac- 
tive characteristics (see e.g. Hakansson 1970, 1971, 
Markowitz 1976, Hakansson and Ziemba 1995). As 
Breiman (1960, 1961), and Algoet and Cover (1988) 
have shown, the Kelly strategy maximizes the long run 
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expected rate of growth of capital and minimizes the 
expected time to reach a fixed level of wealth for suffi- 
ciently large goals under mild conditions. Researchers 
such as Thorp (1975), Hausch e/ al. (1981), Grauer and 
Hakansson (1986, 1987), and Mulvey and Vladimirow 
(1992) have used the optimal growth strategy to compute 
optimal portfolio weights in multi-asset and worldwide 
asset allocation problems. 

The literature considers the stream of capital following 
from an investment policy from either a wealth or a time 
perspective. In most situations the expected utility of 
accumulated capital at a fixed point in time is analysed. 
There is particular interest in the logarithm of accumu- 
lated capital, and the corresponding rate of capital 
growth. Alternatively, the growth rate can be viewed as 
the time it takes for accumulated capital to reach wealth 
milestones. 

The distribution of accumulated capital to a fixed 
point in time and the distribution of the first passage 
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time to a fixed level of accumulated capital are variables 
controlled by the investment decisions. The Kelly strategy 
maximizes the expected logarithm of accumulated capital 
or the expected growth rate. However, the strategy is very 
aggressive. As Hausch and Ziemba (1985) and Clark and 
Ziemba (1987) have demonstrated, the optimal portfolio 
weights in the risky assets given by this strategy tend to be 
so large for favourable investments that the chances of 
losing a substantial portion of wealth are very high, par- 
ticularly if the probability estimates are in error. In the 
time domain, the chances are high that the first passage to 
subsistence wealth occurs before achieving the established 
wealth goals. 

When the investor is more risk averse, then this can be 
reflected in the utility function choice. If the utility 
function has an Arrow-Pratt risk aversion parameter, 
e.g. the constant relative risk aversion (CRRA) utility, 
then the value of this parameter captures risk tolerance; 
see Kallberg and Ziemba (1983). 

Another approach is to define a measure of risk which 
depends on the investment decision. A standard measure 
of risk is volatility as defined by the variance of wealth at 
a point in time or the variance of the passage time to a 
wealth target. Mean-variance analysis of wealth has been 
widely used to determine investment strategies; see 
Markowitz (1952, 1987). In the time domain the mean- 
variance approach yields different strategies. However, 
the logarithm of wealth and the first passage time have 
consistent mean-variance properties; see Burkhardt 
(1998). 

An alternative to variance is to use a downside risk 
measure (Breitmeyer et al. 1999). MacLean et al. (1992) 
considered, as risk measures, quantiles for wealth, log- 
wealth and first passage time in identifying investment 
strategies which achieve capital growth with a required 
level of security. Security is defined as controlling down- 
side risk. Growth is traded for security with fractional 
Kelly strategies. In discrete time models with general 
return distributions this strategy is generally suboptimal, 
but it has attractive wealth/time distribution properties. 
See MacLean and Ziemba (1999) for extensions of this 
research. The emphasis in that trade-off work is the prop- 
erties of the wealth process for a given fraction. For 
example, the probability of reaching upper wealth U 
before lower wealth L is calculated for various strategies. 
In this paper, the reverse problem of finding a strategy 
which achieves a specified probability (risk) is studied. 

The most common downside risk measure is Value at 
Risk (VaR) (Jorion 1997). VaR has been studied exten- 
sively (Artzner et al. 1999, Gaivoronski and Pflug 2005). 
Basak and Shapiro (2001) consider VaR in a model with 
CRRA utility. Although VaR is an industry standard it 
has weaknesses in controlling risk. The most serious 
shortcoming is the insensitivity to very large losses 
which have small probability—the essence of risk. 
Measures based on lower partial moments (incomplete 
means) such as CvaR (Rockafeller and Uryasev 2000) 
and convex risk measures based on target violations 
(Carino and Ziemba 1998, Rockafeller and Ziemba 
2000) attempt to deal with this problem. 


Similar to mean-variance, a variety of mean-risk 
problems have been studied. Bi-criteria problems such 
as maximizing expected logarithm of wealth subject to a 
VaR constraint are consistent with stochastic dominance, 
the traditional concept for ordering wealth distributions. 
(Ogryezak and Ruszezynski 2002). 

There is, however, another issue complicating the 
measurement of risk and return, which is the estimation 
of parameters defining the returns distribution. The value 
of the risk/return measures in practice are estimates of 
the true values, and the error of the estimates of measures 
is very sensitive to errors in the estimation of the returns 
distribution. Furthermore, the sensitivity to estimation 
errors of expected value measures (mean, CVaR) is 
much greater than quantile measures (median, VaR). 

In this paper parameter estimation and risk control 
are considered in a model where the filtration and control 
processes are separate. A dynamic stochastic model for 
asset prices is presented in section 2. The model is a gen- 
eralization to the multi-asset case of the random coeffi- 
cients model of Browne and Whitt (1996). The inclusion 
of many assets facilitates the estimation of model coeffi- 
cients since the correlation between assets is related to the 
intrinsic model structure. The existence of latent factors 
generating price movements is implied by the equations 
for price dynamics. Alternatively, price parameters could 
be related to observable state variables, as is the case in 
the model of Xia (2001). (See also Brennan 1998) 

Given the estimated price dynamics, an investment 
decision is made to control the path of future wealth. 
In section 3, an approach to control is developed based 
upon wealth levels as stopping rules. In practice, an 
investment portfolio cannot be continuously rebalanced 
and a realistic approach is to reconsider the investment 
decision at regular discrete time intervals (Rogers 2000). 
At each rebalance time, with additional data and a change 
in wealth, price parameters are re-estimated and a new 
investment strategy is developed. The process is 
illustrated in figure 1. The most significant aspect of the 
rebalancing is the accuracy of the estimated returns 
distribution. If forecasts are accurate, then wealth will 
accumulate as expected. If prices are not as forecast 
over the next hold interval then unacceptable wealth 
evels may result. To protect against that outcome wealth 
limits can be placed on the wealth process. If the wealth 
rajectory is within the limits, the wealth process is under 
control. However, the asset pricing model and the invest- 
ment decisions are reconsidered when a control limit is 
reached. So rebalancing occurs at a random time rather 
han at a given point in time. Intervention occurs when 
he wealth trajectory is not proceeding as expected and it 
can be concluded that the investment decision does not fit 
he true securities price process. 

Whether rebalancing occurs at fixed or random times, 
at the rebalancing the investor: 


1. generates new estimates for returns distributions; 

2. establishes next wealth limits for random time rebal- 
ancing, or the next hold time for the fixed rebalance 
interval; 
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Figure 1. Dynamic investment process. 


3. determines a new investment strategy based on the 
updated returns estimates, and control limits or 
hold time. 


At a rebalance point, the decision on investment 
is based on accumulated wealth at a planning horizon. 
A number of bi-criteria decision models have been men- 
tioned, but the model considered is maximization of 
expected utility at the horizon, with a negative power 
utility. It is significant that there exist wealth goals and 
an associated bi-criteria decision model which yields an 
identical investment strategy. Although demonstrated 
for the special case of negative power utility, this result 
is general, so that wealth goals reflect a wide range of 
preferences. 

The wealth goals decision is put in the context of 
capital growth with security. So a strategy is determined 
which minimizes the expected time to the upper 
limit (growth) while maintaining a high probability of 
achieving the upper limit before falling to the lower 
limit (security). The optimal strategy for the power utility 
and growth-security models is shown to be fractional 
Kelly—a blend of the risk free (cash) strategy and the 
optimal growth strategy. 

Although the negative power utility and wealth goals 
models are linked by the strategy, the approaches to risk 
are different. The expected utility model considers risk at 
the horizon, whereas the wealth goals model considers 
risk along the trajectory. In section 4, the wealth goals 
and power utility approaches are compared for the funda- 
mental problem of investing in stocks, bonds, and cash 
over time. It is shown that there is an advantage from 
wealth goals, since rebalancing occurs at times indicated 
by the wealth trajectory, ie. the wealth condition. 
This result is comparable to the superiority of condition 
based intervention in repairable systems (Aven and 


Jensen 1999). Appendix A contains proofs for proposi- 
tions in the text. 


2. Dynamic estimation of asset price distributions 


In the dynamic investment process shown in figure 1, 
the returns on risky assets are re-estimated at the time 
of rebalancing a portfolio. The updated returns distribu- 
tions are inputs to the investment decision models. In this 
section a linear pricing model is proposed where the esti- 
mation problem is separated from the control problem. 
The model is Bayesian, so that the updating of parameter 
estimates follows from Bayes theorem, and fits naturally 
into the dynamic investment process. 


2.1. Price model 


Suppose there are m risky assets, with P;(f) equal to the 
trading price of asset i at time t, /— ],..., m, and a risk- 
less asset with rate of return r al time /. The risky asset 
prices are assumed to have a joint log-normal distribu- 
tion. Letting Yi(r) = €nP(t), i—0,...,m, the price 
dynamics are defined by the stochastic differential 
equations 


d¥o() — r di, (1) 
dY;( 2oj(ndr-5;,dU, i=1,...,m, 
where dU; i-—l,...,m, are independent standard 


Brownian motions. They represent the variation in price 
specific to each security. The other component of price 
variation is generated by the instantaneous mean rate of 
return a2), 1— 1,..., m. Assume that a7) is a random 
variable with distribution defined by 


oj) = Mit yz) il... 


The Zi, i2 1,...,m, are correlated Gaussian variables, 
with pj; being the correlation between Z, and Z;. So the 
rates of return are correlated, with the covariance between 
a(t) and o) given by viyjbij. 

The price process defined by (I) and (2) is a variation 
on the Merton model (1992) with the condition that the 
rates oj, i= l,..., m, are stochastic and the volatilities à; 
i=1,...,m, are not stochastic. The intention is to empha- 
size uncertainty in the mean since errors in estimating the 
mean have by far the greatest impact on portfolio deci- 
sions (Chopra and Ziemba 1993). The asset prices in the 
model are correlated, with the correlation generated by 
the relationship between the random rates of return on 
assets. Presumably those rates of return are related 
through some common factors. The factors may be latent, 
as is assumed in this paper, or they may be observable 
state variables as presented by Xia (2001), who considers 
the single risky asset case. 

If the model for prices is correct then the error in 
forecasting prices arises from errors in parameter 
estimates. Error from incorrect modeling may be con- 
founded with estimation error, so attention to the correct 
model in an application may be significant. There are 
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generalizations of (1) and (2) which can be incorporated 
into the approach developed in this section. For example, 
the rates of return could be defined by stochastic 
differential equations: do;(r) = n; dr + y;dq;, where dq; 
are correlated Brownian motions, i— l,...,71. Then 
e(t) = a{0)+ uit + yi | Z;(D, and the rates are dynamic. 
The assumption in (2) simplifies the presentation, so that 
formulation is followed, but reference to the dynamic 
linear model will be made where appropriate. With regard 
to the stationary process (2), the model assumption applies 
until the next rebalance point. That is, the distribution for 
the random rates is the same between rebalance points, 
but the distribution at the next point may change. The 
model in (2) can be considered as an approximation 
between rebalance points. When used with control limits, 
where rebalancing occurs when the actual prices depart 
from model forecasts, this is a useful approach. 

From (1) and (2) the relevant distributions for 
asset prices follow. Let Y() -(Y,(0,..., Y4(0y,p = 


(ys s Hm) = (sess. On), A = diag(s},..., Sm) 
and P = (yj), where yy = Yyjog 
(Prior) «^ N(u, D), (3) 
(Conditional)  (Y(Ola, A) ^ N(at, tA), (4) 
(Marginal) Y(r)  N(u(n), E), (5) 


where p() = tj and E(r) = T (£^ = T(0 + A(1). 

The Bayesian pricing model defined by (3)-(5) has an 
alternative structural. model representation. The prior 
covariance has a decomposition T = AA’, where A is an 
m x i matrix with / «in. Consider the independent /atent 
factors F!  (Fy,..., Fi), F~ N(,1), and the errors 
&(t) = (ey (D, ... Eml O) &(G) ~ NO, AD). Let A(0 1A. 
Then the log prices at time ¢ are 


Y(f) 2 w+ AQF + e(t). (6) 


From (6), Y(r) ~ ND, E(D), where X(r) = ACMA) + 
A(1). The relationship between prices is driven by under- 
lying common factors which are unobserved. This 
manifests itself in rates of return, œ = i + AF, which 
are random. 

The factor model in (6) has been the subject of numer- 
ous asset pricing studies, with the factors representing 
traded portfolio's in some cases (Campbell et al. 1997). 
Here, the factor model is used to define estimates 
for parameters in the inter-temporal pricing model of 
(1) and (2). 


2.2. Bayes estimation 


At the time an investor is rebalancing a portfolio, infor- 
mation is available on past realized securities prices. 
Consider the data available at time t, (Y(0,0 « s < 4}, 
and the corresponding filtration FY = o(Y(0,0 « s < 0, 
the o-algebra generated by the process Y up to time /. 
Conditioned on the data, the distribution for the rate of 
return can be determined from Bayes theorem. That is, 


&() = (DIE) x NGC, PO), 0) 
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where à()) = Wl) FU — MEOW — n), with 
FO =+ YO, 
and 
PO = SU AME 0)50. 


The Bayes estimate for the rate of return o(r) is the 
conditional expectation &(t) = E@(1). This estimate is the 
minimum mean squared error forecast for the rate given 
the data. In the context of rebalancing, this is the 
planning value for œ until the next decision time ¿+r 
(when the stopping boundaries are reached or the 
fixed rebalance time occurs). Then, since no informa- 
tion is added in the hold interval, Ela ED = 
E(E(o (0| F), IFY), and the best estimate is á(r) through- 
out the interval. 

The Bayes estimate &(/} has an appealing form. It is 
inherently dynamic, so that estimates can be updated as 
more information on prices is obtained. In particular, 
at the next rebalance time / + r, the posterior &(/) becomes 
the next prior and Bayes theorem is used to update with 
the new information—Y(t-- r). This approach is 
described in MacLean ef al. (2004). There is another 
approach which emphasizes the current dynamics. The 
pricing model parameters (x, T, A) are interpreted as 
values for the most recent rebalance cycle, and only 
data from that time interval is used in estimating the 
parameters. With control limits, the recent data contains 
information on a change in dynamics, and a change in 
prior parameters. The technique for implying the values 
for the prior parameters from data is empirical Bayes 
estimation. 

To develop the estimation consider a rebalance interval 
(0, /) and prices at the discrete times Kt/n, k —0,...,n, 
with corresponding log prices Y(k):= Y[(k/m)]]. The 
change in log prices between times &//n and [(k + 1)/n] 


is e(k) = Y(k) — Y(k — D, k 2 L...,n. From the model 
equations 
ek) 2 ao [iaz ko eun (8) 
n n 


where Z ^ N(0, 1). 

The increment vectors defined by (8) are independent 
and Gaussian for each &. The covariance matrix for each 
vector is 


2 
f t 
E(Q = THA = DAD + AQ). 


and 


" 


E l 
¥() == Yee) 
k: 


Consider observed log prices on risky securities at the 


discrete time points: {Y i= 1,...,m, k 20,...,n]. 
With e = Ya — Y;,.,, the data on increments (first- 
order differences in log prices) are (eg,i — l,..., m, 


k =1,...,n}. Let the covariance matrix for the observed 
increments be S,,, the estimate of £,,(1). 
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From the random effects model (8) the theoretical 
covariance is factored as E(N) =T (A + A0. If the 
rank (P,(r)) 2 / < m, then the sample covariance matrix 
can be factored to produce estimates of L',(r) and A,(0. 
In terms of the structural model for prices in (6), the 
assumption rank (L,(0) 2/« m is equivalent to the 
assumption the number of latent factors | is strictly less 
than the number of risky assets m. 

Assuming /< m, a maximum likelihood factor analysis 
of Sa with / factors will yield a /oading matrix L,, and a 
specific error matrix D,,, where D, is diagonal. Since the 
number of factors is known, (Lan Dn is an efficient esti- 
mator of (A,(), A,()), with A,(9 = ((/n)A (Lawley and 
Maxwell 1971). Then G,, = L,,L;, is an efficient estimator 
of T,(0. 

The prior mean x for the rates of return on securities 
in a rebalance interval needs to be estimated. If it is 
assumed that the rates have a common prior mean, 
Le. ug, =p, i= l,...,m, then the overall mean Y(t) = 
(7D) 35.4 Yir) is an estimate of u, where Y,() = 
(1/0) Y(1). The assumption of a common value is consis- 
tent with a long run rate of return on securities. 
(Alternatively, securities in the same class could have 
a common mean, with differences between classes of 
securities.) 

The discussion has identified estimates for all model 
parameters within a rebalance interval. The conditional 
distribution for securities prices can be estimated, and in 
particular the conditional mean rate of return for the next 
time interval can be forecast. Since the forecast is the 
Bayes formula with the prior parameters estimated from 
data it is an empirical Bayes estimate. 


Proposition 1: Let ¥,,4 —0,...,n, be observations on 
the vector of log prices of risky assets at regular intervals 
of width tn, with the statistics computed from first-order 
increments: Sins Lin, Dm and Y, Then A — (n/0D,, 
A =(n/NLy, are estimates of the model parameters A 
and A, respectively. With Af) - 1A, AQ) =À and 
= AAO + AQ), an estimate for the conditional 
mean rate of return at time t is 


&,() = YO) + (0 — A(OSDORO — YO) (9) 


Furthermore, the limit as n > oo of à,(7) is G(r). 

The empirical Bayes estimate in (9) is a natural formu- 
lation given the proposed pricing model. If the model is 
correct then d,(r) has smaller mean squared error than 
well known estimates for the rate of return, such as the 
maximum likelihood estimate and the James-Stein esti- 
mate. If the model is correct but the number of latent 
factors is unknown, then there is additional estimation 
error from estimating that number. However, results 
from similar models indicate such error is small; see 
MacLean and Weldon (1996). If à dynamic model for 
rates, af) = e(0) + uit + yj I IZ(,; — 1,..., 1 is the 
correct formulation then the approach described above 
can be modified to obtain empirical Bayes estimates for 
parameters. The modification involves calculating second- 
order differences to obtain a random effects model with itd 
second-order increments. 


With this estimation procedure, an essential component 
of the dynamic investment process is in place. The next 
component, the planning model, will have the forecasts 
from the estimated model as inputs. 


3. Portfolio planning models 


Assuming that at a rebalance point the data has been 
filtered to obtain estimates for the parameters, the 
decision on how much capital to allocate to the various 
assets is now considered. The objective is to control the 
path of accumulated capital generated by the decision and 
the unfolding asset prices. The decision will be developed 
from the estimated price parameters whereas capital will 
accumulate from the true process. Of course, if the esti- 
mates used in computing a strategy are substantially in 
error, then the trajectory of wealth will not proceed as 
anticipated, but the trajectory may still be under control. 
At the rebalancing time /, there are estimates for the 
conditional rate of return á(/), and the volatility A(), 
based on the methods of section 2. The forecast dynamics 
for the price process, conditional on the estimates for 
parameters, are 


dY()) = ade + A" (naZ, (10) 


where the innovations process dZ is standard Brownian 
motion. 
An investment strategy is a vector process 


Lol, XO) r = 0] = [xo(0, 650... x0. 1 = OF, 
(11) 


where $ 7*9 x;() = 1 for any z, with xo(/) the investment 
in the risk-free asset. The proportions of wealth invested 
in the m risky assets are unconstrained since xo(7) can 
always be chosen, with borrowing or lending, to satisfy 
the budget constraint. 

The forecast change in wealth from an investment deci- 
sion X(r) is determined by the conditional price process 
(10). Suppose the investor at time / has wealth W(r) and 
let $0) = G,(2) + (1/2)8 (0, i= 1,...,m. Then the fore- 
cast for the instantaneous change in wealth is 


dW(n- » xi(X$;) — r) + | winder 
tat 


+ W() SO x(054Z;, (12) 
i=l 


where dZ; is standard Brownian motion. 
There is a growth condition which will be required of 
any feasible investment strategy: 


m 


bs IAAD- 1) r- 2 soso] >0 | : 


x,-1x(n 


i=] 


With a fixed mix strategy over the hold interval satisfy- 
ing the growth condition, the forecast wealth process 
W(t), t>, defined by (12) follows geometric 
Brownian motion such that W(t) > 0. 
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To determine an investment decision which controls 
wealth, two approaches are considered. First, a model 
which considers preferences for wealth at the planning 
horizon is defined. The standard approach is to maximize 
expected utility at a planning horizon. Because of uncer- 
tainty in the parameters for the asset pricing model, 
parameters are re-estimated and decisions revised at 
regular intervals in time up to the horizon. In the second 
approach, the preferences at the horizon are used to 
develop wealth goals (limits). Then a wealth goals 
model is defined, which has an equivalent investment 
strategy. The contrast from the approaches comes from 
rebalancing times. The wealth goals approach involves a 
random hold interval, with stopping/rebalancing occur- 
ring when specified upper or lower wealth goals are 
reached. 

The matching of preferences in the two approaches 
does not mean that the same fixed mix strategy is used 
for both models. The rebalance times are different, and 
therefore there are different parameter estimates used in 
the strategy calculation. 

The link between the planning models is stochastic 
dominance. Wealth W, dominates wealth W3 iff Eu(W|) 
is greater than or equal to £Eu(W;) with strict 
inequality for at least one utility function u € U. Orders 
of stochastic dominance are determined by classes of 
utilities. If Uy = (u|(—1Y- i? > 0,j =1,...,k}, so that 
U, D Uy D ..., then U; is the class of monotone utilities, 
Uz the class of concave monotone utilities,..., and 
U,,contains the CRRA utilities. Since the CRRA utilities 
are a limiting subset, they are a reasonable reference class. 

There are alternative formulations of the various orders 
of stochastic dominance. In particular, first order is 
equivalent to dominance in the cumulative distribution 
of wealth. The wealth goals model will be defined on 
the distribution of wealth, and therefore is based on 
first-order dominance principles. However, it is a relaxa- 
tion of first-order dominance in that it focuses on parts of 
the distribution in a bi-criteria problem. 


3.1. Expected utility strategy with fixed rebalance times 


Consider an investor whose objective is to maximize the 
expected utility of wealth at the end of a finite planning 
horizon T. To obtain an explicit solution we assume a 
constant relative risk aversion (CRRA) utility. The utility 
of wealth w is 


p 
w ! 
U(w) =F with B < 1, B40. 
At the current decision time, the parameters in the pricing 
model are estimated and an investment strategy is deter- 
mined, conditional on those estimates. Consider the 
conditional CRRA wealth problem. 


Definition 1: The conditional CRRA wealth problem at 
rebalance time t, with current wealth w,, horizon T and 
estimated parameters &(¢), A(d), is to find the strategy 


X(t) € X, which maximizes 


d] 


where the forecast dynamics of wealth W(t) are given 
by (12). 


Proposition 2: The optimal strategy for the conditional 
CRRA wealth problem is 


X= GA 00-79. — 03 


As B > 0 in definition | the utility is log and the efficient 
strategy is the benchmark optimal growth solution: 
X(r) = ^^ ($1) — re). The fraction f = 1/(1 — B) captures 
the aversion to risk (Pratt 1964). When 0 < £ < 1, the 
positive power utility case, the optimal portfolio invests 
more than the benchmark, with the over-investment 
financed through borrowing (levered strategies). 
Although the strategy X*(0),0 < B < 1, is optimal, it is 
problematic from the perspective of the distribution of 
terminal wealth W(T). Let Fg be the distribution function 
for terminal wealth following strategy X/(r), with Fo the 
terminal wealth distribution following the benchmark 
strategy. Distributions can be compared with the first- 
order stochastic dominance relation: Fg dominates 
Fg, iff Fg (w) < Fg,(w) for w € R with strict inequality 
for some w. 


Proposition 3: The terminal wealth distribution Fo for the 
benchmark portfolio first-order stochastically dominates 
the distribution Fg for 0 < B < 1. 


This result provides a rationale for a restriction B < 0 
to the negative power and log utility functions, and the 
resulting class of fractional Kelly strategies pX(t), where 
p < 1. First-order stochastic dominance is defined on the 
class of all monotone utilities, of which the CRRA utility 
is a small subset. Imposing the first-order dominance con- 
dition on the fraction from a CRRA utility model is a 
restriction, but other work, where a risk constraint such 
as VaR is included in the expected CRRA utility model, 
has shown that levered strategies are excluded (Basak and 
Shapiro 2001). 

The solution in (13) is independent of current wealth w, 
and the planning horizon T. 

The decision is sensitive to estimation and model- 
ing error, and at regular intervals of width s model 
parameters are re-estimated using the empirical Bayes 
methodology. 

The actual investment decision at times 4,/+5,..., 
would change with the new estimates. 


3.2. Wealth goals strategy and random rebalance times 


The uncertainty in returns on investment, whether from 
natural variability, variability in underlying factors, or 
errors in estimation, is the motivation for strategies that 
control risk. In the conditional CRRA utility problem the 
risk associated with the forecast wealth at the horizon is 
controlled with the risk aversion parameter 6, with the 
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estimation risk controlled by regularly updating estimates 
for parameters. Updates are appropriate when there is 
evidence that current estimates are substantially in 
error. This concept of updating based on the gap between 
expected and actual processes suggests another way to 
control the wealth process. That is, to set control limits 
(or wealth goals), as is the practice in process control. 
The wealth goals are action levels so that the investment 
portfolio is rebalanced if either of the goals is reached. 
With wealth w, at time 7, the lower and upper wealth goals 
w, and W, are set, where w, < w, < w,. If the investment 
strategy X(r) is chosen, then the strategy is followed 
until either w, or W, is reached. At that point the portfolio 
is rebalanced. So new estimates are generated, new goals 
are set and a new strategy is chosen. Between rebalancing 
points a fixed mix strategy is followed. Information on 
prices is obtained between rebalance points, but it is 
only used when the accumulated information on actual 
prices signals that the forecasts for price movements are 
substantially in error, that is, when a control limit is 
reached. 

The choice of wealth goals is an important component 
of the control process. Wealth goals can be an expression 
of preferences as naturally as a utility function. Our 
approach is to first consider the form of an investment 
strategy for given wealth goals, assuming the wealth goals 
are feasible, i.e. admit an investment strategy. Then a 
methodology for specifying goals based on expected 
utility is developed. 

The risk and return characteristics of the wealth pro- 
cess have traditionally guided the choice of strategy X(/). 
Within the wealth goals context, therefore: 


(i) the chance of hitting the lower wealth goal should 
be small. That is, the downside risk should be 
controlled; 

(ii) the upper wealth goal, the preferred target, should be 
reached as quickly as possible. That is, the growth 
rate of capital should be maximized. 


To formalize downside risk in the wealth goals 
approach, consider. 

T (X), w,]&(t0), AD) — first passage time to wealth w 
starting with wealth w, at time 1 and following investment 
strategy X(t}, conditioned on the estimates for parameters 
at time t. 

Let the set of wealth goals be denoted by 


B,- {b,1b, = (wp W,), 0 < w, < w, < w). 


For b, € B,, downside risk for a strategy X(/) € X, is 
measured by 


3, XC) 


e PT. (x0. wilet, Aw) STe (xo. wát), &q)]. 


(14) 


The risk is the probability the first passage time to w, is 
smaller than the first passage time to W,. 

Risk measures are associated with acceptance sets of 
capital accumulation paths (Artzner et al. 1999). Let 
Q(X) be the set of all paths of W(z), v > !, and let 


A(X(t)) be the set of acceptable paths. If A'(X(r)) is the 
complement, then the downside risk for X(f) is 
HX) = o(4*(X(0))). That is, the risk for a strategy is 
the probability measure of the unacceptable set of paths. 

Desirable properties of risk measures have been con- 
sidered in the literature. The following basic axioms for 
a downside risk measure are proposed in Breitmeyer et al. 
(1999). 


A: (non-negativity)  $8(X(r)) > 0. 

B: (normalization) — If A°(X()) = ¢, then R(X(d) = 0. 
C: (downside focus) If A*(X,(0) = A'(Xo(0)), then 
R(X 0) = (XDD). 

If A*(X4(0) € AGO), then 
866 0)) x ROD) 


D: (monotonicity) 


Consider the measure in (14). From the growth condition 
defining feasible strategies, X(r) € X,, any path eventually 
reaches W,, and the acceprance set for the measure is 
defined as 


A, (X(I)) = paths of W(1), x > 4, 
which reach Ww, before ever dropping to w,. 


Using this definition of acceptance sets, the risk measure 
defined in (14) satisfies axioms A-D. 

With an appropriate risk measure defined in terms 
of wealth goals, risk can be controlled with a constraint 
on the measure. For control limits ^, € B, and risk 
level 1—y, 0 < y < l, feasible strategies X(r) e X, are 
required to satisfy 9t, (X(1)) < 1 — y. 

The other characteristic of the wealth goals approach is 
the anticipation that the upper limit is reached as quickly 
as possible. With the expected time defined as 


Ga, (XU) = E| T GC lé, &q)]. (15) 


the objective is to minimize G,,(X(1)). Speed in the time 
domain is a growth measure. Suppose the random 
time to a fixed wealth goal Tz, is mapped onto random 
wealth at a fixed time T by Wr = w,exp(T — Ti). Then, 
E{In Wr} = [In iv, + 7] — E(Tz,]. Minimizing the expected 
time to the wealth goal W, is analogous to maximizing the 
expected log of wealth to the appropriate horizon T. The 
log optimal strategy maximizes the rate of growth of 
capital, so (15) is a capital growth criterion. With refer- 
ence to first-order stochastic dominance, the expected log 
of wealth is equivalent to the median wealth, since wealth 
is lognormal. 

With the risk-return characteristics specified by (14) 
and (15), a Growth-Security decision model can be 
defined. Security is the complement of risk, that is, 
Sp (XC) = 1 = Re (XO). 


Definition 2: For given control limits b, € B, and risk 
level 1 —y,0 x y <1, and estimates G(r), A(r), the con- 
ditional Growth-Security strategy X(r) is determined 
from the problem 


Minimize [G, (XIS (X) 2 y, X0) e X]. 06) 
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Explicit expressions for the optimal solution to (16) will 
be developed for the geometric Brownian motion model. 
Consider the wealth process W(r) or > t, with forecast 
dynamics given by (12). Let 


m 


WX) = X x(OGQO — 07 +r, 
i=l 


EXD = 35 x (O87), PAA) = 29,00) — 07 (XD), 


fal 
and 


e) 
e; QX() 


ax) = 


The measures G} (X(1)) and S,(X(0) = 1 — 31, (XC) for 
geometric Brownian motion are given by Karlin and 
Taylor (1981): 


2|n(59,/w,) 


G, (X(r)) = ———— 17 
», (X(t) e) " 7) 

E = c/w] 
S, (XC) = (18) 


[| = Qn /m PO] 


These expressions define growth and security in terms 
of the estimated parameters of the returns distribution 
and the control limits. Therefore, the growth-security 
strategy from (16) can be related to those inputs. 


Proposition 4: Suppose at rebalance time t, the investor 
has wealth w, parameter estimates &(), AM, and the 
return on the risk-free asset is r. If control limits (wealth 
goals) are b, € B, and the risk level is | — y, then the con- 
ditional growth-security efficient strategy at time t is 
XRD = Ga s. ox (OY with 


X$() = p.n bn YAW "(NPD — re), (19) 


where e is a vector of ones. 


The expression in (19) reveals the approach to risk 
control with wealth goals. The component X(r)— 
OO) — re) is the optimal growth portfolio (Merton 
1992), or the Kelly strategy (Kelly 1956) or the benchmark 
portfolio (Basak and Shapiro 2001). The function 
PW, b, y) defines a fraction invested in the risky assets 
that depends on current wealth, wealth goals, and risk 
level. Hence, the strategy which is growth-security effi- 
cient is fractional Kelly, as is the case for CRRA utility. 
An expression for the investment fraction follows from 
proposition 4. 


Corollory 1: For wealth goals b, € B,, the fraction of 
wealth invested in risky assets at the rebalancing time t is 
min{1,p,} for 


2rh, l 
CAO 
(20) 


pyGr, b, y) = hy HAX + | [^ HX)? + 
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where 
z XQ) -r 
no- O0 -l 
e; (X(0) 
[log w, — log w,] 
"flog we, — log w, — log y*]’ 
and y* is the minimum positive root of 


yy! — y (1 — y) 2 0 for 

. [logw, — log w,] 

~ [legis - log w] 

The solutions to the conditional CRRA problem and the 
conditional G-S problem apply to one stage in the multi- 
stage investment process. Although the form of the solu- 
tions is fractional Kelly for both problems, there are 
differences. The CRRA solution is independent of current 
wealth and the planning horizon. The G-S solution 
depends on the current wealth and the horizon through 
the wealth goals. Both solutions depend on the estimates 
for price parameters. 


€ 


3.3. Wealth goals 


The dynamic investment process of figure | is almost 
complete. However, the specification of wealth goals in 
the G-S approach has not been addressed. The purpose in 
rebalancing is to develop a new investment policy when 
the current policy is not working. The method for deter- 
mining goals in this analysis is based on preferences for 
wealth at the planning horizon as defined by the CRRA 
utility. A technique of matched strategies, which replicates 
the wealth preferences at the horizon, provides a fair 
comparison of the accumulated wealth for the contrasting 
models. That is, if expected utility is the standard, then 
performance of the G-S problem relative to that standard 
is studied. It is important to realize that the matching 
technique can be implemented with other standard 
problems. The wealth goals can be important milestones 
without reference to other problems. 

Consider choices for 8 and b, = (w,, w,) which produce 
the same strategy, that is p,Gv,, bp y) = 1/(1 — 8). From 
the definition of p,(w,,6,, y), there are potentially many 
combinations of goals which are linked to the same f. 
Since the expected utility fraction invested in the bench- 
mark portfolio is independent of current wealth, whereas 
the process control fraction depends on the current wealth 
w, and the probability y, the values w, and y will be fixed, 
and the £ > b, link explored. Consider, then, the condi- 
tional £-class of goals, with w, = w, y = y? 


l 
BY(B) = j bilo bn Y) = Qn 


rii 

The set B(8) C B, defines wealth goals with the same 
Arrow-Pratt relative risk aversion given the current 
wealth w and risk probability y°. Proposition 5 estab- 
lishes that such goals actually exist. 


Proposition 5: At the rebalance time t consider 
current wealth TA risk probability y? and risk aversion 
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parameter B. Assume that the benchmark portfolio satisfies 
the growth condition, i.e. 


VO) 1 
OX) Y 
Then BXA) is non-empty for <0. 


Consider the following matching heuristic for setting 
wealth goals at any rebalancing time r: 


1. wt = E(W£" (x +5)) is the expected wealth at time 
t+s with the CRRA strategy 


X'()- X(1). 


] 
a- 
2. we = max{w,|b, e B((8)). 


Setting goals at b? = (wz, wr) maintains the investment 
fraction at p, = 1/(1 — £) for any rebalancing time t, and 
links the goals to the holding time s. 


4. Comparisons 


The accumulated capital from the wealth goals with ran- 
dom rebalancing times, and expected utility with fixed 
rebalancing times is now analysed for the fundamental 
problem of allocating investment capital to stocks, 
bonds and cash over time. The matching heuristic 
approach is used to insure the problems are comparable, 
with the main difference being rebalancing time. 

Daily prices for stocks and bonds are simulated 
using the dynamic pricing model, discretized to days. 
If Pt) is the price for asset ;/ on day z, where i—0 
(cash), 1 (stock), and 2 (bonds), then consider the incre- 
ments eff) = In P+ 1) — In P.(0, £(—0, 1, 2. From the 
model, e(f) = o; --6Z(t), where Z(r) are independent 
Gaussian variables. For cash returns, oy — In(1 +r). It 
is assumed that (œ, œ2) are Gaussian variables dependent 
on a single latent state variable. So o; = p + A;jF,i = 1,2, 
with F standard Gaussian. There is a simple structure to 
model parameters in this single factor case. If the covar- 
iance for risky asset log prices is £, The model implies 
E= AA’ +A, where A = (4,45). and A = diag(67, 83). 
The factor solution is 


A = (JS p01, J002), 
^ = diag((1 — |p|on. (1 


e)» 


1el)o2), 


91192 


If the true covariance is known, then model parameters 
are specified and daily prices can be simulated. 
Conversely, if a sample covariance matrix is determined 
from a sequence of prices then the parameter values can 
be estimated using these equations. The estimates are 
empirical Bayes, based on the methods in section 2. 

For the simulation of asset prices, data on yearly asset 
returns corresponding to the S&P500, Solomon Brothers 
bond index, and U.S. T-bills for 1980-1990 were used 
to generate seed parameters. Statistics on the associated 


Table |. Daily rates of return. 


Stocks Bonds Cash 
Mean 0.00050 0.0003] 0.00019 
Variance 0.00062 0.00035 0 
Covariance 0.00014 
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Figure 2. Daily prices for stocks and bonds. 


daily returns are in table 1. The corresponding statistics 
for log-rates were used as seeds. 

Daily prices were generated for one year (260 trading 
days) and this was repeated 200 times. The one year 
horizon is appropriate for the simulation since it provides 
sufficient time for differentiation of capital accumula- 
tion from the PC and EU strategies. The contrasting 
approaches to selecting an investment strategy were 
applied to each trajectory of prices. Initial parameter esti- 
mates were taken from average daily returns for the prior 
year, 1979. For expected utility, rebalancing (revising 
estimates of price model parameters; calculating new 
strategy) was done every 30 days. For process control, 
rebalancing occurs when control limits are reached. The 
estimation is based on the covariance matrix. To increase 
stability the covariance analysed at the jth rebalancing 
at time / is 

à t l-r 


5 3 


j = 560 Sia. 


As time increases the updating is weighted to the observed 
covariance S. The procedure for setting limits (wealth 
goals) is described in section 4, where the process control 
fraction is maintained at p, = 1/(1 — f). 

The expected utility and wealth goals (process control) 
methods are shown for the trajectory of prices in figure 2. 
This trajectory has a volatile return on stock and is a 
useful example. 

The expected utility strategies and portfolio values for 
the setting | — B = I.I are given in table 2. The rebalanc- 
ing occurred at 30 day intervals. The strategy emphasizes 
stocks and is quite stable. The return on the portfolio at 
the end of the year was 24%. 
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Table 2. Expected utility performance. 


Table 4. Expected wealth ratios. 


Strategy l-y 
Time Stocks Bonds Port value 1-8 0.01 0.02 0.03 0.04 0.05 
1 0.4670 0.1465 1 1415 1.291 1.290 1.289 1.265 1.264 
31 0.7439 —0.1319 0.8943 1.10 1.322 1.321 1.320 1.298 1.293 
61 0.8688 —0.1770 0.9930 1.05 1.365 1.364 1.359 1.340 1.337 
91 0.8963 —0.2292 1.1287 1.00 1.410 1.408 1.403 1.384 1.381 
121 0.9701 —0.4437 1.4089 0.95 1.460 1.459 1.457 1.443 1.433 
151 0.9432 —0.4726 1.3114 0.90 1.523 1.522 1.520 1.507 1.491 
18] 0.9864 —0.4900 1.2164 0.85 1.608 1.608 1.607 1.602 1.584 
211 1.0125 —0.4181 1.1508 
241 0.9820 —0.3989 1.2178 


Table 3. Wealth goals performance. 


Strategy Wealth limits 
Time Stocks Bonds Port value Upper Lower 
1 0.4670 0.1465 1 .0112 0.5765 
64 0.5805 0.1140 1.0148 0281 0.5309 
65 0.6346 0.1010 1.0379 .0526 0.5212 
66 0.6656 0.0910 1.0562 .0717 — 0.5196 
67 0.6744 0.0864 1.0787 .0946. 0.5301 
74 0.6804 0.0898 1.1065 .1229 0.5427 
88 0.7106 0.0671 1.1433 1615 — 0.5385 
107 0.7566  —0.0011 1.1644 .1844 — 0.5231 
108 0.7889  —0.0509 1.1910 34427. 0.5177 
115 0.7995  —0.0936 1.2375 .2603 — 0.5351 
118 | 0.8060  —0.1302 1.2771 .3006 — 0.5514 
120 | 0.8089  —0.1571 1.3387 .3634 0.5779 
123 0.8242  —0.1957 1.3640 .3898 — 0.5798 
130 0.7864  —0.2152 1.4043 .4290 | 0.6234 


The process control methodology was run on the same 
rajectory, with settings 1 — $ = 1.1, 1 — y = 0.03. Those 
results are presented in table 3. There is initially more 
frequent rebalancing with wealth goals, because of the 
volatility. When prices settle down, the fixed mix strategy 
does not change. The return at the end of the year was 
429^. It is noteworthy that the return was monotone and 
he advantage of the PC strategy increases with time. 

The comparison of the expected utility and process 
control approaches is the expected wealth ratio at the 
end of the year. The wealth ratio for each trajectory is 
he accumulated wealth with the process control strategy 
divided by the accumulated wealth with the expected 
utility strategy. 

Table 4 presents mean wealth ratios for values for the 
CRRA utility parameter f, and the process control risk 
probability 1— y. The levered strategies (0<6<1) 
are included, although those strategies are not growth- 
security efficient. 

The wealth goals (process control) approach has a sub- 
stantial advantage over expected utility, demonstrating 
the effect of accounting for current wealth (at the control 
limit) in determining a strategy. Since the PC strategy is 
determined (matched) from the EU strategy, the expected 
wealth advantage of being less risk averse (decreasing 
1 — £) is embedded in the PC strategy, but the PC advan- 
tage over EU grows since the volatility is controlled by 


the upper and lower limits. As the security requirement is 
relaxed (y decreasing) the expected wealth advantage for 
PC decreases. This results from the specific construction 
of the strategy, where the lower control limit drops with 
decreased y and given upper control limit. 

The cases where (1 — f) € 1, the levered strategies, are 
high risk and are stochastically dominated as established 
in Proposition 4. 


5. Conclusion 


This paper considers the risk to accumulated capital in a 
multi-asset investment problem where the distribution of 
asset prices has random parameters. The basis for invest- 
ment strategies is either process control, where wealth is 
controlled by upper and lower goals, or expected utility of 
terminal wealth at the horizon. In the process control 
approach, the portfolio is re-balanced when either wealth 
goal is reached. 

At a re-balancing time the history of asset prices is used 
to update the conditional price distribution. An empirical 
Bayes estimator of distribution parameters is defined 
based on the observed correlation structure of asset 
prices. With the revised price distribution, new upper 
and lower wealth goals or the next re-balance time are 
selected and a new optimal strategy is derived. The pro- 
cess control strategy is growth-security efficient in the 
sense that it has maximum growth among strategies 
with a specified level of risk. Assuming a lognormal dis- 
tribution for asset returns, the growth-security strategy is 
a blend of the risk-free portfolio and the optimal growth 
portfolio. 

The wealth goals approach and the expected utility 
approach with constant relative risk aversion are com- 
pared. An equivalence between the wealth goals and the 
risk aversion parameter is developed. Although the alter- 
native models have the same portfolio solution with the 
appropriate choices of parameter values, the wealth goals 
approach has the advantage of a direct control of down- 
side risk and a re-balancing of the portfolio based on 
significant change in the price process and change in 
wealth. The benefit of intervening when the condition of 
wealth deviates from expectations rather than at regular 
intervals in time is clear. Re-balancing at specified 
points in time when wealth is proceeding as expected is 
analogous to tampering in process control, and typically 
introduces extra risk. 


At: 16:16 5 Occober 200 


Network! 


dge 


esearch Know! 


adian R 


aded By: | 


Downlo: 


Time to Wealth Goals in Capital Accumulation 


269 


Time to wealth goals in capital accumulation 353 


Acknowledgements 


This work was presented at the Bachelier World Congress 
2002, and seminars at The University of British Columbia 
and Dalhousie University. The authors wish to thank 
Yonggan Zhao for assistance with the computations. 
This research was supported by grants from the Natural 
Sciences and Engineering Research Council of Canada 
and the National Center of Competence in Research 
FINRISK, Swiss National Foundation. 


Appendix A 


This appendix contains proofs for the propositions in the 
paper. 

Proposition 2: The optimal strategy for the CRRA 
wealth problem is given by 


x= ph inden - ie). 13) 


Proof: The analysis is similar to theorem 6.2 in Janecek 
(1998). For investment strategy X(/) the forecast 
dynamics of wealth are 
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With X(t) = X(/) and $1) = $1), t > t, based on infor- 
mation to time /, integration from z to T gives the forecast 
wealth at the horizon 
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where Z; ~ N(0, 1). The expected utility of wealth is 
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Consider the expression 
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The first-order conditions 
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reduce to 
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and X*(r) is as stated. 


Proposition 3: The terminal wealth distribution Fo for 
the benchmark portfolio first-order — stochastically 
dominates the distribution Fg for 0< B<1. 


Proof: Let WX) and o(X) be the instantaneous mean 
rate of return and standard deviation, respectively, 
of the optimal growth (benchmark) portfolio. Let 
f — 1/(1 — £) and define 


VO) = vU) — fai an - 0) 4 n,s* 0) = £e? 00 


and 


eU) = WY) -P U). 


Under the assumptions of the model, terminal wealth 
W(rg-T) from the strategy X^ is log-normally 
distributed with 


In((w/w,) — [6/217 
olf) 7 
where N is the Gaussian cumulative distribution. 
Since o*( f) is increasing in f and $(/) is decreasing in f 


for f > | (optimality of benchmark), the statement in the 
proposition follows. 
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Proposition 4: Suppose at rebalance time t, the investor 
has wealth w,, forecast dynamics of asset prices given the 
history (Y,,0 < s x t], and the return on the risk-free asset 
isr. If control limits (wealth goals) are bh, € B, and the risk 
level is | — y, then the growth-security efficient strategy at 
time t is Xj, (0) = Qs (D... Xap, (DY with 


XG) = pw, bn PÂTE) — re), (19) 
where e is a vector of ones. 


Proof: Let w, — w,/k and W, = K^w,, where k > | and 
€ > 0. For simplicity the time variable ¢ is deleted. From 
(16) and (17) the growth-security efficiency problem is 
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Letting y 2 & 9, the constraint requires yyt! — y+ 
(1 — y) 2 0. Consider the minimum positive root y* of 
the equation yy^*! — y E (1— y) 20, where y* < ] since 
y*=1 is a root. Then yy/*! — y  (1— y) > 0 for y < y* 
and the constraint is satisfied if & ^ x y or 


logy o a 
ax > - (2E) =q" > 0, 


where q* depends upon w, „W, and y. But ((X)» q* 
if 29(X)—(1—9o?(X) » 0. Hence, the efficiency 
problem is 
max [24(4) — PIY 
—(1 +9°)o7(X) > 0, X feasible]. 


If X* is a solution to this problem then it also solves, for 
optimal multiplier A*, the Lagrangian problem 


max L(X, *), 
D d 
where 


L(X,X*) = 2900 — 0° (4) 4 X DOO = (0 4 46700]. 
The first-order conditions V.L(X*, 4*) = 0 imply that the 
optimal X* satisfies the linear system 

2( EAG: — 7) — AL A*( + gps] = 0, 
PE 


The solution to this system is X* = p(w, w, W, y)x 
A^ (ó(r) — re) where 
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Corollory 1: For wealth goals b, € B,, the fraction of 
wealth invested in risky assets at the rebalancing time t is 
min(l, p for 
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and y* is the minimum positive root of yy^! 


Q- y) = 0 for 


-y+ 


. flog i, — log w;] 


“= fogw, — logw;]: 
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if A*=0, i.e. when the constraint is not binding. If the 
security constraint is binding then X* is a solution to 
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K = y", Let Y = A^ (ó — re) and X* = pX. Then the 
equation becomes 


* 


[( BEP ee — [u(X) - r]p - 2r =0. 


The statement in the corollary follows. 


Proposition 5: At the rebalance time t consider current 
wealth w, risk probability y? and risk aversion param- 
eter B. Assume that the benchmark portfolio satisfies the 
growth condition, i.e. 


v) T 
oio» 2 
It follows that BB) is non-empty for B<0 


Proof: From corollary 1, pwa bj, y) 2 1/(1 — 8) implies 


d 
2r(1 — BP + 2HQOo? (O1 — 8) 


> 0. 


Ty 


Since 
log w, — log w, 
log w, — log w, — log y*' 


Riv, Ww) = 


any (w, W,) € B'(B) must satisfy the equation /i(w,, W,) = 
K. From the assumption £<0 and y/o? > 1/2 it 
follows that 0 < K <1. The solution y* of the equation 
yy —y4(1—y)20 has the property y* 41 as 
w, 4 w, for fixed w? < w,. So there exists (ww) such 
that A(w9,w?) > K. Also for fixed W7, Aw, W4) | 0 as 
w, 7 w, That is, there is aw} with 4(w7,w7) = K. 
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Abstract 


This chapter gives an overview of current research in evolutionary finance. We 
mainly focus on the survival and stability properties of investment strategies 
associated with the Kelly rule. Our approach to the study of the wealth dynamics 
of investment strategies is inspired by Darwinian ideas on selection and mutation. 
The goal of this research is to develop an evolutionary framework for practical 
investment advice. 


1 Introduction 


The principal objective of our evolutionary approach to the study of financial market 
dynamics is the development and analysis of models that constitute a plausible 
alternative to the conventional general equilibrium framework. Our aim is to provide 
a framework that is suitable for delivering practical investment advice. 

The equilibrium concept most commonly used in financial economics is due to 
Radner (1972). This equilibrium notion involves the plans and price expectations 
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of agents as well as market prices. A well-known drawback of that framework is 
the necessity of agents to have perfect foresight (rational expectations) in order 
to establish an equilibrium (see the discussion in Laffont (1989) and Dubey et al. 
(1987)) market participants have to agree on the future prices for each of the possible 
future realizations of the states of the world (without knowing which particular state 
will be realized). 

Our evolutionary model differs radically from that approach: only historical 
observations and the current state of the world influence the agents' behavior; no 
agreement about the future market structure is required and no coordinated actions 
by the agents are assumed. From a practical finance perspective, the important dis- 
tinction between our approach and the conventional general equilibrium paradigm 
lies in the data required to formulate the model. The evolutionary model does 
not use agents’ characteristics that are unobservable (such as individual utilities 
or subjective beliefs). We describe aims of investors in terms of properties (sur- 
vival, evolutionary stability, etc.) holding almost surely, rather than in terms of 
the maximization of expected utilities. We consider this robust modeling approach 
as the basis for developing a new generation of dynamic equilibrium models that 
could be used for practical quantitative recommendations applicable in the financial 
industry. 

'The general approach underlying this direction of work is to apply evolutionary 
dynamics — mutation and selection — to the analysis of the long-run performance of 
investment strategies. À stock market is understood as a heterogeneous population 
of frequently interacting portfolio rules in competition for market capital. The 
ultimate goal is to build à Darwinian theory of portfolio selection. Evolutionary 
ideas have a long history in the social sciences going back to Malthus, who played 
an inspirational role for Darwin (for à review of the subject see, e.g., Hodgeson 
(1993). A more recent stage of development of these ideas began in the 1950s with 
the publications of Alchian (1950) and others. An important role in this line of 
work has been played by the interdisciplinary research conducted in the 1980s and 
1990s under the auspices of the Santa Fe Institute in New Mexico, USA — see, 
e.g., Arthur et al. (1997), Farmer and Lo (1999), LeBaron et al. (1999), Blume and 
Easley (1992), and Blume and Durlauf (2005). 

Our model also revives the literature on stochastic dynamic games going back 
to Shapley (1953). The framework also shares some conceptual features with the 
games of survival pioneered by Milnor and Shapley (1957) and the dynamic market 
game in Shubik and Whitt (1973). 'The main difference is that we use the notion 
of survival strategies rather than that of Nash equilibrium. In a finance context, 
the focus on survival is an advantage (over those approaches invoking expectations) 
because it is à property holding almost surely and not requiring discounted or 
undiscounted utility. 

This survey is based on the research carried out by Amir et al. (2005, 2008, 
2009), Evstigneev et al. (2002, 2006, 2008) and Hens and Schenk-Hoppé (2005). 
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A general survey of evolutionary finance is provided by Evstigneev et al. (2009). 
The issue of survival of traders in Radner's setting has been studied by Blume and 
Easley (1992); see also the surveys Blume and Easley (2008, 2009). 

Section 2 explains in details the model, Section 3 discusses its dynamics and 
defines the concepts of survival and evolutionary stability, Section 4 surveys the 
results obtained in the literature, and Section 5 concludes. 


2 Model 


Consider a market in which K > 2 assets are traded. Each asset k = 1,2,..., K pays 
dividends at dates t = 1,2, .... The non-negative payoff D, ;,(s’) > 0 depends on the 
history s* = (s1, ..., S+) of states of the world up to date t. The states are random 
factors modeled in terms of an exogenous stochastic process $1, $2,..., where s; is a 
random element of a measurable space S;. We assume functions to be measurable 
throughout the following. At each date (and in each random situation), at least one 
asset is assumed to pay a strictly positive dividend: 


K 
> D, k(s*) > 0 for all t, sf. (1) 
k=1 


The total net supply of asset k is equal to the random amount V; (s') > 0 
(a constant Vo, > O at the initial time t = 0). The vector of market prices 
is pe = (Pii, -o Pek) € RX, where pk is the price per unit. We assume 
Vi k(s*)/V; 1a (s* +) > y > 0 for t > 0 and all s*. This condition is satisfied, e.g., 
if the asset supply is constant or increasing; the supply however cannot decrease at 
an ever-increasing rate. 

There are N > 2 investors, or traders, acting in the market. A portfolio of 
investor i at date t = 0, 1, ... is specified by a vector 6} = (014,...,0; x) € RẸ where 
0, is the number of units of asset k held in the portfolio. The value of investor 
i's portfolio at date t is (pj, 01) = Ya Pirhi p The state of the market at each 
date t is characterized by the set (p, 01, ..., 07") consisting of the price vector and 
all traders’ portfolios. 

Investors ? = 1,2,...,N enter the market at date t = 0 with endowments 
ws > 0 that form their initial budgets. At date t > 1, investor i's budget is 
(Dilst) + pr, 0$. .,), where Di(s*) = (Dii(s*),..., Di (sf)). This budget consists of 
two components linked to the portfolio 0$ ,: the dividends (D,(s*),0i ,) received 
and the market value (p;,05 ,) expressed in terms of the current prices ps. 

A fraction « of the budget is invested in assets. We will assume that the invest- 
ment rate « € (0,1) is a fixed number, the same for all the traders. The remaining 
amount of wealth is withdrawn from the market through a taz rate 1 — « (or, if 
you dislike taxes, through a consumption rate 1 — a). The assumption of a common 
1 — « for all the investors is quite natural when interpreted as à tax rate. As a 
consumption rate it might seem a restrictive condition, however, it is indispensable 
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since we focus on the analysis of the comparative performance of trading strategies 
in the long run. Without this assumption, an analysis of this kind does not make 
sense: a seemingly worse long-run performance of an investment strategy might be 
simply due to à higher consumption rate. 

For each t > 0, every trader 4 = 1,2,..., N selects a vector of portfolio weights 
Aj = Op oss M) according to which he/she plans to distribute the available bud- 
get between assets. Vectors Ai belong to the unit simplex AF = ((a1,..., ak) 2 0: 
a, +... + ax = 1). In game-theoretic terms, the vectors àf represent the investors’ 
actions. The portfolio weights at each date t > 0 are selected by the N investors si- 
multaneously and independently—the investors participate in a simultaneous-move 
N-person dynamic game. 

The investors’ actions can depend (for t > 1) on the history st of the states of 
the world and the history of the game (p'-1,0*-1, M-1), where p'^! = (po, ..., pi-1) 
is the history of asset price vectors and 


01 = (89,61, 011), 0 = (01,01) 
AS. = (Ao, À1, ey Mi), A = (AL, ian) 


are the sets of vectors describing the portfolios and the portfolio weights of all the 
traders at all the dates up to t — 1. 

'The history of the game contains information about the market history — the 
sequence (po, ĝo), ..., (p 1,04 1) of the states of the market — and about the actions 
Ai of all the investors i = 1,..., N at all the dates | = 0,...,t— 1. A vector Aj c AX 
and a sequence of measurable functions with values in AF 


Aus pt1, 041, eb EET. 


form an investment strategy A’ of trader i, specifying a portfolio rule according 
to which trader i selects portfolio weights at each date t > 0. This is a general 
game-theoretic definition of a strategy, assuming full information about the history 
of the game, including the players’ previous actions, and the knowledge of all the 
past and present states of the world. Among general investment strategies, we will 
distinguish those for which At depends only on st and not on the market history. 
We will call such strategies basic. Note that basic strategies are in general not 
simple (constant). 

To complete the description of the market, it remains to define a dynamic equi- 
librium for the assets. Suppose that at date 0, each investor 7 has selected some 
portfolio weights A = (6.5, M) € AF. Then the amount invested in asset k 
by trader i is oA), , wj and the total amount invested in asset k is a 375. A5 u$. It 
is assumed that the market is always in equilibrium (asset supply is equal to asset 
demand), which makes it possible to determine the equilibrium price po,, of each 
asset k from the equation 

N 
Po,k Vok =Q M Ab. Ub (2) 


i—l 
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The left-hand side is the total market value po, Vo, of the supply of asset k while the 

right-hand side represents the total wealth invested in asset k by all the investors. 

Equilibrium ensures that both sides are equal, i.e., (2) holds. The portfolio weights 
$ chosen by the traders at date 0 determine their portfolios 64 at date 0 by 


jr DET EEES (3) 


i.e., the current market value po,,6% ,, of the kth position of investor ?'s portfolio is 
equal to the fraction AÀ , of the investor's investment budget aw}. 

Consider now the situation at any date t > 1. Suppose all investors have chosen 
their portfolio weights Aj = (Aj ),...,A¢ y). Then the equilibrium of asset supply 
and demand determines the market-clearing prices of assets k = 1,...,K through 
the relation 

N 
PukVek =Q > NL (Dis) + po 604) (4) 
i=l 
The investment budgets a(D+(s*) + pe, 01 ,) of the traders i = 1,2, ..., N are dis- 
tributed between assets in the proportions Ai,, so that the kth position of the 
trader ?'s portfolio 01 is l 


adi p(Dels*) + pu Oi) (5) 
Pt,k 


The price vector p, is determined implicitly as the solution to the system of 
equations (4). The existence and uniqueness of a non-negative vector p+ solving (4) 
(for any s* and any feasible 0? , and A‘) is proved in Amir et al. (2009, Proposition 
1). 

For a given strategy profile (A!,..., AV) of investors and their endowments 
UT yess wi, , à path of the market game is generated by setting 


do = AG (6) 


i 
tk = 


X = A(S, pt, 01, x71) (7) 
for all t = 1,2,..., and i = 1,..., N; and by defining p, and 02 recursively according 
to equations (2)-(5). The random dynamical system (Arnold 1998) described above 
defines step by step the vectors of portfolio weights \}(s*), the equilibrium prices 
pi(s*) and the investors’ portfolios 02(s*) as measurable vector functions of s* for 
each moment of time t > 0 (for t = 0 these vectors are constant). Thus, we obtain 
8 random path of the game 


(pots yea (s). cs ON UA. AT (5) (8) 
as a vector stochastic process in RK x REN x REN. 
The above description is correct only if the price of each asset is strictly positive. 


Strategy profiles which guarantee this property at each period will be called admis- 
sible. Admissibility is satisfied, e.g., if the strategy profile contains an investor who 
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uses a strategy with strictly positive portfolio weights. We will deal only with such 
strategy profiles from now on and, therefore, have well-definedness of the random 
dynamical system. Then (using induction) the equilibrium path all the portfolios 
0, = (011... 0; g) are non-zero and the wealth 


wy = (Dit po 813) (9) 


is strictly positive. 

Equation (5) implies ar Oi y = V; k, i.e., market clearing for every asset k and 
each date t > 1 (analogously for t = 0). For every equilibrium state of the market 
one therefore has that markets clear, p; > 0 and 6% 4 0 for all i. 

The description of a financial market is game-theoretic: the market dynamics is 
formulated as a simultaneous-move N-person dynamic game. Less general versions 
of the model can be motivated by invoking Marshall’s (1949) principle of temporary 
equilibrium (Evstigneev et al., 2008) or evolutionary dynamics (Evstigneev et al., 
2006). 


3 Dynamics and Stability 


'This section provides details on the dynamics of the model and defines notions of 
survival and evolutionary stability. An explicit formulation of the dynamics for the 
investors' wealth as well as their relative wealth is given. 


3.1 Dynamics and the role of basic strategies 


Let (A!,..., AF) be an admissible strategy profile of the investors. Consider the 
path (8) of the random dynamical system generated by this strategy profile and the 
given initial budgets. Let wi > 0 denote the investor i's wealth at date t > 0. For 
t > 1, wt = wi(s') is given by formula (9) while each investor i’s endowment, wj, 
is a constant. 

When studying the dynamics of this wealth process for a fized strategy profile, it 
is sufficient to consider the class of basic strategies, i.e., those strategies for which A? 
depends only on the history of states until time t, st, and not on the market history. 
This assertion holds because any sequence of vectors w; = (w},...,w) of wealth 
generated by some strategy profile (A!,..., AV) can be generated by a strategy 
profile (A1 (s), ..., AN (s*)) consisting of basic portfolio rules. The corresponding 
vector functions At (st) can be defined recursively by (6) and (7), using (2)-(5). 

'l'his observation is very useful when studying the dynamic properties (such 
as survival or evolutionary stability) for a basic strategy — as we will do here. 
However this ‘reduction’ of the model to basic strategies has its limitations when 
dealing with the characteristics of a general (non-basic) strategy which competes in 
markets with different opponents. Such a strategy will in general take on different 
basic strategies, dependent on the pool of competitors — it is therefore not basic. 


Survival and Evolutionary Stability of the Kelly Rule 279 


The procedure described above gives the system of equations 


K 


1 
Win = » les Quae Ure) + Desi (ses) | Ve 
coi lk Vitae 


A pwi 
(Atk Ui) 


(10) 


i — 1,..., N. This dynamic on the space {w € R | w > 0 and w £ 0} is well-defined 
under the assumptions imposed in Section 2 (e.g., Amir et al. (2009) or Section 4.1 
in Evstigneev et al. (2008) which applies with minor modifications). 

Our analysis will focus on the long-run behavior of the relative wealth or the 
market shares ri = wi/ Xa wl of the traders. Both the stability and survival of 
portfolio rules will be studied in this framework. An explicit random dynamical 
system can be derived for the vector r; as follows. 

Assume that the asset supply changes over time at the same rate y > 0: 


Via = Ve (11) 


where V; > 0 (k = 1, 2,..., K} are the initial amounts of the assets. In the case of real 
dividend-paying assets — involving long-term investments in the real economy (e.g., 
real estate, transportation, media, infrastructure, etc.) — the above assumption 
means that the economic system under consideration is on a balanced growth path. 
Define the relative dividends of the assets k = 1,..., K by Ry = (Fa,..., Rok) where 


D, (sf*)Vi 

Rex = Ras!) = c YE (12) 

5 Di mhs) Vm 

m=1 
for t > 1. Further, assume that a < y and define 
p=a/y, and p, = p (1 — p) 
One finds the dynamic for the vector of market shares: 
i xK Ai pri 
ria = 3 [PAm r) + (1 D Rel ri) (13) 
k=1 aa 


with 4 = 1,..., N. This equation can be written in explicit form (e.g., Evstigneev et 
al., 2009): 


Ttp] = (1 — p) [Id — POr] Oi Ris (14) 


where AT = ATi es MEK) € RN*K is the matrix of portfolio weights and ©; € 
RN*F is the matrix of portfolios given by O7, = Aj ,ri/ Que, Te). 

Setting p = 0 in (14) one obtains a model with short-lived assets akin to 
parimutuel betting markets, see e.g. Evstigneev et al. (2009). 
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3.2 Survival and extinction of portfolio rules 


We study the dynamics of wealth shares from an evolutionary perspective. Our 
focus is on the questions of “survival and extinction” of portfolio rules. 

A portfolio rule \* = (Ai(s*)) (or the investor ? using it) survives with probability 
one if infysort > 0 almost surely. This ineans that for ahnost all realizations of 
the process of states of the world s1, s2,..., the market share of the first investor is 
bounded away from zero by a strictly positive random constant. We say that \* 
becomes extinct with probability one if lim:...7r} = 0 almost surely. Survival and 
extinction can be defined for general non-basic investment strategies without any 
changes. An investment strategy A is called a survival strategy if the investor using 
it survives with probability one regardless of what other strategies are present in the 
market. In terms of the wealth process wj, i = 1,2,..., N, no investor’s wealth can 
grow asymptotically faster than the wealth of investors who use survival strategies. 
In this sense, all survival strategies are competitive. 

A portfolio rule À = (A;(s*)) is called globally evolutionarily stable if the following 
condition holds. Suppose, in a group of investors į = 1,2,..., J (1 < J < N), all use 
the portfolio rule A, while all the others, i = J--1,..., N use portfolio rules À distinct 
from à. Then those investors who belong to the former group (i = 1, ..., J) survive 
with probability one, whereas those who belong to the latter (i = J + 1,..., N) 
become extinct with probability one (cf. Evstigneev et aL, 2008)). An analogous 
concept of local evolutionary stability can be defined. In that definition of stability, 
the initial market share p *! E... rd! of the group of investors who use strategies 
X! distinct from A is supposed to be small enough (cf. Evstigneev et al., 2006). 


4 Results 


The findings on the long-run outcome of the dynamics of the above game fall in 
two categories, survival and evolutionary stability. The study of the first is carried 
out in the general model while the latter requires placing restrictions on the set 
of admissible strategies. This is simply caused by the extreme generality of the 
asset market game introduced above. For instance, if one strategy is constant while 
some other strategy always imitates the decision made in the previous period, both 
end up with the same portfolio weights. However, it can be shown that mimicking 
strategies are not survival strategies. 

The central role in our analysis is played by a (generalized) Kelly rule. Define 
the investment strategy A* with the vectors of portfolio weights A7(s*) by 


oo 
Aik = E, S pios (15) 


l=] 


where E,(-) = E(-|s*) is the conditional expectation given s* (unconditional expec- 
tation E(-) if t = 0). 
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The portfolio rule specified by (15) prescribes to distribute wealth across assets 
in accordance with the proportions of the expected flow of their discounted future 
relative dividends. The discount rate p,,1/pi = p is equal to the investment rate a 
divided by the growth rate y. 

The portfolio weights of the strategy A* generally depend on time t and the 
history of states of the world s', but do not depend on the history of the game 
(p*1,0* 1, At!) — the strategy is basic. The strategy A* is a generalization of 
the Kelly portfolio rule of ‘betting your beliefs’ playing an important role in capital 
growth theory — see Kelly (1956), Breiman (1961), Thorp (1971), Algoet and Cover 
(1988), and Hakansson and Ziemba (1995). 


4.1 Survival of the Kelly rule 


This section collects the results on the survival of the Kelly rule in different speci- 
fications of the model. Survival of the Kelly rule is ensured for the case of gencral 
investment strategies in both stock and betting markets. 

Amir et al. (2009, Theorem 1) shows that the Kelly rule (15) is a survival 
strategy. The only additional assumption needed for this result is that for all k and 
t, Ey Ri41.4(st41) > ô almost surely for some constant 6 > 0. This results extends 
the findings in Amir et al. (2008) to the stock market case (long-lived dividend- 
paying assets, p > 0). In Amir et al. (2008), betting markets (p = 0) are studied 
in which the assets are short-lived and are reissued at each date. The Kelly rule in 
this setting is obtained from (15) by using the discount rates pı = 1 and p; = 0 for 
i> 1. One has 


tk Eau (16) 


for k = 1,..., K. Under the assumption E In EF: Ri41,4(s°t!) > —oo (which ensures 
strict positivity of all A7 ,), Amir et al. (2008, Theorem 1) asserts that A7 ,, defined 
in (16) is à survival strategy. The existence of a non-basic survival strategy is an 
open problem. 

In both cases, the Kelly rule is asymptotic unique among all basic survival 
strategies. One has that 57% ||[Až — A||? < oo almost surely for any basic sur- 
vival strategy A = (A1) (see Amir et al. (2008, Theorem 2) and Amir et al. (2009, 
Theorem 2)). 


4.2 Evolutionary stability of the Kelly rule 


This section reports on properties of the Kelly rule that are stronger than ‘merely’ 
ensuring survival. These features however only surface when restricting the set 
of admissible strategies (as well as the type of randomness driving the dividend 
process). The main results are that evolutionary stability with iid asset payments 
holds when the set of admissible strategies is restricted to constant strategies; and 
that local evolutionary stability hold is strategies and states are Markov. These 
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theoretical findings are complemented by simulation studies of markets in which 
the pool of investment strategies changes over time through a process of mutation. 

Assume that there are finitely many states of the world s € S, states 31,32, ... 
are independent identically distributed (iid) with P(s, = s) > 0 for each s € S and 
that the relative dividends R; (sf) = Ri(s:) depend only on the current state s; 
and do not explicitly depend on t. In addition to (1), we assume that ER (s,) > 0 
for k = 1,..., K. Under these conditions, (15) takes the special form 


LE S AER) = ERx(s1) (17) 
which means that the strategy A* is formed by the sequence of constant vectors 
(ER (sj), ..., ERk(s;)) (independent of t and s*). Note that in this special case, 
the formula for A* does not involve the investment rate p. In this case, the ‘beliefs’ 
at each date t are concerned simply with the expected relative dividends (which 
do not depend on t). We call an investment strategy A simple, or fired-miz, if 
all the portfolio weights are constant. The investment strategy (17) is simple and 
completely mixed (i.e., all components are strictly positive). 

Suppose there are no redundant assets, i.e., the functions F1(s),..., Rg (s) are 
linearly independent. Evstigneev et al. (2008), Theorem 1, shows that the Kelly 
rule defined in (17) is globally evolutionarily stable in any market in which all the 
portfolio rules are simple. This result demonstrates that the Kelly rule A* has 
good properties beside ensuring survival: the group of investors following this rule 
outperform all other simple strategies and dominate the market. These investors 
eventually gather the total market wealth, while those who use simple strategies 
distinct from A* become extinct. The related result for short-lived assets (p = 0) is 
proved in Evstigneev et al. (2002) and Amir et al. (2005). The latter deals with a 
more general setting in which the state follows a Markov process and the relative 
asset payoffs at time {+ 1 depend on s; and $444. 

Results on the local evolutionary stability of the Kelly rule are obtained in 
Hens and Schenk-Hoppé (2005) (dealing with short-lived assets) and Evstigneev et 
al. (2006) (considering long-lived assets and basic Markovian strategies). The Kelly 
rule is the only portfolio rule that is locally evolutionary stable — and, thus, the 
only candidate for a global evolutionary stable strategy. The advantage of consider- 
ing the local dynamics is that stability can be characterized by growth rates (of the 
linearized dynamics) that do have explicit representations. It is therefore straight- 
forward to verify whether particular portfolio rules co-exist or whether selection can 
occur through extinction of some portfolio rules. 

In a strict sense, all of the above papers deal with the issue of selection rather 
than the process of mutation that creates novel behavior. The question whether the 
Kelly rule emerges in à market where traders adapt their strategies is investigated 
in Lensberg and Schenk-Hoppé (2007). They combine the evolutionary model with 
traders whose decisions are made by genetic programs. In that setting, the strategies 
and the dynamics of wealth shares co-evolve in a process of market interaction 
and tournament selection of strategies (in which poor strategies are replaced by 
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novel rules). Their numerical analysis shows that the Kelly rule evolves in the 
long-run but prices converge much faster than the individual strategies to those 
prescribed by the Kelly rule. Successful traders, interestingly, invest according to a 
fractional Kelly rule rather than its pure form. This approach drastically reduces 
the volatility of investment returns, which helps to avoid deletion by the tournament 
selection process. The optimality properties of the fractional Kelly rule for practical 
investment are discussed in MacLean et al. (1992). 


5 Conclusion 


This survey presents in detail an evolutionary model of financial markets. The 
model is described as a simultaneous-move N-person dynamic game. Restricting 
the space of strategies to those depending only on the history of states (rather than 
the entire information consisting of the market history and revealed strategies of 
competing traders), one obtains the earlier, behavioral models evolutionary finance. 
The main findings of the survival and evolutionary stability properties of investment 
strategies are discussed. It turns out that the Kelly rule (appropriately defined to fit 
the framework under consideration) ensures the survival of the traders following this 
portfolio rule. The Kelly rule brings the additional benefit of (global) evolutionary 
stability. 
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Abstract. In this paper, we study the Kelly criterion in the continuous time 
framework building on the work of E.O. Thorp and others. The existence of an 
optimal strategy is proven in a general setting and the corresponding optimal 
wealth process is found. A simple formula is provided for calculating the 
optimal portfolio for a set of price processes satisfying some simple conditions. 
Properties of the optimal investment strategy for assets governed by multiple 
Ornstein-Uhlenbeck processes are studied. The paper ends with a short 
discussion of the implications of these ideas for financial markets. 


Keywords: utility function; optimal investment strategy; self-financing; 
complete market; risk-neutral measure; Brownian motion; Ornstein-Uhlenbeck. 


1 Introduction 


The Kelly Criterion [1], [2] was initially introduced in 1956 to find the optimal 
betting amount in games with fixed known odds, and was later extended to the field 
of financial investments by E. O. Thorp and others. The strategy maximizes the 
entropy and with probability one outperforms any other strategy asymptotically [3]. 
This approach was recently further developed by Kargin [4], who applied the criterion 
to a mean-reverting asset process under liquidity and credit constraints. 

The Kelly Criterion tells us that the optimal betting fraction is given by p-g, if a 
gambler is faced with a bet, where the probability to double the money is p and to lose 
the initial stake is q (p>q). The optimal betting fraction maximizes the expected log 
wealth. The question, why investors should choose to maximize the log wealth, has a 
simple answer: according to Breiman’s theorem [3], it gives the asymptotically 
optimal pay-out and dominates any other strategy. 

In this paper, we start by extending the original idea to the general continuous time 
framework with n correlated assets. Our task is to find the optimal self-financing 
trading strategy. We will prove that if the market is complete, this optimal self- 
financing trading strategy always exists. A limited number of applications are 
discussed in the context of Ornstein-Uhlenbeck processes. 

The paper is organized as follows. In section 2 we review the standard assumptions 
and prove the optimization theorem. The theorem covers both the existence of the 
optimal trading strategy and the explicit form of the associated optimal investment 
fraction. In section 3 we apply the theory to a market of n correlated assets given by 
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Ornstein-Uhlenbeck mean-reverting processes. The optimal investment strategy is 
calculated for some representative examples. In the last section we e put the results into 
the financial context and describe some open problems. 


2 General Theory 


In the first section, we will cover the assumptions and the theoretical framework. In 
2.2 we will provide the main result, which is contained in Theorem 1. 


2.1 Basic Assumptions and other Preliminaries 


We will use the standard notation and conventions of financial mathematics. In 
section 2 the basic assumption is that the market is complete and frictionless [5]. Let 
us further assume that we consider all the processes in the finite time interval 0 to the 


terminal time T. There exists a probability space(Q, P,,.97), on which all of the 
random variables are constructed, where Q is the sample space,.% is ac -algebra 
which denotes the information accumulated up to time T and P, is the spot probability 
measure [5]. The filtration Z ,t € [0. T], represents the information accumulated up 
to time t. The sub-probability space (Q, P,,.%) is introduced at time t, where P, is 


the restriction of P, on the filtration Z . 


We assume there are n+linvestable assets in the market including the wealth 
process B,, representing a saving account with value 1 at the initial time 0. We 


assume B, follows 

dB, -Brdt, (1) 
where z is the short term rate at time t. 
The other n assets in the market are denoted by S; (1)! €[0,7],i=1,2,...,2, and we 
define a n-dimensional vector by S, = (5, (7). S, (1)... S, (o) , where’ ` "represents 


the transposition of a matrix. Let us define the relative assets price process 
by S, - S, B. . Let g, (1) denotes the number of units of B, an investor holds at time t 


and d (¢),/e[0,7],i=1,2,...,2, denotes the number of units of the /^ asset an 


investor holds at time ¢. In addition, the n-dimensional vector ¢ is defined 


as, - (& (t).6, (1).....6, ()) V, (y) is the total value of the portfolio v, = (d, (4),¢,). 


So we have 
V (w)- (2) 8, *6.8, , Q) 
where ø - S, is the inner product of two vectors. 
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Definition 1 A self-financing trading strategy y, - (d (0).4.) is a strategy that 


satisfies: 
aV, (v) - & (r)dB,*4 -dS vte[0,7] . (3) 
We assume V, (v) 21. 
Definition 2 A self-financing trading strategy y, ^ (o, (f),6,) is said to be 
admissible if and only if 
V(w)20, P, as. Were[0,7] . (4) 
U (x) ,x 20, is defined to be a concave function representing the utility of wealth. 
Here concaveness means 
U ((1- p)x, + px) 2 (17 p)U (x) e pU (x;) ,Vx,2x 20and0<p<l . (5) 
Further it is assumed that U (x) has a first order derivative for Vx e (0,+00). The first 


order derivative at x 2 0 can be either finite or infinite, and the first order derivative 
of U(x) , x20, is a strictly decreasing function of x with lim U'(x)-0 . 


IfU' (0) — +00, let (x) , x 2 0, be the inverse function ofU'(x) with 7 (0) =+% and 
I(*«) 20. For U'(0)- 520, we denote by 7, (x), x e [0,5] , the inverse function 
ofU' (x) ,with 7 (0) = +00 . In this case, we define / (x) as 


of (x) xe [0,6] (6) 


0 , xe(b, +0) 


Let us denote by £ the class of all of the admissible self-financing trading strategies. 
We say a self-financing trading strategy y` € 2 is the optimal trading strategy, if and 
only if 


E [u(i a [Un] vveg . (7) 


Our task is to find an optimal y* € £ , which satisfies eq.7. 


2.2 The Optimal Strategy 
To find the optimal strategy, we will first need to introduce the following lemma.. 


Lemma] The function I(x), x € [0,+0) satisfies the following inequality: 


U (1(»)) - »1 (y) U (e) - ve Vy,c €[0,+00) . (8) 
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Proof: 
If/(y) 2 c, then eq.8 is obviously satisfied. If /(y) » c , then the average growth rate 


of U(x) from c to /(y) should be larger than the first order derivative 
ofU(x)at/(y), which is given by U'(1(y)) , since the first order derivative 
ofU (x) is a strictly decreasing function of x. 


The average growth rate of U (x) from to / ( y)is 
Uu (1(y))-U (c) 
I(y)-c 


This yields the following inequality 


=> U 1 (y))-»t(v)2 U(c)-ye . 
An almost identical argument can be applied in the case / (y) «c. CJ 


- dP 
Let us define P,as the martingale of the market and Z, UPC Then (Z,, Z) is 


i 
a P, martingale [5]. Define y = y,Z7' Bj , V? = I(r) , where we assume y, is a 
deterministic function of ¢ and is defined in such a way that V' =V, B7" is 


a P. martingale. So y, solves the equation 
E B'i(yz?B)]ei. (9) 


The deterministic property of y,seems contrived, but is necessary for the proof of 


Proposition 1. As we shall see in section 3, in the case of the log utility function the 
deterministic function y, indeed exists and is a constant. 


Proposition! — /; =/ (m; ) satisfies the inequality given by eq.7. 


Proof: 
Let, (w), Vw € £ , be the wealth process corresponding to a special trading 


strategy y , then 
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According to lemma 1, the first term of the right hand side of eq.10 is positive. The 
second term is equal to zero, 


E b (8 -v Q)]  £[ zi Q2 27 (/))] 2» [Iz Fe ()]-0 . am 


The last equality of the above equation is deduced from the fact that both’ and 


t 
V (v) are martingales under the martingale measure. 


Combining eq.10 and eq.11, we directly get 
E, [U(97) 2 E [Us (v))] 


Ll 
Proposition 1 only state the fact that / (n; ) satisfles eq.7. It doesn't necessarily mean 
that / (5; ) is the optimal wealth process. We will prove in the following theorem 


that / (n; ) is in fact the optimal wealth process. 


Theorem 1 Given a concave utility function U (x), there exists an optimal self- 
financing trading strategy w' , such that for each time te[0,7 |. the wealth 


process V, ( y` ) of this strategy satisfies 


E, [Ul (v'))| 2E [uu] vwee. 
And the optimal wealth process is given by: V, (v) =] (n; ). te [0, T] . 
Proof: 
Define V; to be B I(nj). For 1=7, we have V; - B,V^ - Hm) , which 


represents a general contingent claim in the market. Since the market is complete, the 
contingent claim V; is attainable. This means there exists a self-financing trading 


strategy y/ such that. =V, (v ) B', where V, (v`) is the wealth process of this self- 
financing trading strategy. So the relative wealth process V, (y^) V, (^ ) Br" is a 
martingale under the martingale measure P, . V'is also a martingale under the 


martingale measure P, , and we have 
y = BEV; 12] -BE 7 (w) F]=¥,(v'), welo7] . a2 


Eq.12 shows that y’ is a self-financing trading strategy and replicates the optimal 
wealth process”, = Bj (n). From the combination of V, , satisfying eq.7 for any 
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time ¢ before 7, and eq.12, we can see that V, (v^) also satisfies eq.7. This proves that 


the strategy y” is both self-financing and optimal. N 
It follows from Proposition 1 and Theorem 1 that the existence of a self-financing 
trading strategy y` € £ , where the total wealth at a fixed time T is consistent with 


eq.7, implies that the wealth process V, (v^) satisfies 


& lup) jaue] vveo 


Therefore, an optimal trading strategy for a fixed time T will be optimal for any time 
before T. It follows further that an optimal trading strategy is only based on the 
information up to time /. The optimal trading strategy v; is an adapted process with 


respect to the filtration {Ft € [0, +00} . In the next section, we will apply the theorem 


to the case of a financial market containing a saving account and n correlated assets, 
whose price processes follow Ornstein- Uhlenbeck mean-reverting processes. 


3 Implications for Ornstein-Uhlenbeck Processes 


In this section we set r =r and S, (1) = exp(x, (1) , t e[6.7] ,dzeL2,.2n^. 


Each x, (t) is governed by 
dx, (t) =| a, bx (1) Jat +50, dW/ Lif =12 00 « 
j=l 


where a, is some fixed real number, b, >0 is some nonnegative real number 


T. : . . 
ando,,are constants. W, = (W), W, W) is a standard n-dimensional Brownian 


motion. 
T . . . 

Definea to be the vector (a 5, yoa, ) ( r’ is the transposed of a matrix), b to be the 

b,; = b, i= j 

b, | = 0, 1x J 


forl <i, j <.The matrix e has a non-zero determinant. Then the dynamic equation 


, andae to be the matrix 9,,70;;, 


HN 


nXn matrix of the form b= | 


ofx, = (x (1). (0)... x, (D can be expressed as 


dx, =[a—bx,|dt+odW, . (13) 


LetS, = (S, (0),5, (0)....8, (y „S, =B'S,. 


According to Ito's lemma, the dynamic ofS, is 
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4S, (r) - S, (t) s (rat S, (n Y, dW! i=1,2,....0 


j=} 


T 
- (6,50, 50, ) . 


? a ; (14) 


where 0, = (8, (u),0, (u),....0, (u)) is a n-dimensional adapted stochastic process 


where z, (1) = a, — bj log(S, (£))+ 
Define 


dP, PEE 
-5 = exp} |0, dW, 
ral 


O (u)+...+0? (u) is the Euclidean vector norm, and W,a Girsanov 


1 
transformed Brownian motion, i.e. W, = W, — fo du. 


Then under P, , S, it follows 
dS, (t) - S, (t Aw) )-r-So c, 8, ( oas: de, dw) 


Define c, (7) = 1, (t)—r and in vector form e, = (c, (1),c, (¢),....¢, (0) . If 8, solves 
c0, =c, 0, - 67e, , (15) 
then under Ê, it follows that 
d$, = Yedw, , (16) 


~ ~ S. H -$ H , j=j x 
where the matrix Z is defined ss a iC) | 1s, is a martingale 


under P.. 


To apply theorem 1, we need first to prove the completeness of the market price 
processes under consideration. The next lemma tells us that indeed the market is 
complete. The proof is given in an earlier presentation [7]. 


Lemma2 The mean-reverting market given above is complete. 


In caseU (x) = log(x), we find /(x) 2 1/x. Using eq.9, we can show y, =1. Then 
the optimal discounted wealth process is 
e, | |. 


Now we are in a position to derive a general result for Ornstein-Uhlenbeck processes. 


t t 


y'-z =e aW, 
0 
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Theorem 2 The optimal trading strategy y; = (d; (0.4) is given by: 
g, (t)= BPV; (1- &'6/xS,). d (t)= BY XO, (1)4,,(t),i=1,2,...0 . (17) 
fel 


where kh, = (Zo ) . 


Proof: 
First, we can show immediately 


A =¢ S, +h (1) B, , 


where Vis the optimal wealth process given by 


t 


V, =B, ef Je. aw, -l fle. |" 2 . 
0 2 0 


Using Ito’s lemma for Ñ’ , we get 
dV’ 2dZ,-V/e/aw, . 
From eq.16 we know that 
aW, =h dŠ, . 
Combining the above two equations, we have 
dV; - V/0/ h, dŠ, = BV, (07h,dS, -r07 A,S,dt) , (18) 
and 
dV" = B;'dV' -rB;V/dt . (19) 
Combining eq.18 and eq.19, we arrive at 
dV; =V; | B'9/2,48, + B7' (1- B,'8,72,S, )dB, |= 4 aS, +45 (t)dB, . QU) 
Eq.20 shows directly that v; = (d; ORKA ) given by eq.17 is the optimal self-financing 
trading strategy. O 


The optimal fraction vector f -(A (0. 5 (0. 5 ()) is composed of the 
individual f" (z), e.g. 4; (1); (f)/ V/ . By simple calculations based on Theorem 2, 


we have 


f -R'c, (21) 
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where R = oo’ is a symmetric matrix and called correlation matrix. We will show in a 
separate paper that the matrix R denotes the correlations of the yield rates, e.g. the 
correlation of the i" and j™ assets is a deterministic function of, -6,. If the standard 


inverse of the volatility matrix does not exist, then one can resort to the generalized 
Moore-Penrose inverse to obtain a related result for the optimal investment fractions 
in markets without arbitrage. 

Another derivation of the optimal fraction can be based on the function 


F(x)- ex- zx Rx, VxeR"^ , (22) 


linked to the mean-variance approach, since the optimal fraction given by eq.21 is the 
maximum of the function F. This indicates the close relationship between the utility 
maximization and the mean-variance method. 

In the special case where the market is composed of only one stock, by eq.21, the 
optimal fraction is f 2 (u, —r)/o^ , where u, = a- blog(S(r))- 0-5" . Fig. 1 shows 
a sample path for the stock process and the associated optimal investment fraction and 
wealth process. As an aside, if one assumes zero interest rates, than the sensitivity of 
the optimal fraction to a percentage estimation error in the drift z is twice the negative 
of a similar error inc , e.g. a 7% overestimation in volatility has approximately the 
same impact as an underestimation of the drift by 226. 
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Fig. 1. Simulation of the stock price process, the corresponding optimal strategy f, , and the 


wealth process V, with parameters a=0.5, b=0.2, o =0.1, r-0.03, S, =10 and Y, =10. 
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Besides the sensitivity to estimation errors, it would be interesting to analyze the 
impact of the correlation matrix on the optimal trading strategy. Positive correlation 
has a tendency to reduce the number of ‘independent’ assets and forces investors to 
reduce leverage, e.g. the sum of the absolute values of the investment fractions is 
smaller. 

Next, we study a special case where the assets have local correlations. The different 
assets only correlate to the neighboring assets but have no correlation to the rest assets. 
Let's set the risk-free rate to zero and the volatility matrix to be 


cc 0 0 
00 oco -.: 
6=|0 0 ^. >. 0 ,nz2 
Do. 75 75 8 
00 0 - c 


nxn 


In this case, we will only study the large time limit. Let us denote by f; (œ) 


Zo) im & [9.70 - a» 
i=} 
After some simple manipulations we get 


ntl grnodd 


Fs (2) = . (24) 


n . 
—  forneven 


For a fixed odd integer n, the limit of the expected total fraction is (7#+1)/4, which is 
identical to the value for the next even number. 

Now, let us investigate another correlation structure where the assets have global 
correlations. As a simple example the volatility matrix is chosen as 


100 0 
110 +. 0 

6-c|l 1 LEE ,nz2, 
: 1 0 


1 0 0 0 

-] 1 0 0 
o'=o'|/0 -1 ` : 
: 1 0 

0 0 -] 1 
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Thus the optimal fraction f” ( )i is given by 


)- (e (0) - e(t )) i=] 
f ()= » )-«4(0)-(e.()-5()) i23..() . — 0s 
c,(t)-—c j 


FOEL Oa) 40— . (26) 


The total fraction here is equal to the optimal fraction in another market containing 
only the first asset. This surprising result is partly due to the fact that in the multi- 
dimensional case the investment fractions are likely to have both positive and 
negative signs. The expected total optimal fraction is 


l 
a, —b, log( S, (0)) le ^ - o? -r 
F =r (9]-a(-^ iu » 2 . QT) 


B 7 7 for b= 
f (20) = lim f (1) = ° . (28) 
0? =r 
2 for b, #0 
[02 


For additional examples, in particular in higher dimensions, we refer the reader to [7]. 


4 Conclusions 


In the earlier sections we presented a discussion of the Kelly criterion in the 
continuous-time framework. The main theorem shows that in a complete market there 
exists an optimal self-financing trading strategy that maximizes the logarithmic utility 
function. The optimal investment fractions were explicitly calculated. 

One general implication of the Kelly's criterion is maybe worth mentioning. It 
follows from Breiman's theorem [2], which shows that a logarithmic utility 
maximizer outperforms with probability one in the long run any substantially different 
trading strategy. This theorem has surprising consequences, for example it has 
spanned a smallish field called ‘evolutionary finance’ [8]. According to evolutionary 
finance ‘natural selection’ should favor agents with log utility. Such agents maximize 
the growth rate of their wealth with probability one, and thus dominate eventually the 
market. The stark claim is that either the investor maximizes utility or is marginalized. 
The authors are doubtful, if such a strong claim is justified, since only in the long time 


11 
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limit does the utility maximizer almost surely outperform. In the real world, where 
one has multiple independent agents and frequent paradigm shifts, maybe an even 
more aggressive strategy is warranted. Being ‘overinvested’ can be ‘superior’ (lower 
utility, but higher winning probability) in the short term. Even in the medium term the 
log maximizer has difficulties to outperform, if many independent agents exist. This 
could be a partial explanation for the regular crisis in financial markets, e.g. investors, 
who seek a short-term competitive advantage, invest over-aggressively. This is on top 
of the significant inherent volatility of utility maximization. A proper understanding 
of the impact of the Kelly criterion on the optimal behavior of individual agents is the 
precursor to consistent multi-agent modeling. 

As a speculative aside, maybe utility maximization has a role in the study of the 
punctuated equilibrium observed in the evolutionary history of the earth, since utility 
maximization could provide a potential explanation without necessarily having to 
resort to external causes like asteroid impacts or volcanic eruptions for rare 
widespread extinction events. 

A further interesting aside is to relate Kelly’s criteria to market bubbles. A 
bubble is sometimes defined as a self-reinforcing dynamic associated with an 
excessive increase in asset prices followed by a sudden collapse. One simple and 
generic way to achieve a bubble is to find a reinforcing process that reduces the 
volatility a , which would increase the optimal leverage as perceived by the investors, 


since the optimal Kelly fraction is given by(u, —r)/6^. Next a description of a 


possible self-reinforcing process: In a number of investment strategies investors are 
short volatility. This selling of implied volatility can lead to a decrease of implied 
volatility due to supply and demand imbalances, which leads on the one hand to an 
increase in the optimal leverage and on the other hand to mark-to-market profits for 
the short volatility positions. As volatility decreases the optimal position size, e.g. 
leverage, increases, putting additional pressure on implied volatilities. This process 
continues until the friction associated with obtaining additional leverage stops the 
process. This ‘virtuous circle’ is then replaced by a ‘vicious circle’, since the extremal 
point is unstable, as implied volatility increases and leverage decreases. Here we do 
not discuss realized and implied volatility separately, since they are positively 
correlated and for the qualitative description presented here a dampening process for 
either type of volatility is sufficient. The analysis sketched out above can be extended 
to show that many bubbles, i.e. credit and stock market bubble, are driven by changes 
in the optimal leverage ratio as derived from the Kelly criterion. 

Due the space limitations, we are not able to provide even a brief description of the 
application of the result in the area of statistical arbitrage. The present discussions are 
based on the continuous time framework, but realistic markets have an inherent 
discreteness. Furthermore there are different types of frictions, e.g. transaction cost, 
bid-offer spreads and liquidity constraints, which impose portfolio readjustment 
frequency restrictions. Not all of those influences are small! and can be neglected. In 
an earlier presentation [7] correction terms for reducing the investment fractions were 
explicitly calculated. It would be of interest to give a comprehensive analysis of the 
impact of the different types of frictions for statistical arbitrage strategies. This will be 
done by the authors in a separate paper. 
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In conclusion this article gives a quantitative insight into the trade-off between risk 
and return as diversification opportunities are added, correlation structure changed, 
and other constraints modified. 
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Introduction to the Relationship of Kelly Optimization to 
Asset Allocation 


The Kelly growth optimum approach is an attractive formula for investing. If the 
formula is robust and can be adapted to include realistic constraints on investing, 
then the practicality of the method is clear. The various papers here discuss these 
issues including liabilities, fractional Kelly, benchmarks, fixed mix strategies, and 
volatility induced wealth growth. 

The first paper discusses asset allocations with withdrawals. Browne (1997) ana- 
lyzes the optimal behavior of an investor (or pension fund manager) who withdraws 
funds continuously at a fixed rate per unit time. The investor can allocate funds to 
a given number of risky assets whose prices follow geometric Brownian motion and 
a risk free asset with a known constant rate of return. He shows that there are safe 
and risky zones with the latter being a region with a positive probability of ruin and 
the former where there is no chance of bankruptcy with those asset allocations. Tt 
is not possible to reach the safe region with probability one before bankruptcy or to 
determine an optimal policy to reach an upper level before a given lower level. But 
the safe region can be reached (optimized) with c-optimality. Brown then finds the 
optimal growth policy for safe zone investors, which is a Elog (Kelly) maximizing 
policy where the investor invests a constant proportion of wealth above a stochastic 
target floor in a form of constant proportions portfolio insurance. 

'The next two papers discuss fractional Kelly strategies. MacLean, Ziemba and 
Blazenko (1992) discuss growth versus security tradeoffs in dynamic investment 
analysis. We know that the E log maximizing strategies have the highest asymptotic 
long term growth rates. This approach is combined with the maximal security 
literature of Ferguson (1965), Epstein (1977), and Feller (1962). They propose 
three growth measures and three security measures. The growth measures are: 


(1) The mean accumulation of wealth at the end of time t, that is how much do we 
expect to have, on average, after ¢ periods. 

(2) The mean exponential growth rate and its limit as t — co. 

(3) The mean first passage time to reach the set [U, co), that is, how long on average 
does it take the investor to reach a specific level of wealth. 


The measures of security are: 


(1) The probability that the investor will have a specific accumulated wealth at 
time t, that is what are the chances of reaching a given target in a fixed amount 
of time. 
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(2) The probability that the investor's wealth is above a specified path. 
(3) The probability of reaching a goal U which is higher than the initial wealth yo 
before falling to a wealth level L below yo. 


MacLean et al. study growth-security efficiency analgously to mean-variance 
efficiency as well as the weaker conditions of growth-security monotonicity. This 
latter tradeoff is called effective. They show that fractional Kelly strategies trace 
out effective growth-security tradeoffs. Later in MacLean, Ziemba, and Li (2005), 
it is shown that these fractionally Kelly strategies are efficient, if the assets are 
log-normally distributed, so they maximize growth for any given security level. The 
analysis leads to simple two dimensional graphs that compare growth rates with the 
probability of achieving a higher goal before falling to a lower level. Applications 
are made to blackjack, horseracing, lotto games, and futures trading. This range 
of applications runs the full range of asset weights from very small to very large 
percentages of the investor's wealth. 

MacLean, Sanegre, Zhao, and Ziemba (MSZZ) (2004) provide an approach to 
calculate the optimal fractional Kelly weights at discrete intervals of time. The idea 
is to maximize the expected log subject to being above a specified wealth path with 
high probability. This is à value at risk type of constraint and leads to a model 
where a modified Kelly strategy can be computed at the discrete intervals of the 
planning horizon. This strategy is not necessarily fractional Kelly. In general, as 
risk rises, the expected return will fall so there is an effective tradeoff. With geomet- 
ric Brownian motion in continuous time, fractional Kelly is optimal for the value 
at risk (VaR) and conditional value at risk (CVaR) where returns in the VaR tails 
are lineraly penalized constraints. The algorithm proposed uses a disjunctive form 
for the probabilistic constraints which identifies an outer problem of choosing an 
optimal set of scenarios and an inner (conditional) problem of finding the optimal 
decisions for a given discrete scenario set. The multiperiod inner problem is com- 
posed of a sequence of conditional one period problems. The theory is illustrated 
for the dynamic allocation of wealth in stocks, bonds and cash. 

There are two ways to model the application, maximize the expected log of final 
wealth subject to 


(1) wealth being above the specified wealth path with high probability; and 
(2) a secured annual drawdown with a drawdown amount and a security level both 
exogenously specified. 


'The VaR model only controls loss at the horizon but not at intermediate decision 
points. The more stringent risk control constraint in the drawdown model considers 
the loss in each period. 

At low levels of risk control, the full Kelly strategy is optimal. As the risk control 
requirements are raised, the strategy becomes more conservative, especially when 
close to the planning horizon. Assets are assumed to be log-normally distributed 
with geometric random walk but the computational procedure is general and applies 
to general price distributions. 
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In an extension, MacLean, Zhao, and Ziemba (2009) add one more feature to 
the MSZZ model, namely that when the predetermined exogenous path is violated, 
there is a convex penalty for there shortfall violations (see Mulvey, Bilgili, and Vural 
(2010) in part VI for an application along these lines). 

The next three papers discuss the impact of benchmarks. Browne (2000) con- 
siders a dynamic portfolio management problem where the objective concerns the 
tradeoff of performance goals and the risk of a shortfall below a benchmark target. 
Return is measured by the expected time to reach investment goals relative to the 
benchmark. This return is maximized subject to the risk constraint in terms of the 
probability of shortfall relative to the benchmark. The setting is Merton’s (1971) 
continuous time framework. 

Davis and Lleo (2009) extend the definition of fractional Kelly strategies to the 
situation in which the investor’s objective is to outperform a benchmark. These 
benchmarked fractional Kelly strategies are efficient portfolios even when the asset 
returns are not log-normally distributed. They find the benchmarked fractional 
Kelly strategies for various types of benchmarks such as the S&P500 and the Sa- 
lomon Smith Barney World Government Bond Index. They also determine the re- 
lations between the investor’s risk aversion and the given benchmarks. The setting 
is a stochastic control model in continuous time. They develop, following Merton, 
two and three mutual fund theorems. This means that the optimal weights over n 
assets can be found by optimally weighting the two or three mutual funds which are 
linear combinations of the original n assets, where n > 3, see Rudolf and Ziemba 
(2004) for a four mutual fund theory with liabilities in part VI; see also Davis and 
Lleo (2008a,b). In addition to single index benchmarks, there can be composite 
benchmarks plus alpha or composite weighted multiple index benchmarks with or 
without an alpha. 

The Davis and Lleo two fund theorem for the case without and with benchmarks 
is that the assets can be split between the full Kelly portfolio and a correction 
portfolio C with the weights dependent on the investor’s risk sensitivity. C is 
an intertemporal hedging portfolio, which can be the risk free asset under certain 
assumptions. They develop a three mutual fund theorem for the case of benchmarks 
with risky and risk free assets. Formulas are developed for the optimal equity 
portfolio weightings in the various mutual funds. 

Any portfolio can be expressed as a linear combination of a mutual fund, an 
index fund and a long-short hedge fund with risky and risk free asset allocations. 
Again, the weights are dependent on the risk sensitivity. The risk sensitivity varies 
from zero (the full Kelly expected log case) to infinity where the investor replicates 
the benchmark with an index fund. With moderate risk sensitivity there will be a 
blend of all three asset funds. 

Platen (2010) discusses the benchmark approach for dynamic investing and pric- 
ing. A numeraire portfolio exists which is strictly positive that makes all bench- 
marked nonnegative portfolios downward trending or trendless. The benchmark 
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portfolio is the Kelly Elog maximizing portfolio which cannot be outperformed by 
another long only portfolio. The benchmark portfolio can be used for pricing and 
computing conditional expectations. 

The final paper in this part deals with fixed-mix and volatility induced wealth 
growth. Dempster, Evstigneev, and Schenk-Hoppé (2009) discuss the literature con- 
cerning the growth of wealth over time through volatility using fixed-mix strategies. 
These self-financing strategies rebalance the portfolio at each decision point to keep 
constant the proportions of wealth invested in various assets. Through volatility, 
the portfolio will gain wealth even with assets that do not have strictly positive 
means. This is a generalization of the buy-low sell-high investment strategy. But 
they show that there is much more to the story than this implies. The analysis does 
not need utility assumptions and compares the fixed-mix rebalancing with buy and 
hold strategies. The key assumption is that the period by period returns of the var- 
ious assets are stationary. Dempster, Evstigneev, and Schenk-Hoppé discuss some 
counter-intuitive examples such as growth with assets having zero means. They 
discuss the effects of transaction costs on their results and determine conditions for 
growth to occur with sufficiently small transaction costs. They also show that the 
conjecture, the higher the volatility, the higher the induced growth rate is not true 
in general. The key idea is to rebalance to an arbitrary fixed mix portfolio. 
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SURVIVAL AND GROWTH WITH A LIABILITY: OPTIMAL 
PORTFOLIO STRATEGIES IN CONTINUOUS TIME 


SID BROWNE 


We study the optimal behavior of an investor who is forced to withdraw funds continuously 
at a fixed rate per unit time (e.g., to pay for a liability, to consume, or to pay dividends). The 
investor is allowed to invest ın any or all of a given number of risky stocks, whose prices 
follow geometric Brownian motion, as well as in a riskless asset which has a constant rate of 
return. The fact that the withdrawal is continuously enforced, regardless of the wealth level, 
ensures that there 1s a region where there is a positive probability of ruin. In the complemen- 
tary region ruin can be avoided with certainty. Call the former region the danger-zone and 
the latter region the safe-region. We first consider the problem of maximizing the probability 
that the safe-region is reached before bankruptcy, which we call the survival problem. While 
we show, among other results, that an optimal policy does not exist for this problem, we are 
able to construct explicit e-optimal policies, for any e > 0. In the safe-region, where ultimate 
survival is assured, we turn our attention to growth. Among other results, we find the optimal 
growth policy for the investor, i.e., the policy which reaches another (higher valued) goal as 
quickly as possible. Other variants of both the survival problem as well as the growth problem 
are also discussed. Our results for the latter are intimately related to the theory of Constant 
Proportions Portfolio Insurance, 


1. Introduction. The problem considered here is to solve for the optimal invest- 
ment decision of an investor who must withdraw funds (e.g., to pay for some liability 
or to consume) continuously at a given rate per unit time. Income can be obtained 
only from investment in any of n + 1 assets: n risky stocks, and a bond with a 
deterministic constant return. The objectives considered here relate solely to what 
can be termed “goal problems," in that we assume the investor is interested in 
reaching some given values of wealth (called goals) with as high a probability as 
possible and/or as quickly as possible. 

The fact that the investor must continuously withdraw funds at a fixed rate 
introduces a new difficulty that was not present in the previous studies of objectives 
related to reaching goals quickly (cf. Heath et al. 1987). Specifically the forced 
withdrawals ensure that at certain levels of wealth, there is a positive probability of 
going bankrupt, and thus the investor is forced to invest in the risky stocks to avoid 
ruin. In this paper we address the two basic problems faced by such an investor: how 
to survive, and how to grow. The survival problem turns out to be somewhat tricky, in 
that we prove that no fully optimal policy exists. Nevertheless, we are able to 
construct e-optimal policies, for any e > 0. The growth problem is answered com- 
pletely, once the survival aspect is clarified. 

While this model is directly applicable to the workings of certain economic 
enterprises, such as a pension fund manager with fixed expenses that must be paid 
continuously (regardless of the level of wealth in the fund), our results are also 
related to investment strategies that are referred to as Constant Proportion Portfolio 
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Insurance (CPPI). In fact a related model was used as the economic justification of 
CPPI in Black and Perold (1992), where both the theory and application of such 
strategies is described. In Black and Perold (1992), optimal strategies were obtained 
for the objective of maximizing utility of consumption, for a very specific utility 
function, subject to a minimum consumption constraint. However, the analysis and 
policies of Black and Perold (1992) are relevant only when initial wealth is in a 
particular region (specifically, when initial wealth is above the "floor"), wherein for 
that policy, there is no possibility of bankruptcy. Black and Perold (1992) did not 
address the fact that for the model described there, ruin, or bankruptcy, is a very real 
possibility when initial wealth is in the complementary region (below the “floor’). 

Here we focus on the objectives of survival and growth, which are intrinsic 
objective criteria that are independent of any specific individual utility function. As 
such, our results for both aspects of the problem will therefore complement the 
results of Black and Perold (1992) (as well as the more recent related work of Dybvig 
1995). Firstly, the survival problem has not been addressed before for this model 
(although see Majumdar and Radner 1991 and Roy 1995), and secondly, the optimal 
growth policies we obtain provides another objective justification for the use of the 
CPPI policies prescribed in Black and Perold (1992), since for this problem we get 
similar policies as those obtained there. 

The remainder of the paper is organized as follows: In the next section, we will 
describe the model in greater detail, and prove a general theorem in stochastic 
control from which all our subsequent results will follow. To facilitate the exposition, 
we will at first consider the case where there is only one risky stock and where the 
withdrawal rate is constant per unit time. It turns out that the state space (for wealth) 
can be divided into two regions, which we will call the “danger-zone” and the 
“safe-region.” In the latter region, the investor need never face the possibility of ruin, 
and so we can concentrate purely on the growth aspects of the investor. (The 
aforementioned studies of Black and Perold 1992 and Dybvig 1995 considered only 
this region in their analyses of the maximization of utility from consumption problem.) 
In the former region, ruin, or bankruptcy, is a possibility (hence the term “danger- 
zone") and therefore we first concentrate on passing from the danger-zone into the 
safe-region. This is the survival problem and it is completely analyzed in $3. In 
particular, two problems are considered, maximizing the probability of reaching the 
* safe-region" before going bankrupt, and minimizing the discounted penalty that must be 
paid upon reaching bankruptcy. It is the former problem that does not admit an 
optimal policy, although we are able to explicitly construct an e-optimal policy. The 
latter problem does admit an optimal policy, which we find explicitly. The structure of 
both policies are quite similar, in that they both essentially invest a (different) 
proportional amount of the distance to the safe-region. In $4 we consider the growth 
problem in the safe-region. We define growth as reaching a given (high) level of 
wealth as quickly as possible. Two related problems are then solved completely. First, 
we find the policy that minimizes the expected time to the (good) goal, and then we find 
the policy that maximizes the expected discounted reward of getting to the goal. Our 
resulting optimal growth policies turn out to be quite similar to the CPPI policies 
obtained for a different problem by Black and Perold (1992), in that they invest a 
(different) proportional amount of the distance from the danger-zone. Extensions to 
the multiple asset case as well as the case of a wealth-dependent withdrawal rate are 
discussed in $$5 and 6. 

All the control problems considered in this paper are special cases of a particular 
general control problem that is solved in Theorem 2.1 in $2 below. For this problem 
we use the Hamilton-Jacobi-Bellman (HJB) equations of stochastic control (see, e.g., 
Fleming and Rishel 1975, or Krylov 1980) to obtain a candidate optimal policy in 
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terms of a candidate value function and this value function is then in turn given as the 
solution to a particular nonlinear Dirichlet problem. These candidate values are then 
verified and rigorously proved to be optimal by the martingale optimality principle (see 
$V.15 in Rogers and Williams 1987, or $2 in Davis and Norman 1990). The resulting 
nonlinear differential equations are then solved in turn for each of the problems 
considered below, yielding the optimal solutions in explicit form. 


2. The model and continuous-time stochastic control. Without loss of generality, 
we assume that there is only one risky stock available for investment (e.g., a mutual 
fund), whose price at time ¢ will be denoted by P,. (Extension to the multidimen- 
sional case (for a complete market) is quite straightforward, and since the excess 
notation required adds little to the understanding, we will simply outline how to 
obtain the results for the multidimensional case in a later section.) As is quite 
standard (see, e.g., Merton 1971, 1990, Davis and Norman 1990, Black and Perold 
1992, Grossman and Zhou 1993, Pliska 1986), we will assume that the price process of 
the risky stock follows a geometric Brownian motion, i.e., P, satisfies the stochastic 
differential equation 


(1) dP, = uP, dt + oP, AW, 


where u and a are positive constants and {W,: t > 0) is a standard Brownian motion 
defined on the complete probability space (Q, Z, P), where (5; is the P-augmenta- 
tion of the natural filtration Z” = of{W,; 0 < s < t). (Thus the instantaneous return 
on the risky stock, dP,/P,, is a linear Brownian motion.) 

The other investment opportunity is a bond, whose price at time t is denoted by B,. 
We will assume that 


(2) dB, — rB, dt 


where r > 0. To avoid triviality, we assume p >r. 

We assume, for now, that the investor must withdraw funds continuously at a 
constant rate, say c > 0 per unit time, regardless of the level of wealth. (This would 
be applicable for example if the investor faces a constant liability to which $c must be 
paid continuously.) In a later section we generalize this to a case where the 
withdrawals are wealth-dependent. 

Let f, denote the rotal amount of money invested in the risky stock at time t under an 
investment policy f. An investment policy f is admissible if (f,, t 2 0) is a measurable, 
{#}-adapted process for which fdf; dt < œ, as., for every T < œ. Let 9 denote the 
set of admissible policies. 

For each admissible control process f € Y, let (X, f, t > 0) denote the associated 
wealth process, i.e., X/ is the wealth of the investor at time !, if he follows policy f. 
Since any amount not invested in the risky stock is held in the bond, this process then 
evolves as 


dP, j dB, 
(3) aX] = fg t (X) -I)E ~ cdt 
= [rX/ + f(u- r) - c] dt + fio aW, 


upon substituting from (1) and (2). 
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Thus, for Markov control processes f, and functions W(t, x) € €'^, the generator 
of the wealth process is 


(4) A Y(t, x) =F, + [flu r) +r- c] Y, + Shia, 


We will put no constraints on the control f, (other than admissibility). In particu- 
lar, we will allow f, < 0, as well as f, > Xf. In the first instance, the company is 
selling the stock short, while in the second instance it is borrowing money to invest 
long in the stock. (While we do allow shortselling, it turns out that none of our 
optima! policies will ever in fact do this.) 

What will differentiate our model and results from previous work are the objectives 
considered and the fact that here the withdrawal rate c is constant, and not a 
decision variable. 

The usual portfolio and asset allocation problems considered in the financial 
economics literature deal with an investor whose wealth also evolves according to a 
stochastic differential equation as in (3), where instead of being constant, c is now a 
control variable as well, i.e., the consumption function c, = c( Xf). For a specific 
utility function u(-), the investor’s objective is then to maximize the expected utility of 
consumption and terminal wealth over some finite horizon, i.e., for T > 0, and some 
“bequest function” V(-), the investor wishes to solve 


(5) sup E [ f"e-*u(e, dt c e" v(Xf)|, 
fre 0 


for some discount factor A > 0. Alternatively, the investor may wish to solve the 
discounted infinite horizon problem 


(6) sup E, f e" u(c,) dt. 
fe 8 


In both of these cases, since the process {c,} is usually assumed to be completely 
controllable, it is clear that for certain utility functions at least, ruin need never occur, 
since we may simply stop consuming at some level. Alternatively, as is the case when 
the utility function is of the form u(c) = c'~®/( — R) for some R < 1, or u(c) = 
In(c), the resulting optimal policy takes both investment f, and consumption c, to be 
proportional to wealth, i.e., f, = m(2)X,, c, = 7{t)X,, which in turns makes the 
optimal wealth process into a geometric Brownian motion, and thus the origin 
becomes an inaccessible barrier, Classical accounts of such (and more sophisticated) 
problems are discussed in Merton (1971, 1990) and Davis and Norman (1990) among 
others. 

Optimal investment decisions with constraints on consumption have also been 
considered in the literature previously. Most relevant to our model is the literature on 
constant proportion portfolio insurance (CPPI), as introduced in Black and Perold 
(1992), where the resulting policy is to invest a constant proportion of the excess of 
wealth over a given constant floor. (As its name suggests, portfolio insurance can be 
loosely considered any trading and investment strategy that ensures that the value of 
a portfolio never decrease below some limit. Alternative approaches to portfolio 
insurance using options and other techniques are described in e.g., Luskin 1988.) 
Black and Perold (1992) introduced this policy as the solution to the discounted 
infinite horizon problem of (6) subject to the constraint that c, > c,,,, where Cum is 
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some given constant. The specific utility function considered there was 


ute) = cl'^/(1- R) force >c*, 
K, — Kye forc x c*, 


where c* is a given constant, R < 1 and K,, K, are constants chosen to ensure u(-) 
continuous throughout. While others (e.g., Dybvig 1995) have raised some technical 
questions about the analysis in Black and Perold (1992), more relevant to our point of 
view is the fact that this model (and the resulting optimal policy) allows for the 
possibility of ruin, or bankruptcy, if wealth is initially below the given floor. This 
possibility was never addressed in Black and Perold (1992). 

In this paper we do not concentrate on the usual utility maximization problems of 
(5) and (6). Rather, here we are concerned with the objective problems of survival and 
growth. In particular, we first study the problem of how the investor (whose wealth 
evolves according to (3)) should invest to maximize the probability that the investor 
survives forever (which turns out to be related to maximizing the probability of 
achieving a given fixed fortune before going bankrupt), as well as the problem of how the 
investor should invest so as to minimize the time until a given level of wealth has been 
achieved. The former problem is called the survival problem, and is discussed in $3. 
The latter is called the growth problem and is the content of $4. Related problems 
have been studied in general under the label of “goal problems" in the works of 
Pestien and Sudderth (1985, 1988), Heath et al. (1987) and Orey et al. (1987). The 
survival problem for some specific related models were studied in Browne (1995) and 
Majumdar and Radner (1991). The former treated an “incomplete market" model, 
where the withdrawals are not fixed but rather follow a stochastic process, and the 
latter treated a model with forced constant consumption but without the possibility of 
investing in a risk free asset. 

Recently, in order to provide a consumption based economic justification for the 
interesting portfolio strategies introduced in Grossman and Zhou (1993) (where the 
optimal policy invests a constant proportion of wealth over a stochastic floor), Dybvig 
(1995) considered the consumption-investment problem of (6) with the constraint that 
consumption never decrease, i.e., that c, > c,, for all t > s, with cg > 0. Thus consump- 
tion is forced in his model as well. He considered utility functions of the form 
u(c) = c! 5/0 — R) and u(c) = In(c). However he only considered the problem in 
the feasible region, where initial wealth X,, satisfies X > cọ/r, and so for which ruin 
need not occur. Dybvig (1995) did not consider the case when X, > cy/r, and hence 
where ruin is possibility, and so our results on this problem in 83 will complement his 
analysis as well. Since in this paper our obiectives deals solely with the achievement 
of particular goals associated with wealth, it is clear that if there is a constraint on 
consumption as in the models of Black and Perold (1992) and Dybvig (1995), we 
should always set consumption at the minimum level, which in both cases is a 
constant (Cmn in Black and Perold 1992 and c, in Dybvig 1995). This is consistent 
with the model we analyze here, where we will (at least at first) take consumption as a 
fixed constant c per unit time. This implies that at least for some values of wealth, 
the origin is accessible, and thus ruin is in fact a possibility. 

In the next section we consider the problem of how to invest in order to survive. 
However, before we study that problem, we need a preliminary result from control 
theory that will provide the basis of all our future results. 


2.4. Optimal control. The problems of survival and growth considered in this 
paper are all special cases of (Dirichlet-type) optimal control problems of the 
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following form: For each admissible control process (f,, £ = 0}, let 
f= inf {1 >0: Xf= z} 


denote the first hitting time to the point z of the associated wealth process (X/) of 
(3), under policy f. For given numbers (l, u) with | < X, < u, let 7^ = min(r/, 7/) 
denote the first escape time from the interval (/, u). 

For a given nonnegative continuous function A(x) > 0, a given real bounded 
continuous function g(x), and a function A(x) given for x = l, x =u, let v/(x) be 
defined by 


af 


f(x) = f — F'XI f Fu xi 
(7) v(x) E(f eX; jew] [nonas dt + a X/)exp| J AXN) as}| 
with 
v(x) = supv/(x) and f*(x) = argsup v/(x). 
JEF SJES 
We note at the outset that we are only interested in controls (and initial values x) for 
which »/(x) < c. 


As a matter of notation, we note first that here, and throughout the remainder of 
the paper, the parameter y will be defined by 


(8) v= (4s). 


THEOREM 2.1. Suppose that w(x): (l,u) > (—©, ©) is a @? function that is the 
concave increasing (i.e., w, > 0 and w,, < 0) solution to the nonlinear Dirichlet problem 


w(x) 
(9) (mx —-c)w(x)- um (x) + g(x) ~ A(x)w(x) = 90, forl<x<u, 
with 
(10) w(I) =A(1) and w(u) — h(u), 


and satisfies the conditions: 
(i) w2(x)/w, (x) is bounded for all x in (l, u); 
(ii) there exists an integrable random variable Y such that for all t = 0,w(X/) = Y; 
(iii) w,GoO/ ww, is locally Lipschitz continuous. 
Then w(x) is the optimal value function, i.e., w(x) = v(x), and moreover the optimal 
control, f*, can then be written as 


(11) f(x) = -(4 — r) w(x) for! € x € u. 


T? IwaQ(x)' 


Proor. The appropriate HJB optimality equation of dynamic programming for 
maximizing »/(x) of (7) over control policies f,, to be solved for a function v is 
sup; c UY Iv + g — Av} = 0, subject to the Dirichlet boundary conditions v(1) = hl) 
and v(u) = hlu) (cf. Theorem 1.4.5 of Krylov 1980). Since v(x) is independent of 
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time, the generator of (4) shows that this is equivalent to 
(12) sup (f(u =r) +m = e)» + Ffon rav) =O. 
fes 


Assuming now that (12) admits a classical solution with v, > 0 and », < 0 (see, e.g., 
Fleming and Soner 1993), we may then use standard calculus to optimize with respect 
to f in (12) to obtain the maximizer f* = —((u — r)/oa?)v/v,, (compare with 
(11). When this f*(x) is then substituted back into (12) and the resulting equation is 
simplified, we obtain the nonlinear Dirichlet problem of (9) (with v = w). 

It remains only to verify that the policy f* is indeed optimal. The aforementioned 
theorem in Krylov (1980) does not apply here, since in particular the degeneracy 
condition (Krylov 1980, page 23) is not met. We will use instead the martingale 
optimality principle, which entails finding an appropriate functional which is a 
uniformly integrable martingale under the (candidate) optimal policy, but a super- 
martingale under any other admissible policy, with respect to the filtration Z (see 
Rogers and Williams 1987, Davis and Norman 1990). 

To that end, let A/(s, £) := f(/ACX/) dv, and define the process 


(13) M(t, XI) = e^ 9 w(X/) + fe “MO.9e(X!) ds, forü «ts v, 


where w is the concave increasing solution to (9). 
Optimality of f* of (11) is then a direct consequence of the following lemma. 


LEMMA 2.2. For any admissible policy f, and M(t, - ) as defined in (13), we have 
(14) E(M(t ^ 77, XI, ,1)) < M(0, Xy) = w(x); 


with equality holding if and only if f =f, where ff is the policy given in (11). Moreover, 
under policy f*, the process {M(t ^ Tf, X*,,.)} is a uniformly integrable martingale. 


PRoor. Applying Ito’s formula to M(t, X/) of (13) using (3) shows that for 
O<s<t<rl 


5) M(t, X/) =M(s, XI) + fe" OCF; Xf) du + f € of m (X1) aW, 
x 5 
where Q(z; y) denotes the quadratic (in z) defined by 
Olzy) mz Fo?me(y)] + zlG 7 000] 


(y — c)w(y) * (y) - ACY) wy). 


Recognize now that since Q,,(z; y) = ow, (y) < 0, we always have Q(z; y) x 0, 
and the maximum is achieved at the value 


(457) ee 


zy) n o? wey) 


with corresponding maximal value 


Q(z*iy) = (y - e)u(y) - yey *g(y) — A(y)w(y) 20 
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where the final equality follows from (9). Therefore the second term in the r.h.s. of 
(15) is always less than or equal to 0. Moreover (15) shows that we have 


[^re vm Yo fw. (X1) dW, 
0 


=M(t A Tf, Xf, 1) ~ w(x) - [^ e tog; X1) dv 
0 


2 M(t ^c, XI, ) — w(x). 


Thus, by Gi) we see that the stochastic integral term in (15) is a local martingale that 
is in fact a supermartingale. Hence, taking expectations on (15), with s = 0, therefore 
shows that 


(16) E(M(t^ Tf, X/,,1)) S w(x) «s Me OOK f; Xf ja] 
0 


< w(x) + z| ane Ven sup Ol hi xD a) 


0 fe 
= w(x) 


with the equality in (16) being achieved at the policy f. 

Thus we have established (14). 

Note that under the policy f? of (11), the wealth process X* satisfies the 
stochastic differential equation 


e lla seo ay lO) | CX). 
OD dk (x CUm y] E T V vr OJ ens 


where 7* = 7%. By Gii) this equation admits a unique strong solution (Karatzas and 
Shreve 1988, Theorem 5.2.5). 
Furthermore note that under the (optimal) policy, f£*, we have, for allO<s<1r< 
T*, 
2 * 
* = xy _ 1 _ d * Wy (xX ) 
(18) M(t, X*) = M(s, X7) 23 fex f A(X? do etx; aW, 


which by (i) above is seen to be a uniformly integrable martingale. This completes the 
proof of the theorem. o 
We now return to the survival problem. 


3. Maximizing survival. We consider in this section two objectives related to 
maximizing the survival of the investor. First we consider the problem of minimizing 
the probability of ruin which is related to the problem of maximizing the probability of 
reaching a particular given upper level of wealth before a given lower level. We will show 
that an optimal strategy for this latter problem does not exist, although exploiting the 
solution to a related solvable problem will allow us to explicitly construct e-optimal 
ones. Next we consider the related objective of minimizing the expected discounted 
penalty of ruin, which is equivalent to minimizing the expected discounted time to 
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bankruptcy. This problem does admit an optimal solution and we find it explicitly. The 
structure of the (optimal) survival policies obtained in this section are similar in that 
they all invest a fixed fraction of the positive distance of wealth to a particular goal. 


3.1. Minimizing the probability of ruin. The evolutionary equation (3) exhibits 
clearly that under policy f, the wealth process is a diffusion with drift function m and 
diffusion coefficient function v given respectively by 


(19) m(f,x,t) - (i —r) * c, v( f, x, t) = fio. 


Thus for any admissible control f < © there is a region (in X space) where there is a 
positive probability of bankruptcy. 'This is due to the fact that while the variance of 
the wealth process is completely controllable, as is apparent from (19), the drift is not 
completely controllable due to the fact that c > 0, and hence the drift can be 
negative at certain wealth levels. This feature differentiates this model from those 
usually studied in the investment literature (e.g., Merton 1971, 1990, Pliska 1986, 
Davis and Norman 1990) with Majumdar and Radner (1991) being a notable 
exception. (For results on an “incomplete market" model where the variance, as well 
as the drift, is also not completely controllable, see Browne 1995.) Specifically, let a 
denote the bankruptcy level or point, with corresponding "bankruptcy time" (or ruin 
time) 7/, where 0 < a < X,. One survival objective is then to choose an investment 
policy which minimizes the probability of ruin, i.e., one which minimizes P(z/ < =), 
or equivalently, maximizes P(z/ = ~) (see, e.g., Majumdar and Radner 1991, Browne 
1995, Roy 1995). 

Clearly this objective is meaningless for X/ 7 c/r. To see this directly, consider the 
case where the wealth level is x > c/r. We may then choose a policy which puts all 
wealth into the bond, and then under this policy the probability of bankruptcy is 0. 
Specifically, if we take f = 0 for x > c/r, (3) shows that the wealth will then follow 
the deterministic differential equation dX, = (rX, — c) dt, Xy =x > c/r, which ex- 
hibits exponential growth and for which P(z, , = ©) = 1, for all e > 0. Thus the 
survival problem is interesting and relevant only in the region a < x < c/r, which we will 
call the “danger-zone.” This is of course due to the fact that c/r = cfy e "' dt is the 
amount that is needed to be invested in the perpetual bond to pay off the forced 
withdrawals forever. Since the investor need never face the possibility of ruin for 
x > c/r, we will call the region (c/r, ©) the “safe-region.” 

Our objective in this section therefore is to determine a strategy that maximizes the 
probability of hitting the safe-region or “safe point," c /r, prior to the “bankruptcy point,” 
a, when initial wealth is in the danger-zone, i.e., a < x < c/r. As noted above, we will 
show that an optimal policy for this problem does not exist, necessitating the 
construction of an e-optimal strategy. 

A somewhat related survival problem with constant withdrawals was studied in 
Majumdar and Radner (1991) in a different setting, although without a risk-free 
investment, and hence without a safe-region. Moreover, their results are not applica- 
ble to our case since here inf, vf, x,t) = 0, which violates the conditions of the 
model in Majumdar and Radner (1991). As we shall see, it is in fact precisely this fact 
that negates the existence of an optimal policy for our problem. A related survival 
model which allows for investment in a risk-free asset, but where the “withdrawals” 
are assumed to follow another (possibly dependent). Brownian motion with drift, was 
treated in Browne (1995). Since the Brownian motion is unbounded, there was no 
safe-region in Browne (1995) either. A discrete-time model with constant withdrawals 
that does allow for a risk-free investment was treated in Roy (1995), but with no 
borrowing allowed and bounded support for the return on the risky asset. 
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To show explicitly why no policy obtains optimality for the model treated here, and 
how we may construct e-optimal strategies, we will first consider the following 
problem: for any point b in the danger-zone, i.e., with a <x < b < c/r, we will find 
the optimal policy to maximize the probability of hitting b before a. For b strictly less 
than c/r, an optimal policy does exist for this problem, and we will identify it in the 
following theorem. To that end let 


V(x:a,b) = supP.(z/ > sf), andlet ff = argsup P,(7/ > z[). 
fes 


JES 


THEOREM 3.1. The optimal policy is to invest, at each wealth level a <x < b, the 
state dependent amount 


(20) fie) = HL ($ -x). 


Bar 
The optimal value function is 


yr y/r+i 


(c — ra ~ (c - rx) 
(c - ra)" -(c- rb)" *! , 


(21) V(x:a,b) = forasx<b, 


where y is defined by (8). 


REMARK 3.1. Note that the policy of (20) invests less as the wealth gets closer to 
the goal b. In fact, it invests a constant proportion of the distance to the “safe point” 
c/r, regardless of the value of the goal b, and the bankruptcy point a. It is interesting to 
observe that while here this constant proportion is independent of the underlying 
diffusion parameter q’, this does not hold when there are multiple risky stocks in 
which to invest in (see §5 below). The constant proportion is greater (less) than 1 as 
p/r < (>) 3. Thus it is interesting to observe that as the wealth gets closer to the 
bankruptcy point, a, the optimal policy does not "panic" and start investing an 
enormous amount, rather the optimal policy stays calm and invests at most f7(a) = 
2(c — ra)/( u — r). The investor does get increasingly more cautious as his wealth 
gets closer to the goal b. (This behavior should be compared with the “timid” vs. 
“bold” play in the discrete-time problems considered in the classic book of Dubins 
and Savage 1965. See also Majumdar and Radner 1991 and Roy 1995.) 

Observe further that the investor is borrowing money to invest in the stock only 
when x « 2c/( g +r) but not when 2c/(u +r) « x € c/r. This can be seen by 
observing directly that in the former case f(x) > x and in the latter case f(x) <x. 
(The fact that 2c/( + r) < c/r follows from the assumption that u > r.) 


PROOF. While we could prove Theorem 3.1 from a more general theorem in 
Pestien and Sudderth (1985) (see also Pestien and Sudderth 1988) which we will 
discuss later (see Remark 3.4 below), recognize that this is simply a special case of the 
control problem solved in Theorem 2.1 for /=a,u =b with A — 0, g — 0 and 
h(b) = 1, h(a) = 0. As such the nonlinear Dirichlet problem of (9) for the optimal 
value function V becomes in this case 


2 
È 


Xx 


(22) Qx -c)V, — yy 7-0, fora<x<b 


subject to the (Dirichlet) boundary conditions V(a) = 0, V(b) = 1. 
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The general solution to the second-order nonlinear ordinary differential equation 
of Q2) is K, — K,(c ~ rx)” +! , where K,, K, are arbitrary constants which will be 
determined from the boundary conditions. The boundary condition V(a) = 0 deter- 
mines that K, = K;(c — ra)*/'*!, and the boundary condition V(b) = 1 then deter- 
mines K,, which then leads directly to the function V(x) given in (21). It is clear that 
this function V is in @? and does in fact satisfy V, > 0 and V,, < 0, and moreover 
satisfies conditions (i), (ii) and (iii) of Theorem 2.1 on the interval (a, b). (Condition 
(ii) is trivially met since V is bounded on (a, b).) As such V is indeed the optimal 
value function and the associated optimal control function fë of Q0) is then obtained 
by substituting the function V of (21) for w in (11). o 

Note that under policy f7, the wealth process, say X,*, satisfies the stochastic 
differential equation 


(23) AX = (c = XX) dt + Ee = XH) dW, fort < T*, 

where T* = min(z/, 77), and z* = inf{t > 0: X* = z}. This is obtained by placing 
the control (20) into the evolutionary equation (3). Equation (23) defines a linear 
stochastic differential equation, ie. X" is a time-homogeneous diffusion on (a, b) 
with drift function 1, (x) — c — rx, and diffusion coefficient function o ?, (x) = 
(Qo/(p. — rYXc — ny = Q/yXoc — rx). As such its scale function is defined by 


(24) 


S*(x) = fexp{ - frau} dy=—-(yt+r)'(e-m)”"', forasx<b 


e&(u) 
where y = 3(( — /o.. For this process therefore, 


S*(x)-S*(a) | (c— ray?! - (c- nx) 
S*(b) — S*(a) B (c _ ray") B (c _ rb) vet} ? 


P(e > T$) = 


which of course agrees with (21). Thus the process (S*CX*)) is a diffusion in natural 
scale, and is therefore a (uniformly integrable) martingale with respect to the 
filtration Z (as is the optimal value function), ie, E(S*CX*)I%) = S*(X7) for 
Oxsztz c, where r* :— miní(z?, r7). Note further that the scale function S*(x) 
of (24), is increasing in x (although negative) for 0 € x « c/r. 


3.1.1. Inaccessibility of the safe-region under f¥ and e-optimal strategies. While 
we have found a policy that maximizes the probability of reaching any b « c/r before 
any a « b, it is important to realize that if we extend b to c/r, then this policy will 
never achieve the safe point c /r with positive probability in finite time. We can of course 
extend the function displayed in (21) to the point c/r to get 


(25) V(x:a,c/r) = P(t > ty) 


* — — /r1 

_ S*(x) - S*(4) . -(£ aye « c, fora<x<c/r 
S*(c/r) — S*(a) c — ra 

which shows than in fact c/r is an attracting barrier for the process X *. However, it is 

an unattainable barrier. (See 815.6 in Karlin and Taylor (1981) for a discussion of the 
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boundary classification terminology used here.) To verify this, first recall that if we let 
s*(x) = dS*(x)/dx denote the scale density of the diffusion X *, then its speed density 
is given by 


(26) m*(x) (o2(x)s*(x)) ' = [5t nce e) 


(c E m) ~(yf/rt2) 
and it is well known then that 
E,(min(z*,7*,,]) <% if and only if f S*es) — S*(y)]m*(y) dy < œ. 


However, it can be seen from (24) and (26) that the latter quantity is 


SEIS -SOO à aS dr = 


and thus we see that while f? minimizes the probability of hitting the ruin point a, 
and so is in fact optimal for the problem of min, P (Tf = x), it does so in a way which 
makes the upper goal, c/r, unattainable in finite expected time. In fact, under f? we 
have 7, = «c a.s., and thus no optimal policy exists for the problem of maximizing the 
probability of reaching the safe-region prior to bankruptcy! 

Intuitively, what's going on is that as the wealth gets closer to the boundary of the 
safe-region, c/r, the investor gets increasingly more cautious so as not to forfeit his 
chances of getting there. This of course entails investing less and less, but in 
continuous-time, where the wealth is infinitely divisible, this just means eventually 
investing (close to) nothing. However while this in turn does in effect shut off the drift 
of the resulting wealth process (see (23)), it also shuts off the variance, and some 
positive variance is needed to cross over the c/r-barrier from the danger-zone into 
the safe-region. This is not supplied by the policy described above, which essentially 
tells the investor that the best he can hope to do (i.e., with maximal probability) is to 
try to get pulled into an asymptote that is drifting toward c/r. 

In terms of our Theorem 2.1, it is clear that V is no longer concave increasing for 
x > c/r (ie., for x > c/r, we have V(x) > 0 and V,,(x) < 0), and thus Theorem 2.1 
is not valid for any u > c/r. 


Remark 3.3. This difficulty disappears if r = 0, since if there is no risk-free 
investment, the investor always faces a positive probability of ruin and the only way to 
survive is to always invest in the risky stock. To see this, note that letting a = 0 and 
taking limits as r | 0 (so that the “safe point" goes to infinity, i.e., when r = 0, there is 
always a positive probability of bankruptcy) shows that the value function, V(x: 
0, c /r), then goes to an exponential, i.e., as r 10, 


(27) V(x:0,c/r) > 1- exp{ - es 


and for this case the (unconstrained) optimal contro] to minimize the probability of 
ruin is to always invest the fixed constant 2c/p. (This model then becomes a 
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degenerate special case of Browne 1995.) In this case the optimal wealth process 
follows a linear Brownian motion with drift c and diffusion coefficient 2ca/p, for 
which the probability of ruin is the exponential (27). Ferguson (1965) conjectured that 
an ordinary investor (in discrete-time and space) can asymptotically minimize the 
probability of ruin by maximizing the exponential utility of terminal wealth, for some risk 
aversion parameter. It is interesting to observe that for this model the conjecture turns 
out to be true. To verify this, one would have to solve the finite-horizon utility 
maximization problem for the utility function u(x) = 8 — ņ exp{—2cx/y}, with arbi- 
trary y > 0 and ô. Since this problem is then essentially a special case of the 
(Cauchy) problem considered in §3 of Browne (1995), we refer the reader there for 
further details. If we impose the constraint that the investor is not allowed to borrow, 
then it can be shown (Browne 1995, Theorem 3) that the optimal control in this case 
is f* = max{x,2c/u}, whereby the investor must invest all his wealth in the risky 
stock when wealth is below the critical level 2c/,. In this case the value function is 
no longer concave below 2c/. Such extremal behavior (or “bold” play, ala Dubins 
and Savage 1965) and nonconcavity of the value function below a threshold is also a 
feature of the optimal policies in the related survival models studied in Majumdar 
and Radner (1991) and Roy (1995), where borrowing is not allowed. 


REMARK 3.4. This inaccessibility and the resulting nonexistence of an optimal 
policy can be best understood in the context of the more general “goal” problem: 
Consider a controlled diffusion (Y/) on the interval (a, b) satisfying 


dY/ = m(f, y) dt + v(f, y) dW, 


with the objective of determining a control to maximize the probability of hitting b 
before a. Let V(x) denote the optimal value function for this problem, i.e., V(x) = 
sup; ey PAt{ > rf) with optimal control (x) = argsup,c g P (7f > rf). This prob- 
lem was first studied by Pestien and Sudderth (1985, 1988), who showed—using a 
different formulation—that 


= m(f, x) 
(28) W(x) = argsup | 2 \, 


and indeed our f*(x) of (20) can be obtained from maximizing m/v’ for m and v? in 
(19). However as noted in Pestien and Sudderth (1985, 1988), this is the case only 
when inf, v2 (f, x) > 0, where p= m, /v^,. 

These results can be obtained from somewhat simpler methods (albeit with some 
lesser generality) then those used in Pestien and Sudderth (1985, 1988) as follows: 
V(x) must satisfy the HJB equation 


(29) sup [mCf, x) V, get x). 
f 


irl geste eee) = 


subject to the Dirichlet boundary conditions ¥(a) = 0, ¥(b) = 1. 

If VW is a classical solution to the HJB equation (29), then we must have W, > 0 and 
W,, < 0. Therefore as long as v?(f, x) > 0, it is clear from (29) that the maximum of 
(29) occurs at the maximum of m/v’, which by (28) is denoted by y. If we now let 
p(x) = sup{m(f, x)/u-(f, x), ie, p(x) = myx), x)/v?( p(x), x), then the solu- 
tion to (29) subject to the Dirichlet conditions is simply W(x) = fžs(z) dz/ f/s(z) dz, 
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where s(z) = exp( — 2/ py) dy), with which our value function (21) of course agrees, 
for b < cyr. 

However, it is also clear from (29) that for v?^( f, x) = 0, the HJB equation need not 
hold, and therefore, no policy is in general optimal when this is the case, which is 
precisely what is happening here for b — c/r (see also Example 4.1 in Pestien and 
Sudderth 1988). 

For more details on the general problem from a different perspective, we refer the 
reader to the fundamental papers of Pestien and Sudderth (1985, 1988). We now 
return to the problem of determining a ‘good’ strategy for crossing the c/r barrier. 


An e-optimal strategy. As we have just seen, the inaccessibility of c/r is due to 
the fact that f dictates an investment policy that causes the drift and variance of the 
resulting wealth process to go to zero as the c/r barrier is approached from below. A 
practical way around this difficulty is to modify f? as follows: 

Let ff denote the (suboptimal) policy which agrees with fý below the point 
c/r — 6, and then above it invests « in the risky stock until the c/r barrier is 
crossed, i.e., 


e(x) forx <c/r- 6, 
me [ ) esee 
K or x » c/r — ô. 
Now (xq, a, c/r) as given in (25) is an upper bound on the probability of escaping 
the interval (a, c/r) into the safe-region starting from an initial wealth level xy < c/r 
(see Krylov 1980, page 5). Without loss of generality, we may take a = 0 here. Thus 
for any e > 0, and initial wealth xy < c/r, the best we can do is find a policy which 
gives 


(30) V(xg:0,c/r) -e-1— (3 B mr 


— € 


as its value. Therefore for any given e, and initial wealth xy < c/r, we need to find 
8 = (xp €) and x = k(x,, €) which will achieve the value (30). To keep the drift 
and diffusion parameters continuous, we must take x = (2r/( u — r))8(x,, €), and so 
f GO = Qr/Ca — r)maxí(c/r — x, 8), which then gives a corresponding wealth pro- 
cess X? which has the (continuous) drift function w(x) and diffusion function o° (x) 
given by m(x) = max(c — m, 2ró + rx — c), and oej(x) = max{2(c — 
m / y, 2r/8? / y). 
The scale density for this new process, defined by 


»2us(z) 
Ss(¥) = apl- f IE a 
can then be written as 
(c — y)" for y x c/r — 6, 
sy) = (r8) eX OWT e y+2ô-c/r 


| for y 2 c/r — 6, 


where $ denotes the standard normal p.d.f. 
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The probability of reaching the safe-region from initial wealth xy < c/r under this 
policy is therefore 


fo*ss( y) dy 


2 = V(xy:0,c/r)(1 + 8?" H(y,r,c)) ! 
fis) dp C ert (Hr) 


Vs(x9:0,c/r) = 


where V is as in (25) and H is given by 


H(y,.r,c) =(1 4 y/r)(r/c)**! e 92r y e(2/v7r ) — o(Vyr )], 


where ® denotes the standard normal c.d.f. Setting V; = V — e, and then solving for 
6 therefore gives 


€ 


r/(ytr) 
(31) 5( Xo, €) = EIOUDILTSDEZEDI 


Therefore, for the particular 5(xy, €) given in (31), the policy f* is within e of 
optimality. Since we chose a — 0 here purely for notational convenience, we summa- 
rize this in the following theorem for the case with an arbitrary bankruptcy point a, 
with 0 <a < xo. 


THEOREM 3.2. The policy f; , given by 


(32) 
f£(x) fora «x € cyr — 6, 
fë (x) = ALAA r oV: a,c/r) -e])]" forxzc/r- 8, 


is an. e-optimal policy for maximizing the probability of crossing the c /r barrier before 
hitting the point a, starting from an initial wealth level xy, where a « xy « c/r, and 
VG: a, c/r) is the function given by (25). 


3.2. Minimizing discounted penalty of bankruptcy. Suppose now that instead of 
minimizing the probability of ruin, we are instead interested in choosing a policy that 
maximizes the time until bankruptcy, in some sense. Obviously, this problem is nontriv- 
ial only in the danger-zone a <x < c/r, which is the case considered here. Maximiz- 
ing the expected time until bankruptcy is a trivial problem, since there are any number 
of policies under which the expected time to bankruptcy is in fact infinite. In 
particular the e-optimal policy described above gives a positive probability of reaching 
the c/r barrier, and since the safe-region (x > c/r) is absorbing, it therefore gives an 
infinite expected time to ruin. Thus we need to look at other criteria. Here we will 
consider the objective of minimizing the expected discounted time until bankruptcy (a 
related problem without forced withdrawals was treated in Dutta 1994 in a different 
framework, and in an incomplete market in Browne 1995). In particular, suppose 
there is a large penalty, say M, that must be paid if and when the ruin point g is hit. 
If there is a (constant) discount rate A > 0, then the amount due upon hitting this 
point is therefore Me™, and we would like to find a policy that minimizes the 
expected value of this penalty. Clearly, this policy is equivalent to the policy that 
minimizes E (e ^"). 

To that end, let F(x) = inf; e y Ele” M and let f? denote the associated 
optimal policy, i.e., f? = arginf, ey E, (e-^). For reasons that will become clear 
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soon, define the constants n* and D by 
(33) yt= g*(A)- sleet y+a) + VD], 


(34) D=D(A) =(y+A-—ry + 4ry, 


where y is defined by (8). The optimal policy and optimal value function for this 
problem is then given in the following theorem. 


THEOREM 3.3, The optimal control is 


-r c 
(35) fe (x) = Sia pF - x}, fora «x « cyr, 
and the optimal value function is 
(36) P(x) = (E), fra sx «cp. 


REMARK 3.5. Note that 4*> 1, and that F(a)- 1, F(c/r) — 0, and F(x) is 
monotonically decreasing on the interval (a, c/7), as is the optimal policy f, which 
once again invests a constant proportion of the distance to the goat. Observe too that we 
are therefore once again faced with the problem that under this policy, the safe-point 
c/r is inaccessible. However, in this case it is indeed the unique optimal policy. The 
condition F(a) = 1 is by construction, but the fact that F(c/r) = 0 is determined by 
the optimality equation itself, ie., optimality determines that the c/r barrier is 
inaccessible. The intuition behind this is that this policy—-although it never allows the 
fortune to cross the c/r-barrier —does indeed minimize the expected discounted time 
until ruin. The best one can do in this case is to get trapped in an asymptote 
approaching c/r, which this policy tries to do. Any additional investment near the 
c/r-barrier (such as in the eoptimal strategy of the previous problem) allows a 
greater possibility of hitting 2, thus increasing the value of E,(e ^"). 

REMARK 3.6. As a consistency check, note too that when we substitute the control 
f£ of (35) back into the evolutionary equation (3), we obtain a wealth process, say 
XÀ, that satisfies the stochastic differential equation 


(37) 


dX? = X?) dW, fort« T^, 


[Gray le mt vae 


where T° = mintz?, TÒ}, where 7? = inf(t > 0: X? =z). 
For this process, it is well known that the Laplace transform of 7,‘ evaluated at the 
point A, say L(x: A) = E,(e~?"*), is the unique solution of the Dirichlet problem 


a 


_2y 0. j iu - JL. -AL- 
fee Dr ihe- alat I giray m AL = 0, 
with Lla: A) = 1 and L(c/r: A) = 0. It can be easily checked that we do in fact have 
F(x) = L(x: A). It should be noted that 7? is a defective random variable, with 
E,(7,4) = », as can be seen from the fact that 


L(x:0) = (€ LE = 1~V(x:4,¢/r), 
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where V(x: -,-) is the function defined by (21). This of course is due to the fact that 
c/r essentially acts as an absorbing barrier, and it can be hit with positive probability 
(albeit in infinite time). Specifically, as A | 0, it is clear that 9*(A) > 9*(0) = y/r +1, 
and thus F(x) converges (uniformly in x) to the probability that the bankruptcy point 
a is hit before the safe point c/r, which implies that the control ff(x) converges 
(uniformly in x) to the control f#(x) of (20), ie. as ALO: 


Z (e-r) = f(x). 


F(x) o1-V(x:a,c/r) and ff(x)o m 


Note also that for A > 0, we have fë > fë, which is of course consistent with the fact 
that a “bolder” strategy maximizes the probability of survival, while a “timid” strategy 
maximizes expected playing time (for subfair games). 


PROOF. Theorem 2.1 is again relevant, however since Theorem 2.1 deals with the 
maximization problem, recognize that F = —sup,(—E,(e^ ^). We can now apply 
Theorem 2.1 to F = —F with A(x) = A, g = 0, h(a) = — 1. Reverting back to F, we 
then see that the nonlinear Dirichlet problem of (9) for F becomes then: 


2 


F, 
(38) Gx-c)F, -YF -AF=0, fora <x<c/r, 


subject to the Dirichlet boundary condition F(a) = 1, where y = (u — r)/oY.. 
Observe of course that we now require F, < 0 and F,, > 0. 

The nonlinear second-order ordinary differential equation in (38) admits the two 
solutions C(e — rx)", and K(c — rx)'^, where C and K are constants to be deter- 
mined from the boundary condition, and where n+, ^ are the roots to the quadratic 
equation Qn) = 0, where 


(39) O(n) 2 mr -nly t Ar) +a. 


To determine which (if any) of these two solutions are appropriate we need to 
examine these roots in greater detail. The discriminant of (39) is the constant D of 
(34) which is clearly positive, and thus the two roots are real and, for A 0, distinct. 
In particular 


(40) n= Æ| +y+a) + vD] and n= =| + y+ a) - vD]. 


Since n* 7 = A/r > 0, both roots are of the same sign, and since 9*7 0, they are 
both positive. The boundary condition F(a) = 1 determines the constants C, K as 
C= (c — ra) " and K = (c — ra)". and so clearly C > 0 and K > 0, and there- 
fore, F, « 0 for both solutions. However it is easy to check the roots in (40) to see 
that n*> 1, while 5^ « 1, and so F,, > 0 only for the root *. Thus we find that the 
(unique) solution to the (38) that satisfies F(a) = 1 and F, < 0 F, > 0 is given by 
the function F(x) defined in (36). Moreover, it is a simple matter to check that 
conditions (i), (i) and (iii) of Theorem 2.1 are indeed met for F (F is bounded on 
(a, b)), and so we may conclude that F is optimal. The associated optimal control 
function, f# of (35), is then obtained by placing F (or F = —-F)into(11). o 


REMARK 3.7. An alternative proof of Theorem 3.4 can be constructed by modify- 
ing the arguments in Orey et al. (1987), who treat the converse problem of maximiz- 
ing discounted time to a goal, to deal with the minimization problem treated here. 
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The evolutionary equation (3) would have to be reparameterized by taking f, = m,- 
(c/r — X/), for admissible control processes 7, and then applying the results of Orey 
et al. (1987) to the further transformed process Y? = In(c — rX7)/(c — rb)]. 

Note that since the wealth process, say X>, under the policy fZ, satisfies the 
stochastic differential equation (37), we can apply Ito’s formula to the function F(-) 
given by (36) ta show, after simplification, that 


(41) 


tye. at + 
ar(xs) = rx) € Hn Ya zy 2 ZI for0 « i « TA, 


The quadratic of (39) then shows that (n*)?r — n*(r + y) = A(y*— 1), and thus 
substituting this into the r.h.s. of (41), and then solving the resulting (linear) 
stochastic differential equation gives 


+ 2 + 
F(X") = Pixel - rf] i - mut) fort < T°, 


which shows that the value function F(-) operating on the process X? is a geometric 
Brownian motion on the interval (0, 1), for a x X? x cr. 

Unfortunately, as noted above, this policy, while optimal for the stated problem, 
will never cross the c /r barrier into the safe-region, and thus the investor should utilize a 
policy similar to the e-optimal policy described earlier to get into the safe-region. 
Since this can be achieved at relatively little cost, we will assume for the sequel that 
the investor does in fact invest in a way that will allow a positive probability of getting 
into the safe-region. When (if) the safe-region is achieved, the investor no longer 
faces the problem of bankruptcy, and should then be concerned with other optimality 
criteria. We consider two such criteria in the next section. 


4. Optimal growth policies, in the safe-region. Suppose now that we have 
survived, i.e., we have achieved a level x > c/r. As noted earlier it is clear that in this 
region there need never be a possibility of ruin, and therefore the investor who has 
achieved this safe-region will be interested in criteria other than survival. In particu- 
lar, we assume here that in this region the investor is interested in growth, by which 
we mean achieving a high level of wealth as quickly as possible. Suppose therefore 
that there is now some target goal, which we will denote again by 5 with b > x, which 
the investor wants to get to (e.g., to pay out dividends) as quickly as possible. In this 
section we consider two related aspects of this problem. First we consider the 
problem of minimizing the expected time to the goal, and then we consider the related 
problem of maximizing the expected discounted reward of achieving the goal. In both 
cases, the optimal strategies are interesting generalizations of the Kelly criterion that 
has been studied in discrete-time in Kelly (1956), Breiman (1961) and Thorp (1969), 
and in continuous-time in Pestien and Sudderth (1985) and Heath et al. (1987). (See 
also Theorem 6.5 in Merton 1990, where it is called the growth-optimum strategy. For 
Bayesian versions of both the discrete and continuous-time Kelly criterion, see 
Browne and Whitt 1996.) Such policies dictate investing a constant multiple of the 
wealth in the risky stock. Here our policies invest a constant multiple of the excess 
wealth over the boundary c/r, in the risky stock. This will make the c/r boundary 
inaccessible from above, ensuring that the investor will stay in the safe-region forever, 
almost surely. 
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4.1. Minimizing the time to a goal. To formalize this, let Xo = x, but now with 
c/r <x < b. For rf = inf(t > 0: X/ = b), let 


G(x) = int E.G). and let fë = arg inf E,(7$). 


THEOREM 4.1. For the problem of minimizing the expected time to the goal b, the 
optimal policy is to invest 


(42) fé(x)- Thx- £). forc/r «x « b. 
g r 
The optimal value function is 


(43) G(x) = 


inf 


r+ yin eI. for c/r <x x b. 


REMARK 4.1. Note that the proportion ( u — r)/o? in (42) is the same proportion 
as in the ordinary continuous-time Kelly criterion (or optimal growth policy) (see 
Heath et al. 1987, Merton 1990, Browne and Whitt 1996). However in our policy fë, 
this proportion operates only on the excess wealth over the boundary c/r. Under this 
policy therefore, the lower boundary, c/r is inaccessible. It is quite interesting to 
observe that this policy is independent of the goal b. This is quite remarkable, since 
while it was to be expected a priori that the optimal policy should look something like 
(42) near the point c/r, which ensures that c/r is inaccessible from above, it is not 
clear why one should expect such behavior to continue throughout even when the 
wealth is far away from c/r. Nevertheless, it appears that the best one can do is to 
simply put c/r into the safe asset, and leave it there forever, continuously compound- 
ing at rate r. This is the endowment which will finance the withdrawal at the constant 
rate c forever. (Recall, c/r = cfg e~” dt.) Once this is done, the optimal policy then 
plays the best ordinary optimal growth game with the remainder of the wealth, 
x — c/r. This policy is quite similar to the policy prescribed in Proposition 11 of Black 
and Perold (1992) as a form of CPPI (see also Dybvig 1995). Thus, we have shown 
that CPPI has another optimality property associated with it, namely that of optimal 
growth. 


PROOF. Since here we are minimizing expected time, we could apply Theorem 2.1 
to G(x) = sup- E, Gr), with g(x) = —1, A = 0, h(b) = 0. Recognizing that G = 
—G, it is then seen that in terms of G, Theorem 2.1 now requires G, < 0 and 
G,, > 0, and that the nonlinear Dirichlet problem of (9) specializes to 


2 


G 
(44) (x = c)G, - Yg +1=0, forc/r<x<b, 


subject to the boundary condition G(5) = 0. It is readily verified that the function G 
of (43) satisfies this, and that moreover for this function we have G, < 0 and G,, > 0 
for all c/r < x < b. The control function f(x) of (42) is obtained by substituting G 
of (43) for v in (11). However, note that while it is easy to see that conditions (i) and 
(iii) of Theorem 2.1 are satisfied by G, it is also clear that G is unbounded on 
(c/r,b), since G(x) > » as x | c/r. Thus it is doubtful that condition (ii) of 
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Theorem 2.1 holds for this case. Nevertheless, we will show that Theorem 4.1 holds 
and fë is indeed the optimal policy, however the final proof of this awaits the 
development in $4.2, and we will complete the proof there after Lemma 4.3. o 

Note that when we substitute the contro! fë of (42) back into the evolutionary 
equation (3), we obtain an (optimal) wealth process, say X^, that satisfies 


(45) dX} = (r + 2y) (x? - E) dt + y2y(x? - E) aW, ford<1< cf, 


where 7; :— inf{t > 0: X? = b), which is again a linear stochastic differential equa- 
tion. (It is clear from this that c/r is in fact an inaccessible lower boundary for X^.) 
The solution to (45) is 


Xx? = (xt - Clexp((r + y) + 2yw] + 5. for0 <t < rf, 


from which it follows that 


v2y 
(46) G(X?) = G(X) =t- ay, for0 <t< Tř, 


ie. under the (optimal) policy fë, the process {G(X?) — G(X,)}, follows a simple 
Brownian motion on (0, ©) with a drift coefficient equal to — 1. (From this it is easy to 
recover the value function (43) from (46) by evaluating the expected value of (46) at 
t= vg using the fact that G(b) = 0, which then gives E,(7#) = G(x).) 


REMARK 4.2. The "minimal time to a goal" problem for the case c = 0 was first 
solved in the fundamental paper of Heath et al. (1987) without direct recourse to 
HJB methods (see also Schál 1993). The result in that case is simply (42) with c = 0 
(see $4 in Heath et al. 1987). Merton (1990, Theorem 6.5), also obtained this policy 
via another, rather complicated, argument. Since the proof given here holds too for 
the case c = 0, our results also provide an alternative and complementary proof for 
that case to the ones in Heath et al. (1987) and Merton (1990). 

In fact, it is possible to apply the results of Heath et al. (1987) to construct a 
different proof of Theorem 4.1. First one would need to reparameterize the wealth 
equation (3) by taking f, = v, : (Xf — c/r), and then applying results of Heath et al. 
(1987) to the further transformed process Y? = In[(rX7 — c)/(rb — c). However the 
results in Heath et al. (1987) are specific to the case where the controls must lie on a 
given constant set that is independent of the current wealth, while the approach here, 
based on the HJB methods of Theorem 2.1 could be modified to allow for a state 
dependent opportunity set. 


42. Maximizing expected discounted reward of achieving the goal. Suppose now 
that instead of minimizing the expected time to the goal 5, we are instead interested 
in maximizing E,(e ^"^), for c/r <x x b. To that end let 


U(x) = sup E, (e^ ^"*), andlet f(x) = arg sup E,(e~*7), 
fee pum 


As we show in the following theorem, the optimal policy for this problem also 
invests a (different) constant proportion of the excess wealth above the c/r barrier, 
and is hence another version of the CPPI strategy as in Black and Perold (1992). 
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THEOREM 42. The optimal control is 


(47) fü(x)- Sioa (Fo 3! for c/r « x <b, 


and the optimal value function is 


(48) U(x) = (=y, forc/r<sx<b, 


where ^ = q (A) was defined previously in (40). 


REMARK 4.3. Recall that 5 is the root that satisfies 0 < $^ < 1 to the quadratic 
equation Q() = 0, where Q(J is given in (39). Note that U(b) = 1, U(c/r) = 0, with 
U(x) monotonically increasing on (c/r, b). As was the case earlier in 83, the fact that 
U(b) — 1 is by construction, but it is optimality that causes U(c/r) — 0, and hence 
makes the danger-zone inaccessible from the safe-region. 


PRoor. The proof is essentially the same as for Theorem 3.3. Specifically, here 
Theorem 2.1 applies directly with u = b, A(x) = A» 0, g = 0 and A(b) = 1. Thus 
the nonlinear Dirichlet problem of (9) for this case specializes to 


U? 
(49) (x — c)U, - YU. =0, forc/r «x «b, 


subject to the boundary condition U(b) = 1. Since we require U, > 0 and U,, < 0, it 
is clear the solution of interest here involves the smaller root, * , to the quadratic 
O(n) = 0 (see (39)), since 47 < 1. The control function fG(x) of (47) is then obtained 
by substituting U of (48) for v into (11). Finally, it is easy to check that U of (48) 
satisfies conditions (i), (ii) and (iii) of Theorem 2.1, and we may therefore conclude 
that fj is indeed optimal. o 

It is interesting to observe that when we place the control fj back into the 
evolutionary equation (3), we find that the resulting optimal wealth process, say X? 
satisfies the stochastic differential equation 


(50) 


y2y 


e - [ny je o ao ram terrent 


where 7? = inf(t > 0: X? = b). An application now of Ito's formula to the function 
U(-) of (48) using (50) (and (39)) gives 


2 - 
{+ y2y re. fort < vg, 


U(X) = Ges [a - » i a | 


which shows that the value function U(-) operating on the process X? is a geometric 
Brownian motion on the interval (0,1), for c/r « X? « b. 


REMARK 4.4. Orey, Pestien and Sudderth (1987), using different methods, studied 
some general goal problems with a similar objective as that considered here, and as a 
particular example study a version of our problem with r = c = 0 (Orey Pestien and 
Sudderth 1987, page 1258). An alternative proof of Theorem 4.3 can therefore be 
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constructed by using the results of Orey et al. (1987) using the transformation and 
reparameterization described above in Remark 4.2. 

We may now use the results of Theorem 4.2 to complete the proof of Theorem 4.1, 
However, we first need the following lemma, which is of independent interest since it 
is applicable to more general processes than those considered here (for related 
results, see Schal 1993, 84). 


LEMMA 4.3. Suppose that for every à > 0, we have 


f 
v(x A) = ite, ooo with optimal control f* ( x; A), 


with v(x; A) < œ, lim, 9 vG a) = vix; 0) < o, and lim, o f*G; A) = fG50). 
Then 


l= e>" 
| 


. , 1-e7 
51 ] fE = inf E | lim —-~——- | = i f 
(51) lim in d ìn | un | inf E (7 ) 


f ao A 
with inf, E (1^) = v(x;0) and with optimal control f(x; 0). 


PROOF. It is the first equality in (51) that needs to be established since the second 
is just an identity. To proceed, it is obvious that A~*[1 — e^ ^] x 7f for all A z 0, 
and hence E, (ATI — e^") < E(7), as well as inf; EOI — eY] < 
inf, EG). Since the r.h.s. of this inequality is independent of the parameter A, it 
follows that we may take limits on À to get 


(52) lim inf E,(A~"[1 -e |) < inf E, (77). 


For notational convenience now, let f* denote the policy f*(-; A), and for any policy 
f, let cf] = 7" Note that under this notation, we may write v(x; A) = E(A^![1 — 
e AUNTY), 


To go the other way now, suppose that there is an admissible policy, say f, such 
that 7[f*} >" 7[f] as A10. Then it is clear that 
int E, (7) < E(7[/]) = E. [lima - ect ) 
int E,(r/) TFT) lima“ ] 
An application of Fatou's lemma then shows that 
E (i A[1 — e AU) < tim E, (A^ [1 — e^ V], 
[lima = eM) < lim E (a [1r = e 1) 
and since E,CA7![1 — e7"U71] = inf, EAT U — e "p, we in turn conclude that 
53 infE,(7/) < Sim int E,(A^!]1 — e7^]]. 
(53) XE, (7) < im int E, (A ] 
The inequalities (52) and (53) yield (51). o 


COMPLETION OF PROOF OF THEOREM 4.1. Observe first that q (à) > 0 as A10, 
and that therefore f* > f$ as A40, where fý and f are given by (47) and (42). 
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Note further that for any c/r <x < b, we have 


lim 


l- U(x) 
ALO A 


= G(x), 


where U and C are given by (48) and (43). Finally, since n^ (A) > 0 as ALO, we have 
Xan, 2° XA. as ALO, where X? and X^ are defined by (45) and (50), from 
which it is clear that 7? ?* 7, as AO. Therefore, Lemma 4.3 may be applied 


directly to Theorem 4.2 to deduce Theorem 4.1. a 


5. The multiple asset case. As promised earlier, here we show how all of our 
previous results extend in a very straightforward way to the case with multiple risky 
stocks. The model here is that of a complete market (as in, e.g., Karatzas and Shreve 
1988) where there are n risky assets generated by n independent Brownian motions. 
The prices of these stocks evolve as 


n 
n, dt. Y, o, dw 
J71 


(54) dP(t) = P(t) 


; i=1],...,n, 


while the riskless asset, B,, still evolves as dB, = rB, dt. The wealth of the investor 
therefore evolves as 


(55) ix! = LE EG -0] + E X fo, aw, 
tal 


ra] sel 


where now f, denotes the total amount of money invested in the ith stock. 

If we introduce now the matrix o = (c),, and the (column) vectors p = 
(py... uM E = Cf... fa)", and then set A = ao’, we may write the generator 
of the (one-dimensional) wealth process, for functions V(x) € €? as 


(56) W(x) = (f( o r1) +r- c)V, + FEAE Y, 


where 1 denotes a vector of 1’s. The assumption of completeness implies that A`! 
exists, and thus all our results will go through exactly as before. In particular, if an 
optimal value function for a specific problem is denoted by v(x), the optimal control 
vector is f(x) where 


(57) f(x) = -A "(n - 71) 


LA 
Vx ` 
The differential equations ((22), (38), (44) and (49)), and hence the value functions 


(Q1), (36), (43) and (48)), all remain the same except for the fact that now the scalar y 
is evaluated as 


(58) y= ji - r) A7 (gc - 71). 


It is interesting to note that for the problem of maximizing the probability of 
reaching 5 before a when 5 is in the danger-zone, considered in 83.1, the optimal 
policy now does depend on the variances and covariances of the risky assets, since 
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instead of (20), in the multiple asset case we now get 
- rfe 
(59) fé(x) =A (uw — rZ (5 - x). 


The e-optimal policy of 83 needs to be modified, but the extension is straightforward 
and we leave the details for the reader. For reference, we note further that if we 
define the vector K by K := A~ "(p — r1), then the optimal controls (35), (42) and 
(47) of 8832, 4.1 and 4.2 become, respectively 


(60) FOKO- "(SF - x}, f) = K(x - $), 


f$) = Kü - m)" (x- £). 


6. Linear withdrawal rate. In this section we show how all of our previous 
results and analysis for the case of forced withdrawals at the constant rate c > 0 can 
be generalized to the case where there is a wealth-dependent withdrawal rate, c(x) 
where 


c(x) =c + Ox. 


Here we will only consider the case where 0 x 6 < r. For notational ease, we will 
consider again only the case with one risky stock. The generalization to the multiple 
stock case as in the previous section is very straightforward, and so we leave the 
details for the reader. 

For this case the evolutionary equation (3) becomes 


P l 
(61) ax! = 5.5 +(x! -pÈ - (c + 6X/) dt 
i t 


= [(r- 6)X/ + f(u- r) - c] dt + fio aW,. 


If we now define F := r — 0 > 0, then for Markov control processes f, and Y € g? 
the generator of the wealth process is 


(62) oY (x) = [Fe r) & & - c], + of 


The parameter ř is simply the adjusted (risk-free) compounding rate. Essentially, 
nothing really changes except for the fact that the danger-zone is now the region 
x « c/F, and the safe-region is its complement. The differential equations (22), (38), 
(44) and (49) all remain the same except for the fact that we must replace zx — c with 
ix — c. The parameter y in all those equations, as well as here, is still defined as in 
(8), ien y 3C — r)/o)’, where r is the standard interest rate. Thus, the previous 
analysis will go through with relatively little change, and so we will only point out the 
essential differences. In particular, the structure of the policies remain the same, in 
that the optimal survival policies of $3 invest a fixed proportion of the distance to the 
(new) safe-region barrier, c/f, while the optimal growth policies of $4 invest a fixed 
proportion of the excess of wealth over the barrier. 


6.1. Survival problems in the danger-zone. The analysis of §§3.1 and 3.2 can be 
repeated almost verbatim. What changes is that now for a <x <b < c/r, the value 
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function V(x; a, b) of (21) becomes instead 


_ y/ř+} _ Lom 
(63) V(x:a,b) = E LE LEE fora <x <b. 
— fa —(c~F 


Since (11) still holds, the resulting optimal control becomes, instead of (20), 


(64) f(x) = 


For the discounted problem of §3.2 (as well as for the discounted problem of §4.2), 
the quadratic Q(-) of (39) changes to An) = m?F — nly + A+) + A, and thus the 
two (real) roots to O(n) = 0, denoted by 5* and 3, become, instead of (40), 


(65) os a; yt) Van -an]. 


It is easy to check that once again, we have 0 < 5$; « 1 < 3j*, and so the optimal 
value function (36) and the optimal control function (35) become, respectively 


fx 


69 y= [EE nol) 


ra 


6.2. Growth policies in the safe-region. Once again, the analysis is almost 
identical to that in §§4.1 and 4.2. The value and the optimal control functions for the 
minimal expected time to the goal problem, (43) and (42) are replaced respectively by 


T(x- S), forc/r « x € b. 


2.1 m-e (>) = 
(67) Gx) - zT. o= 
Note that the optimal (Kelly) proportion of the excess wealth invested in the stock, 
(p — r)/o’, is unchanged in this case. 
Similarly for the discounted problem considered in §4.2, the value function (48) 
and optimal policy (47) become 
" -r (:- 
J 8o u$ 


The case where 6 > r, and hence with 7 < 0, introduces new difficulties that will 
be discussed elsewhere. 


ds 


(8) —— ua- (= 
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This paper concerns the problem of optimal dynamic choice in discrete time for an investor. 
In each period the investor is faced with one or more risky investments. The maximization of 
the expected logarithm of the period by period wealth, referred to as the Kelly criterion, is a very 
desirable investment procedure. It has many attractive properties, such as maximizing the asymp- 
totic rate of growth of the investor's fortune. On the other hand, instead of focusing on maximal 
growth, one can develop strategies based on maximum security. For example, one can minimize 
the ruin probability subject to making a positive return or compute a confidence level of increasing 
the investor's initial fortune to a given final wealth goal. This paper is concerned with methods 
to combine these two approaches. We derive computational formulas for a variety of growth and 
security measures. Utilizing fractional Kelly strategies, we can develop a complete tradeoff of 
growth versus security. The theory is applicable to favorable investment situations such as blackjack, 
horseracing, lotto games, index and commodity futures and options trading. The results provide 
insight into how one should properly invest in these situations. 

(CAPITAL ACCUMULATION; FRACTIONAL KELLY STRATEGIES; EFFECTIVE 
GROWTH-SECURITY TRADEOFF; BLACKJACK; HORSERACING; LOTTO GAMES; 
TURN OF THE YEAR EFFECT) 


This paper develops an approach to the analysis of risky investment problems for 
practical use by individuals. The idea is to provide simple-to-understand two-dimensional 
graphs that provide essential information for intelligent investment choice. Although the 
situation studled is multiperiod, the approach is couched in a growth versus security 
fashion akin to the static Markowitz mean-variance portfolio selection tradeoff. The 
approach is a marriage of the capital growth literature of Kelly ( 1956), Breiman (1961), 
Thorp (1966, 1975), Hakansson (1971, 1979), Algoet and Cover (1988) and others, 
which emphasizes maximal growth, with the maximal security literature of Ferguson 
(1965), Epstein (1977), and Feller (1962). In $1, we formulate a general investment 
model and describe three growth measures and three security measures. The measures 
all relate to single-valued aggregates of the investment results over multiple periods such 
as the mean first passage time to a particular wealth level and the probability of doubling 
one's wealth before halving it. In 82, we model the investment process as a random walk. 
Then, using familiar procedures from probability theory, we can easily generate com- 
putable quantities which are usually close approximations to the measures of interest. 
To generate tradeoffs of growth versus security one can utilize fractional Kelly strategies 
as discussed in $3. A simple but powerful result is that a complete trade-off of growth 
versus security for the most interesting growth and security measures is implementable 
simply by choosing various fractional Kelly strategies. Theoretical justification for the 
fractional Kelly strategies in a multiperiod context can be made using a continuous time 
approach and log normality assumptions (see Li et al. 1990 and Wu and Ziemba 1990). 
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Application of the theory to four favorable investment situations is made in §4. In each 
case the basic game or investment situation is unfavorable to the typical or average 
player. However, systems have been developed that beat the game in the sense that they 
have positive expected value. The question then remains how large should the wagers be 
and how confident is one that particular goals will be achieved. The various games, 
blackjack, horseracing, lotto games and the turn of the year effect differ markedly in their 
character. The size of the wagers vary from over half of one's fortune to less than one 
millionth of the fortune. The graphs that are outputted for each of these applications 
show how to trade off risk and return and provide crucial insight into how one should 
invest intelligently 1n these situations. 


1. The Basic Investment Problem 


An investor has initial wealth yo € R and is facing n risky investments in periods 1, 2, 

.,¢,.... The return on investments follows a stochastic process defined on the prob- 
ability space (Q, B, P) with corresponding product spaces (2', B', P^). Given the realization 
history w! € Q^'!, the investor's capital at the beginning of period t is Y ,., (o^! ). The 
investment in period t in each opportunity i, i = 1, ..., A, is Xalo’), Xj, (ow!) 
x Y,,(w"!) The investment decisions in terms of proportions are X;4(«* !) 
= p(w )Y (o !) fori 1,..., n, and p(w!) = (polw™'),...,Pm(w'')), where 
Èo pilo) = Land pow Y -i (o^!) is the investment at time / in riskless cash-like 
instruments. In this general form the investment strategy p,( «^! ) depends on time and 
the history Q^! of the investment process. The net return per unit of capital invested in 
i,i=1,...,n, given the outcome w, € Q, in period t is given by K;(w,), i= 1,...,n, 
(= 1,.... The return on the risk-free asset is given by Ko, = O. It is assumed that K;,(w;) 
is defined by the multiplicative model 


Kilo) = ailt) Elw), 


where o;(1) > 0 is the average time path and £;(w;) is an independent error term. Special 
cases result when a@;(f) = o; or aj(1) = af for t = 1,.... The error terms in this model 
are not autocorrelated. A more general approach would be to consider the conditional 
return at time / given the history «^ !, denoted by K;(«w,|«^ !). A Bayesian analysis 
with such a model is theoretically tractable, but it is not as suited to computations as the 
multiplicative model. 

An investment environment is said to be favorable if EK;(«) > 0 for some i. We are 
only concerned with favorable environments. The total return on investment is R,,(;) 
= ] + Kj,(w,), and the investment decisions are p = (pi, po(w'),..., pw ), - ++). 
Then the accumulated wealth at the end of period z is 


Y(p, o) = yo I] (Z Rs Gops o!) ; o! EN, [—]),.... (1) 
i=0 


s=l 


For the stochastic capital accumulation process { Y,(p)} Æ, we are interested in char- 
acterizing the accumulation paths as the investment decisions vary. Consider the following 
measures of growth and security. 


1.1. Measures of Growth 


Gl. uw (yo, p) = EY,(p). This is the mean accumulation of wealth at the end of time 
t. That is, how much do we expect to have, on average, after ! periods. 

G2. (yo, p) = E In (¥,(p)'"). This is the mean exponential growth rate over t 
periods. It isa measure of how fast the investor is accumulating wealth. We also consider 
the long-run growth rate $(p) = lim,.,, $,( yo, p). 
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G3. w(yo, p) = Er(yiysvy. This is the mean first passage time 7 to reach the set 
[U, oo). That is, how long on average does it take the investor to reach a specific level 
of wealth. For example, how long must the investor wait before he is a millionaire. 


1.2. Measures of Security 


S1. yi(yo, p) = Pr[Y,(p) = b]. This is the probability that the investor will have a 
specific accumulated wealth b, € R at time ¢. That is, what are his chances of reaching 
a given target in a fixed amount of time. For example, what chance does the investor 
have of accumulating $200,000 one hundred days from now? 

S2. a(yo, p) = Pr[Y,(p) = b,,t = 1, +++]. This is the probability that the investor's 
wealth is above a specified path. Our concern for an investor is that his wealth will fall 
back too much in any period. He may want protection against losing more than, say, 
10% of current wealth at any point in time. 

S3. B(yo, p) = Pr[Ti ypu} € Tiyiysr)l. This is the probability of reaching a goal 
U which is higher than his initial wealth yo, before falling to a wealth level L which is 
less than yo. For example, if U = 2yọ and L = yo/2, this is the probability of doubling 
before halving. 

The rationale for the measures selected is to profile the growth and security dimensions 
of the accumulation process. In each case there is a natural pairing of the measures: 
(m, Yı) for accumulated wealth at a point in time; (¢, a) for the behavior of growth 
paths; and (n, 8) for first passage to terminal or stopping states. The choice of profile 
(pair of measures) is largely a function of personal preference and problem context. It 
is expected that the information contained in each of the profiles would be useful in 
selecting an investment strategy. 

The usual criterion for evaluating a decision rule is t¢,(yo, p), the expected log of 
accumulated wealth. When K;,(w,) = a,(t)E;(w,), i= 1,..., n, t9 lh ..., then the log 
optimal strategy is proportional, p4(«' ^!) = Da. Furthermore, if pj, i = 1,..., n, solves 
the one-period problem 


max [£s (: + D Eo) 


t=] 


n 
> pi <1, pi = 0, isien), 
i=] 


then p; = inf {a7'(t)p;, 1] fori=1,...,n,t = 1,.... In the case where a;(t) = ai, 
i — 1,..., n, the log optimal strategy is a fixed fraction (independent of time), and is 
referred to as the Kelly (1956) strategy. 

Considering the other growth measures we find that the Kelly strategy (1) maximizes 
the long run exponential growth rate $( yo, p), and (ii) minimizes the expected time to 
reach large goals n( yo, p) (Breiman 1961, Algoet and Cover 1988). The main properties 
of the Kelly strategy are summarized in Table 1. The performance of the Kelly strategy 
on the security measures is not so favorable. As with any fixed fraction rule, the Kelly 
strategy never risks ruin, but in general it entails a considerable risk of losing a substantial 
portion of wealth. 

In terms of security an optimal policy is often to keep virtually all wealth in the riskless 
asset (Eithier 1987). We will let pọ = (1, 0, ..., 0) be the extreme security strategy, 
where all assets are held in cash. 

Our purpose is to monitor the growth and security measures as the investment decisions 
vary with the goal of balancing the measures to obtain a strategy which performs well 
on both the growth and security dimensions. For a particular (growth, security) com- 
bination, the complete set of values is given by the graph (see Figure 1) 


U; = {(Gi(p), Sj(p))|p is a feasible investment strategy }. 
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TABLE I—PART 1 
Main Properties for the Kelly Criterion 


Good/Bad Property Reference 
G Maximizing E log X asymptotically Breiman (1961), Algoet and Cover (1988) 
maximizes the rate of asset growth. 
G The expected time to reach a Breiman (1961), Algoet and Cover (1988) 


preassigned goal is asymptotically as 
X increases least with a strategy 
maximizing E log Xy. 
G Maximizing median log X. Ethier (1987) 
B False Property: If maximizing E log Xy Thorp (1975) 
almost certainly leads to a better 
outcome then the expected utility of 
its outcome exceeds that of any 
other rule provided n is sufficiently 
large. 
Counterexample: u(x) = X, 5 « p « 1, 
Bernoulli trials, f= 1 maximizes 
EU(x) but f* 2p- 1 <1 
maximizes E log Xy. 
G The E log X bettor never risks ruin. Hakansson and Miller (1975) 
B If the E log X bettor wins then loses or (1 + yYX1— 3)Xo = (1 — y?)Xo «0 
loses then wins he is behind. The 
order of win and loss is immaterial 
for one, two, . . . sets of trials. 


G The absolute amount bet is monotone (GE log X)/9W, > 0 
in wealth. 

B The bets are extremely large when the Roughly the optimal wager is proportional to the 
wager is favorable and the risk is edge divided by the odds. Hence for low risk 
very low. situations and corresponding low odds the 


wager can be extremely large. For one such 
example, see Ziemba and Hausch (1985, pp. 
159-160). There in a $3 million race the 
optimal fractional wager on a 3-5 shot 

was 64%. 


The most obvious points in U; to identify are efficient points given by (G,(p.), SiCp.)) 


Gj(p,) = max (G;(p)| Si(p) = a, p feasible}. 
Alternatively the efficient points are (G;( p), S;(pg)) with 
Si(pa) = max ( S;(p)| Gi(p) = B, p feasible]. 


Given the required level of security (or growth) the efficient points are undominated. 
The set of efficient points constitutes the efficient frontier. 

Two key efficient points are the unconstrained optimal growth and optimal security 
points, respectively (G;(p), S;(p)) with G;(p) = max (G;(p)]| p feasible } and (G;(po), 
Si(po)) with S;(po) = max { S;(p)| p feasible). The optimal growth strategy is the Kelly 
strategy p and the optimal security strategy is the cash strategy py. These are both fixed 
fraction strategies. Other strategies yielding points on the efficient frontier would not 
typically be fixed fraction, but rather would be dynamic strategies, depending on current 
wealth and/or time. Gottlieb (1985) has derived such a strategy for the criteria of max- 
imizing mean first passage time to a goal (G3), subject to a high probability of reaching 
the goal ($3). 


Growth versus Security in Dynamic Investment Analysis 335 


1566 L. C. MACLEAN, W. T. ZIEMBA AND G. BLAZENKO 
TABLE I —PART 2 
Main Properties for the Kelly Criterion 
Good/Bad Property Reference 
B One overbets when the problem data Betting more than the optimal Kelly wager is 
is uncertain. dominated in a growth-security sense. Hence if 
the problem data provides probabilities, edges 
and odds that may be in error, then the 
suggested wager will be too large. This property 
is discussed in and largely motivates this paper. 
B The total amount wagered swamps Ethier and Tavaré (1983) and Griffin (1985) show 
the winnings—that is, there is that the Expected Gain/F Bet is arbitrarily small 
much “churning.” and converges to zero in a Bernoulli game where 
one wins the expected fraction p of games. 
B The unweighted average rate of Related to property 5 this indicates that you do not 
return converges to half the scem to win as much as you expect; see Ethier 
arithmetic rate of return. and Tavare (1983) and Griffin (1985). 
G The E log X bettor is never behind Finkelstein and Whitley (1981) 
any other bettor on average in 1. 
2,... trials. 
G The £ log X bettor has an optimal This is a crucially important result for practical use. 
myopic policy. He does not have Hakansson (1971b) proved that the myopic 
to consider prior to subsequent policy obtains for dependent investments with 
investment opportunities. the log utility function. For independent 
investments and power utility a myopic policy is 
optimal. 
G The chance that an £ log X wagerer Bell and Cover (1980) 


will be ahead of any other wagerer 
after the first play is at Icast 50%. 

G Simulation studies show that the £ 
log X bettor's fortune pulls way 
ahead of other strategies’ wealth 
for reasonably-sized sequences of 
investments. 


Ziemba and Hausch (1985) 


Rather than trying to move from optimal growth to optimal security along the efficient 
frontier, we will consider the path generated by convex combinations of the optimal 
growth and optimal security strategies, namely 


P(A) = Ap t (1 — X)po, 


Clearly p(A) is a fixed fraction strategy and we will call it fractional Kelly since it 
blends part of the Kelly with the cash strategy. The fractional Kelly strategy is n -dimen- 
sional, specifying investments in all opportunities, and the key parameter is ^, which we 
can consider as a tradeoff index. 

The path traced by the fractional Kelly strategies is 


U? = ((GpQ), S(p(A)))|0 € A < 1]. 


The fractional Kelly path is illustrated in Figure 1. This path has attractive properties. 
First, it is easily computable, as shown in the next section. When A varies from 0 to | 
we continuously trade growth for security. Although this tradeoff is not always efficient, 
it is effective in that we see a monotone increase in security as we decrease growth. 
However, the rate of change in growth or security with respect to À 1s not constant and 
thus the Kelly path is curvilinear. In fact, MacLean and Ziemba (1990) established that 
the Kelly path is above the straight line (secant) joining the optimal growth and optimal 
security points. These properties are discussed in $3. 


O<A 1. 
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Si 


Efficient frontier 


Fractional Kelly path (effective) 


Gi 


FIGURE Í. Graph of (Growth, Security) Profile Showing Efficient Frontier and Effective Path. 


2. Computation of Measures 


If we select a strategy which specifies our investment in the various opportunities and 
we stick with that strategy, then each of the measures described in 82 is easily computable. 
We will now develop the computational formulae. Since most of our discussion focuses 
on fractional Kelly strategies, we work with them. However, more general fixed strategies 
could be used. 

Consider the fractional Kelly strategy p( A) and use the transformation 


Z(p(A), w) = In ¥,(p(A), e). 
From (1), 


Z(p(X), w') = zo + 2; In (z Ron) = ZPA), o^!) + IOA), o), 


is0 


sel 

where J(p(A), w) = In (Xo Ri(w)p;{A)) is a stationary jump process and zo = In yo. 

Hence, Z,, f= |, ..., is a random walk and we can evaluate the various measures of 

interest using this process. An alternative approach to evaluating these measures is pre- 

sented in Ethier (1987) where the discrete time (and maybe discrete state) process Y,(p(A)) 
is approximated by geometric Brownian motion. 
The mean wealth accumulation measure Gl is 


uu yo, p(3)) = EY((p(3)) = vE |I (z Ro) = »(Z n) , (2) 
i=0 


s=l \i=0 
where R; = ER((w),i - 0,...,n. 
For the mean growth rate measure G2 


:(Yo, PAD = E ln (YAPA) = 1/LEZ,(p(A)) 


i l 
= Eis [X RWP) +7 zo = EIDO), 9) +7 20 (3) 


i20 
which is an easy calculation to make. 
For measure S1 assume the process 


ZPA), w) = zo + 22 JPA), ws) 


s=l 
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has an approximate normal distribution. Using this approximation yields 


bř - SM 
a(Z,(p(A))) 
where F is the cumulative for the standard normal, b* = In b,, and F(Z,(p(A))), 


o(Z,(p(A))) are the mean and standard deviation respectively. 
Measure S2 is given by 


YY PA) = 1 — + (4) 


a Yo, P(A)) = Pr[Zi(p(3)) = OF, f= 1,05] (5) 
= [I PriZGO)) = bF | Z.(p(A)) 2 OF, 5 = 1,...,0- 1] 
t=] 
= [T e«(zo, p(A)). (6) 
=1 
Then with f,(z,|Z, = b¥,s = 1,...,£— 1) the conditional density 


a= f h(zl Z: = bł, 5 = 1;...,t— 1)dz, 


izbr 


-Í ( ha 7 xlZ = BES = isst Droodieas, 
Zeb? zy-xebyy 


where r(x) is the density for J(p(A)), 


— 77 * E — 
-Í p ha(z-x|lZzzebft,s-10,...,t 2 seas ds. (7) 
zzbr; X&Zz-bj| 


1-2 


Equation (7) provides a sequential procedure to compute measure S2, requiring only 
the distribution from the previous stage and the jump probabilities. 

For the other measures assume that the random variable J(p(AX)) has finite support 
given by the interval [J4(p( A)), Ju (p( ))]. Consider measure $3 


BCyo, p(X)) = Pr[vivipi zu) € Tivipo»st) | Yol- 
Transforming to an additive process with z — In y and letting 
B(z, p(X)) = Pr[rizoo»stmu) € Tizon) IZ] 
yields 
B(zo, P(A)) = E {Blz + J(p(A)), p). (8) 


The expectation in (8) is over the random variable J(p(A)). We must solve (8) for 6 
subject to the boundary conditions 


B(z, p(3)) = 0 if zxlnL (8a) 
B(z, p(A)) = 1 if z 2 ln U. (8b) 


A solution to (8) is of the form (z, p(A)) = X=, A40£, where A; are chosen to satisfy 
(8a, 8b) and the roots 6,,..., 0, satisfy the equation E0700? = ], [t is easy to show 
that there are two positive roots 0, = 1 and @ = 0, where 0 < 1 or 62 > 1 depending on 
whether EJ(p(A)) > 0 or EJ(p(A)) < O respectively. Similar to the methods discussed 
in Feller (1962, pp. 330-334), we take the minimum J,(p(A)) < 0 and maximum 
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TABLE 2 


Computational Formulas for Growth Measures G1-G3 and Security 
Measures S1-S3 


Ma 


GI u(Yo,p)- zi Rini) 


[t 
° 


G2 (p) = Ein (= Rom) 
i-0 


g . 8^ (Usp), UÀ Zo 
G3 "(Zo p) sei (oun ea) EJG) 


b-ZA- EA 
VKP) 


Sı (Zo, 9) = 1 ~ Fl 


p: 


S2 a(Zo, p) = Il e«(Zo, p), 


t=) 


G,-i(Z, T Jp). | 
eZ, p) = 2. L.E. amp 77 
(0^ — gv yo"* — gin) + (0^ — Cn CA — 8^) 
= 2(g9*» —-g7 yaw’ — EP) 


S3 8(Zo, p) 


JA4(Cp(A)) > 0 jumps and find bounds on the solution by solving a simple system with 
the extreme boundary conditions satisfied as equalities. For the upper bound, 


Bn L + JaA) p(3)) =O and Bn U, p(X)) = I. 


For the lower bound, 


Bn L,p(3)) 20 and — (In U + Ju(p()), p(3)) = V. 


Let L* = In L, L*(p(1)) = In L + J4(p( )), U* = In U, and U*(p()) = In U 


+ Ju(p( ))). 
The bounds for measure S3 are then 


8^»—9g^ z — gL* (Q0) 
groon — gi* = BO; PX) < gos gr oo» - (9) 
For measure G3, 


"( yo, P(A)) = ET, yipyyys Ul yo) = ET | Z(p(ayy21nU | 20} , 


we use the recursive equation 


n( Zo, P(A)) = E(n(zo + JOA), pA))} + 1 (10) 
subject to the boundary condition 
n(z, P(A)) = 0 if z z In U. (10a) 


One solution to (10) is n*(z, p(A)) = —z/(EJ(p(X))). Any other solution can be 
written as (Zz, p(A)) = n*(z, p(A)) + A(z, p(A)) where A(z, p(À)) satisfies the system 


A(z, p(X)) = EL A(z + J(p(d)), p(9)]. (10b) 
Solutions of (10b) have the form A(z, p(A)) = 2 A,0$. Solutions of (10) are then 
MZ, p) = X Akb — 2/(EI(p(A))) 
where the 4, are chosen to satisfy the boundary conditions (10a). With U,(p(A)) 
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7 ]n U — J,,(p(A)) we can solve the extreme boundary conditions as equalities, namely 
qz (U*( p(A)), p(A)) = O and nf U,(p(A)), p(A)) = 0 yields the following bounds on G3: 


U,(p(À 9^ 
| (p( » (gcns Zo <= nl Vo, P(A)) 


EJ(p(X)) EJQO)) — 
-| UO) 9^ Zo (11) 
[EPAD KOTO] EPA) ” 


The bounds in (9) and (11) are quite tight when the jumps are small relative to the 
initial wealth zo. 

Table 2 summarizes the formulas we can use to compute the six growth and security 
measures. For G3 and S3 we have bounds and the formulas given provide midpoint 
estimates of these bounds. A similar table appears in Ethier (1987) for the geometric 
Brownian motion model. 


3. Effective Growth-security Tradeoff 


The fact that the various growth and security measures can be easily evaluated for 
fractional Kelly strategies provides an opportunity to interactively investigate the per- 
formance of strategies for various measures and scenarios. The ability to evaluate various 
performance measures and determine a preferred strategy will depend on an effective 
tradeoff between complementary measures of growth and security. A tradeoff is effective 
if the loss of performance in one dimension (say growth) is compensated by a gain in 
the other dimension (security). In this section we establish that the tradeoff using fractional 
Kelly strategies is effective for each of the profile combinations. In calculating the rates 
of change for the various measures we will see that the tradeoff between growth and 
security is not constant for all À, 0 <A <1. 

(1) Rate Profile: ($, a). 


LEMMA 3.1. Suppose p(X) is a fractional Kelly strategy and p is the Kelly strategy. 
Then 


à$( yo. P(A)) 5 


0 O<dA <1. 
On , 
PROOF. Consider the optimal growth strategy given by P = (Po, ---, Pn), and the 
optimal security strategy po = (1,0,...,0). 
Then 
P(A) = (Apo + (Y — 3), ..., Apa) = (PA), «© PaA) and 
d d 
— À — Dp; j — e. . 
gj Poh ) <0, An PIA) > 9, i= 1,...,0 


We have the optima! growth problem 


max $(p(A)) = max E In (Z Rp) . 


i=0 


Then 


z (p(n) = e| La Zim Rule) and 
Do 


2-0 Ri fw) pir) 


1— Et Ry) P 
E . 
ERIS <0 


à? 
ap; 00) 
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So é(p(A)) is concave in po and with po such that d¢(p(A))/dpo = 0, we have 


ð _ 
ap, MCA) < 0, Do < po. 
Po 
Therefore 


d _ doy ( 8. s imereasing i 
d^ é(p(A)) = dA P WAD] > 0 and #(p(A)) is increasing in A. oO 


LEMMA 3.2. 


Let p( X) be a fractional Kelly strategy and In a < Q be a given fallback 
rate. Then 


ð 
ES ala, p(X)) x 0. 


PROOF. We have for the security measure 
ala, p(X)) = Prob [Y(p(3)) = wal, t= 1, ++] 


= J] Prob [Y,(00) = a'l YAp) = à s = 1... (— LU 


fet 


= [J a(4, p). 
izl 


Along any path ], yi, ..., y,-; with y,z a3,$— 01;...,1— 1. 


[-1 
a,(a, p( A)) = Prob L (z Rcon) >Ina- -l. » zl. 


D—d., 


where z, = In y, and Ina — (1 — 1) ! ZZ z « 0. With b = In a — (1 — 1)! È z, and 
RPA) = È Ry(w)p(d) we have a (à) = 


a,(a, p(X)) = Prob [In (R,p(A)) = b] 
= Prob [In (AR, P + (1 — A) Ro) = b] 


= Prob [A In (R;p) + (1 — A) In (Rp) + Ax b] 


= Prob | In (R, p) = 


= 1 = FQ), 


where x(A) = (6 — A)/X and F, is the distribution function for the optimal growth 
process In ( R,po). 


Then dx(A)/dd > 0 and F nondecreasing imply da,( )/dÀ x 0. Hence do(a, p( ))/ 
dA x0. Ul 


Thus along the fractional Kelly path growth is increasing and security is decreasing 
and an effective tradeoff is possible. 
(it) Wealth Profile: (44, Yi). 


LEMMA 3.3. If p(A) is a fractional Kelly strategy then 


d 
qx (OU) > 0, 0xAxI. 
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PROOF. We have 


à 
a ae (P(A)) 


E] I n 
on. E YQ = Eze. [v] I (2 Rap] 


- 


Eb» 2 > mae Reano) I (= R(an9))| 


r=1 9Po Vico 


WELK (o )Y (9 7!)] = oC EK)(EY,-1) < 0. 


1l 


Then 


dpo(À) ð 
PE (Q0) 0, D 


d 
geo) = 


LEMMA 3.4. Suppose p( X) is a fractional Kelly strategy and consider the growth rate 
a = 1. Then 


d 
qx YO rA) <0 for O<)A <1. 


PROOF. We have 


i 


Y(Yo, P(A)) = Prob [Y,(p(A)) = yoa'] = Prob | > In Rp) = tna 


s=] 


= Prob È In (XR,p + (1 — A)Rspo) = 0] 


= Prob ^ X lIn(Rp)+} A2 o] where A,> 0 


> 
] — F(x,(A)), 


where x,(À) = (~È A,)/A and F' is the distribution function for Z5., In (R,p). Then 
(d/ dA)x(X) > 0 and F' nondecreasing gives the desired result. © 

Hence for the wealth profile an effective tradeoff is possible along the fractional 
Kelly path. 

Gü) Stopping Rule Profile: (n^!, 8B). Our analysis of the stopping rule profile focuses 
on the computational formulas y (p(A)) and B(p(A)). Since we have upper and lower 
bounds for the true values, the accuracy of the approximation is easy to determine in a 
particular application. Our choice of strategy is based on the profile graph and that 
requires 7 and f. 

The formulas for 7 and B are based on the random walk Z,(p(A)) with jump process 
J(p(A)). The process J(p(A)) is flexible at the fractional Kelly strategy p( A) if the con- 
dition 


Prob b In (RB) = — 


I 


L'*«(p(A)8P00P? — EJ'(p(A3))0/70? — U'*( pA) o 0^0» 
L*( p( 3))8 0» < EJ(p(X))8 4 90» < U*(p(X))6 00 


holds, where 0 < 1 is a positive root of the equation E&(p(A)) "?? = 1, and prime 
denotes the derivative w.r.t. À. 


LEMMA 3.5. Suppose p(X) is a fractional Kelly strategy and consider the wealth process 
Y(p(X),t- 1, +++ and the absorbing states U = kys, L = kyo, k > 1. If J(p(X)) is 
flexible at p(X) then for0 X x 1 
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(1) 7 ( 05 p(X)) « 0, 


à 
(ii) E B(yo. p(X)) « 0. 


PROOF. (i) For the mean first passage time consider the lower bound 


U,.(CD(X)) 
2 À = BL ALAS 
n (p( )) EJ( pA) 9 POD , 

where without loss of generality assume yg = 1 (zo = 0). Then 


N^(p(X)) 
D'(p(X)) ' 


where — D'(pQ)) = [EJ(p(A))0 ^99]? — and 
N?(p()) = LEPOAN 9» Fig DON) = Us (OQ) ES (pA) BU 


ð 2 
— "VES 
aps” (2) 


= Ug POET DA) OPO” 7, (p(A)) In 8 
— Ug (PADEN pA) 878? U, (p( A ))6'/6], 


where the prime denotes the first derivative w.r.t. pọ. Since EJ'(p(^)) « 0 and EJ(p(A)) 
> 0, Jin(p(A)) > 0, 0' < 0, we get all terms in N?(p(A)) positive and (9/0po)n?(p(À)) 
> 0. In the same way for the upper bound 7'(p(d)) we get (0/dpo)n'(p(A)) > 0. With 
(d/dX)po(A) < 0 the average of the bounds yields (1). 

(ii) For the first passage probability we utilize the upper bound 


, | — ( 0» 
8 (0(3)) = g^ — g^»: 
Then the numerator of (0/ópg)8' (p(X)) is 


« Oy. . ð . 
N(p(r)) = (8Y — 1) —- 000» — (1 — 4 (PON) — gU . 
0) OPo ODo 


We have E8( p( A) y ^? = | and (8/dpp) E6( p(A) "^? = 0 from which 


li 


ð 
ape &(p(À)) = —O(P(A)) In (pO) LET (oO )0" 099 / EJ(p( X ))879?1] 


(706(p(X)) In 6(p())) VG (9). 


Then 
ð * « L*(p(A)) ð ð 
as tron = ge E g y In à — L* ` ) 
am HD) apo apy POD 
= —L*(p(4))0 PO Wp(r)) In 8 + L'* p(A)) 02° |n 8, 
and 


à * * à 
z— 00 = Ut! — 8 = —U*8 U y(p(A)) In 8. 
apo apo Wp(r)) In 
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Substituting into N(p(A)) with a little algebra yields 
N(p(X)) = (8 = DL POOT 09Py(p(X)) In 0 
+ (1. — 04009) UFO VC p(X)) In 0 
— (0% — 1)L'*( p( X))8 E X90 In 8. 
With U = kyo and L = k7'yo it is easy to show that 


U*9 U* Lip (r)) 9h OY 
19m r TETES 


and therefore 
A = (1 — 900») YQ > (0% — L)L*(p(X))8 ^00» = p, 
We have, then, substituting B for A 
N(p(X)) > 2(1 — 8^) In AL* (DA) OPPO WDA) + L'*( p(X))68 70099) > 0 
since J flexible at p( A) implies 
L*( pA) BP POY DAY) + L*( pA) OV <0. 
So 


ð 
— B'>0. 
Opo 


With the same result for the lower bound and with (d/dA)p(A) < 0 we get (ii). O 

The proof of the above lemma depends on the flexibility condition but not the re- 
quirement that the strategy be fractional Kelly. The significance of the fractional Kelly 
strategies is that the flexibility condition is always satisfied if it is satisfied for the Kelly 
strategy. 

For the stopping rule profile (5 ~', 8) we have along the fractional Kelly path growth 
increasing and security decreasing. The inverse of expected stopping time is a measure 
of growth since faster growth would imply getting to the target U sooner. 

In summary, for each of the profile types we have the same type of behavior. The 
complementary measures, growth and security, move in opposite directions as we change 
the blend between the Kelly and cash strategies. This enables a continuous tradeoff between 
growth and security. 


TRADEOFF THEOREM. Suppose p(X) is a fractional Kelly strategy. Then for each 
profile there is an effective tradeoff between growth and security by varying ^, the trade- 
off index. 


Additional theoretical results concerning these tradeoffs are presented in MacLean 
and Ziemba (1990). 

Friedman (1982) seems to have first considered fractional Kelly strategies in the context 
of blackjack. His results are along the lines discussed in $4.1 below. Other references on 
fractional Kelly strategies are Gottlieb (1984); Ethier (1987); Li, MacLean and Ziemba 
(1990); and Wu and Ziemba (1990). 

The tradeoff theorem in this section establishes the use of growth and security measures 
in determining an investment strategy. There are many possible tradeoffs depending 
upon the choice of measures and the parameters specified in these measures. Rather than 
further considering the preferred measures and tradeoff in theory, we will apply the 
concepts to some real world examples and explore the choice of investment decisions in 
practice. 
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4. Applications 


The methods described in previous sections are now applied to a variety of well-known 
problems in gambling and investment. Rather than concentrating on the results in the 
tradeoff theorem, we will explore a variety of growth and security measures to see how 
interactively a satisfactory decision is achieved from both perspectives. In each case we 
emphasize a pair of measures (growth, security). There are a few points we should make 
about our approach to these applications. Satisfaction with a particular investment strategy 
will depend on many factors. The measures of performance we have described are helpful, 
but they are functions of inputs such as initial wealth, return distribution, and performance 
standards. We must vary those inputs in appropriate ways to see a decision rule under 
different scenarios. In cases where inputs are soft we may need many diverse scenarios. 

Throughout the theory and examples we use fixed strategies. This is not to imply that 
the particular fixed strategy is used forever, but rather a statement that if all inputs remain 
constant we can predict the results from the fixed strategy. Of course, the inputs are 
dynamic and we would regularly (continuously) update the strategy and measures of 
performance based on new input data. This process is facilitated by easily computable 
measures. 


4.1. Blackjack: ($, B) 


The game of blackjack seems to have evolved from several related card games in the 
19th century. It became popular in World War I and has since reached enormous pop- 
ularity. lt is played by millions of people in casinos around the world. Billions of dollars 
are lost each year by people playing the game. A relatively small number of professionals 
and advanced amateurs, using various methods such as card counting, are able to beat 
the game. The object is to reach, or be close to, twenty-one with two or more cards. 
Scores above twenty-one are said to bust or lose. Cards two to ten are worth their face 
value, Jacks, Queens, and Kings are worth ten points and Aces are worth one or eleven 
at the player's choice. The game is called blackjack because an Ace and a ten-valued card 
was paid three for two and an additional bonus accrued if the two cards were the Ace of 
Spades and the Jack of Spades or Clubs. While this extra bonus has been dropped by 
current casinos, the name has stuck. Dealers normally play a fixed strategy of hit until 
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1.0 Relative growth 
0.8 
Prob. double 
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Prob. quadruple 
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Fraction of Wealth Wagered 


FIGURE 2. Probabilily of Doubling and Quadrupling before Halving and Relative Growth Rates Versus 
Fraction of Wealth Wagered for Blackjack (2% Advantage, p = 0.51 and q = 0.49). 
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TABLE 3 
Growth Rates Versus Probability of Doubling before Halving for Blackjack 


ÃO, p) o(p) 
= P[Doubling = Relative 
À before Halving] Growth Rate 

0.1 0.999 0.19 

0.2 0.998 0.36 

Range 0.3 $ 0.98 $ 0.51 

for 0.4 SAFER 0.94 LESS GROWTH 0.64 

. 0.5 0.89 0.75 

Blackjack 

Teams 0.6 RISKIER 0.83 MORE GROWTH 0.84 

0.7 4 0.78 4 0.91 

0.8 0.74 0.96 

0.9 0.70 0.99 

1.0 KELLY 0.67 1.00 

1.5 0.56 0.75 

Overkill + 2.0 0.50 0.00 
Too Risky 


a seventeen is reached and then stay. A variation is whether or not a soft seventeen (ar 
ace with cards totalling six) is hit. It is slightly better for the player if the dealer stands 
on soft seventeen. The house has an edge of 2-10% against typical players. For example, 
the strategy mimicking the dealer loses about 8% because the player must hit first and 
busts about 28% of the time (0.28? = 0.08). 

In general, the edge for a successful card counter varies from about —10% to +10% 
depending upon the favorability of the deck. By wagering more in favorable situations 
and less or nothing when the deck is unfavorable, an average edge weighted by the size 
of the bet of about 2% is reasonable. Hence, an approximation that will provide us insight 
into the long-run behavior of a player’s fortune is to simply assume that the game is a 
Bernoulli trial with a probability of success w = 0.51 and probability of loss 1 — ~ 
= 0.49. 

We then have 9 = (0, 1}, K(0, p) = p with probability x, and K(1, p) = —p with 
probability 1 — m. The mean growth rate is 


Eln(1 + K(o,p)) ^ *»In(1 + p) * (1 — «)In(l — p). 


Simple calculus gives the optimal fixed fraction strategy: p* = 24 — | if EK > 0; p* 
= Oif EK x 0. (This optimal strategy may be interpreted as the edge divided by the odds 
(1-1 in this case).) In general, for win or lose for two outcome situations where the size 
of the wager does not influence the odds, the same intuitive formula holds. Hence with 
a 2% edge betting on a !0~! shot, the optimal wager is 0.2% of one's fortune.) The growth 
rate of the investor's fortune is 


é(p) 7 *«In(1 t p) c (1 — v»)In(!— p) 


and this is shown in Figure 2. It is nearly symmetrical around p* = 0.02. Security measure 
S3 is also displayed in Figure 2, in terms of the probability of doubling or quadrupling 
before halving. The bounds, from (5), are fairly sharp. Since the growth rate and the 
security are both decreasing for p » p*, it follows that it is never advisable to wager more 
than p*. However, one may wish to trade off lower growth for more security using a 
fractional Kelly strategy. Table 3 illustrates the relationship between the fraction ^ and 
growth and security. For example, a drop from p = 0.02 to 0.01 for a 0.5 fractional Kelly 
strategy, drops the growth rate by 25%, but increases the chance of doubling before 
halving from 67% to 89%. 
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In the blackjack example we see how the additional information provided by the 
security measure (yo, p) was valuable in reaching a final investment decision. In a 
flexible or adaptive decision environment, competing criteria would be balanced to achieve 
a satisfactory path of accumulated wealth. With this approach professional blackjack 
teams typically use a fractional Kelly wagering strategy with the fraction À = 0.2 to 0.8. 
See Gottlieb ( 1985) for discussion including the use of adaptive strategies. 


4.2. Horseracing: (m, yi) 


Suppose we have n horses entered in a race. Only the first three finishers have positive 
return to the bettor. For the remaining positions, you lose your bet. Then 


Q= {(1,2,3),..., (j, Ex 2 1,2)] 


is the set of all outcomes with probability z;4. We wager the fractions p;1, Pi2, Pi3 of our 
fortune Y, on horse / to win, place, or show, respectively. One collects on a win bet only 
when the horse 1s first, on a place bet when the horse is first or second, and on a show 
bet when the horse is first, second, or third. The order of finish does not matter for place 
and show bets. The bettors, wagering on a particular horse, share the net pool in proportion 
to the amount wagered, once the original amount of the bets are refunded and the winning 
horses share the resulting profits. Let p be the n X 3 matrix of wager fractions, where 


n 


iI 2554 pj € 1. The return function for a particular (i, j, k) outcome is 


pa _, (OP -Pi = P) Pa, Pp 
w, 2 


n (SSAA m. Dj3 Jj 


K((i, j, kh p) = (QW — w) 


+ 
3 S cu 


-[Em* 2 mt È ZI 


hei LN Dj 


where Q = | — the track take, W;, P; and S; are the total amounts bet to win, place, and 
show on the various horses, respectively, and W = È W; P= È P, S= È S. 

As an example we will consider a simplified scheme where a wager is made on a single 
horse per race and there are five races occurring simultaneously across tracks ( Hausch 
and Ziemba 1990). The required information on the races is given below. Each of the 
horses has an expected return on a $1 bet of $1.14, so this is a very favorable situation. 


Probability of Collecting 
Race Horse on Wager Odds 
l A 0.570 l-1 
2 B 0.380 2-1 
3 C 0.285 3-1 
4 D 0.228 4-l 
5 E 0.190 5-1 


The Kelly strategy is 
p* — (p6, P1, D2, D3 , Pa; Ps) = (0.8529, 0.0140, 0.0210, 0.0141, 0.0700, 0.0280). 


So 1.4% of your fortune is bet on horse A in race 1. Using the growth measure ,(p) 
= mean accumulation of wealth, and the security measure y,(p) = probability that 
accumulated wealth will exceed 6,, the Kelly strategy was compared to half Kelly and 
the fixed fraction strategies which bet 0.01, 0.05 and 0.10 respectively, distributed across 
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TABLE 4 
Y Yo, p) = Prob Y(p) Will Exceed b, (t = 700, Yo = 1000) 
Strategy — p 
Proportional 

b, Kelly 0.5 Kelly 1% 5% 10% 
500 0.940 0.992 1.000 0.915 0.518 
1,000 0.892 0.961 0.980 0.856 0.517 
5,000 0.713 0.69] 0.061 0.631 0.358 
10,000 0.603 0.445 0.003 0.512 0.304 
100,000 0.231 0.028 0.000 0.148 0.141 
uCpr* $17,497 $8,670 $2,394 $10,464 $1,176 


* The median is used here since Y (p) is approximately log normal (skewed) (Ethier 1987). 


horses in the proportions (0.1, 0.3, 0.3, 0.2, 0.1). These fixed fraction strategies are not 
fractional Kelly. 

The results for u,(p) and y,(p) are given in Table 4. The Kelly strategy with a 6% 
betting fraction and the 596 fixed fraction are similar, both having favorable growth and 
security. The smaller betting fractions provide slightly more security but have much less 
growth. Since the 1096 strategy 1s beyond the Kelly fraction it has less growth and less 
security. An interesting point is that the total fraction bet is more significant than the 
distribution of the bets across the horses. 

Implicit in the above example is the ability to identify races where there is a substantial 
edge in the bettor's favor. There has been considerable research into that question. Hausch 
et al. (1981) demonstrated the existence of anomalies in the place and show market. At 
thoroughbred racetracks about 2-4 profitable wagers with an edge of 1096 or more exist 
on an average day. The profitable wagers occur mainly because: (1) the public has a 
distaste for the high probability-low payoff wagers that occur on short priced horses to 
place and show, and (2) the public's inability to properly evaluate the worth of place 
and show wagers because of their complexity—for example, in a ten-horse race there are 
72 possible show finishes, each with a different payoff and chance of occurrence. In 
Hausch et al. (1981) and more fully in Hausch and Ziemba (1985) equations are de- 
veloped that approximate the expected return and optimal Kelly wagers based on minimal 
amounts of data to make the edge operational in the limited time available at the track. 

Ziemba and Hausch (1987) implement and discuss these ideas and explore various 
applications, extensions, simulations and results. A laymen’s article discussing this appears 
in the May 1989 issue of OMNI. A survey of the academic literature on racetrack and 
other forms of sports betting is in Hausch and Ziemba (1992). 


4.8. Lotto Games: (n, B) 


Lotteries have been played since before the birth of Christ. Organizations and govern- 
ments have long realized the enormous profits that can be made, based on the greed and 
hopes of the players. Lotteries tend to go through various periods of government control 
and when excessive abuses occur, they have often been shut down or outlawed, only to 
resurface later. Since 1964, there has been an unprecedented growth in lottery games in 
the United States and Canada. Current yearly sales are over thirty billion dollars, while 
net profits to various governmental bodies are well over ten billion dollars. Since the 
prizes are very large, the public cares little that the expected return per dollar wagered is 
only 40-50 cents. Typically, half the money goes to prizes, a sixth to expenses, and a 
third to profits. With such a low payback, it is exceedingly difficult to win at these games 
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TABLE 5 
Lotto Games 


Case A Case B 
Contribution Contribution 
Probability to Expected to Expected 
Prizes of Winning Prize Value Prize Value 
Jackpot 1/13,983,816 $6 M 42.9 $10M 71.5 
Bonus, 5/6* 1/2,330,636 $0.8 34.3 $1.2 M 51.5 
5/6 1/55,492 M 9.0 $10,000 18.0 
4/6 1/1,032 $5,000 14.5 $250 24.2 
3/6 1/57 $150 .17.6 $10 17.5 
$10 
118.1 182.7 
Edge 18.195 82.795 
Optimal Kelly Bet 0.000,000,11 0.000,000,65 
Optimal Number of Tickets 
Purchased per Draw with 
$10 M Bankroll 11 65 


and the chances of winning any prize at all, let alone one of the big prizes, is very small. 
Is it possible to beat such a game with a scientific system? The only hope of winning is 
to wager on unpopular numbers in pari-mutuel lotto games. A lotto game is based on a 
small selection of numbers chosen by the players. Typically, games have forty to forty- 
nine numbers and you must choose five or six numbers. Management then draws the 
winning numbers and prizes typically accrue to those with three, four or more correctly 
matching numbers. By pari-mutuel, it is meant that the net pools for the various prizes 
are shared by those with winning tickets. Hence, if a small number of people, with so- 
called unpopular numbers, win a prize, it will be larger than if more people with popular 
numbers are sharing. Ziemba et al. (1986) investigated the Canadian 6/49, which played 
across the country, and other regional games. They found that numbers ending in nines 
and zeros and high numbers tended to be unpopular. Collections of unpopular numbers 
have a slight edge, a dollar wagered was worth more than a dollar on average, and this 
edge became fairly substantial when there was a large carryover. With a large carryover, 
that is, a jackpot pool that 1s steadily growing because the jackpot has not been won 
lately, the edge can be as high as 100% or more. Despite this promise, investors may still 
lose because of the very reasons that Chernoff found, essentially reverberation to the 
mean and gamblers ruin. Still, with such substantial edges, it is interesting to see how an 
investor might do playing such numbers over a long period of time. 

Two realistic cases are developed in Table 5. Case A corresponds to the situation when 
the investor wagers only when there is a medium-sized carryover and the numbers that 
are drawn are quite unpopular. Case B corresponds to the situation where there is a huge 
carryover and the numbers drawn are the most unpopular. About one draw in every five 
to ten is similar to Case A and one draw in every twenty or more corresponds to Case 
B. Also, we have been generous in the suggested prizes. In short, we are giving the un- 
popular number system an at least fair chance to see if it has any hope of being a winning 
system in the Jong run. These cases correspond to the Canadian situation of paying the 
lotto winnings in cash up front tax free. The US situation of paying the lotto winnings 
over twenty years and taxing these winnings yields prizes with expected returns about 
one-third of those in Canada. 

First we observe that the optimal Kelly wagers are miniscule. This is not surprising 
since for Case A, 77.2 cents of the $1.18 expected value and for Case B, $1.23 of the 
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FiGURE 3. Probability of Doubling, Quadrupling and Tenfolding before Halving for Lotto 6/49, Case A. 


$1.83 expected value, respectively, is composed of less than a one in a million chance of 
winning the jackpot or the bonus prize. One needs a bankroll of $1 million to justify 
even one $1 ticket for Case A and over $150,000 for Case B. If one had a bankroll of 
$10 million, the optimal Kelly wagers are just 11 and 65 $1 tickets respectively. Figures 
3 and 4 show the chance that the investor will double, quadruple, or tenfold his fortune 
before it is halved, using Kelly and fractional Kelly strategies for Cases A and B respectively. 
For the Kelly strategies, these chances are 0.4 to 0.6 for Case A and 0.55 to 0.80 for Case 
B. With fractional Kelly strategies in the range of 0.00000004 and 0.00000025 or less, 
the chance of tenfolding one's initial fortune before halving it is 95% or more with Cases 
A and B respectively. This is encouraging, but it takes an average 294 and 55 billion 


FIGURE 4. 


Probability 


1.0 


0.8 


0.6 


0.4 


0.2 


0.0 


Growth rate 


Quadruple 
Tenfold 


0.0E+00 2.0E-07  4.0E-07 $.0E-07  8.0E-07  1.0E-06 1.2E-06 


Optimal Kelly Fraction of Wealth Wagered 
Wager 


Probability of Doubling, Quadrupling and Tenfolding before Halving for Lotto 6/49, Case B. 
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FIGURE 5. Probability of Reaching the Goal of $10 Million before Falling to $4 Million with Various Initial 
Wealth Levels for Kelly, } Kelly and 1 Kelly Wagering Strategies for Case A. 


years, respectively, to achieve this goal. These calculations assume that there are 100 
draws per year. 

Figure 5 shows the probability of reaching $10 million before falling to $4 million for 
various initial wealth levels in the range $4-$10 million for Cases A and B with full 
Kelly, half Kelly and quarter Kelly wagering strategies. The results are encouraging to 
the millionaire lotto player especially with the smaller wagers. For example, starting with 
$1 million there is over a 9596 chance of achieving this goal with the quarter Kelly strategy 
for Cases A and B. Again, however, the time needed to do this is very long being 914 
million years for Case A and 482 million years for Case B. More generally we have that 
Case A with full Kelly will take 22 million years, half Kelly 384 million years and quarter 
Kelly 915 million years. Case B with full Kelly will take 2.5 million years, half Kelly 
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FIGURE 6. Probability of Reaching $10 million before Falling to $25,000 with Various Initial Wealth Levels 
for Kelly and } Kelly Wagering Strategies for Case B. 
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19.3 million years and quarter Kelly 482 million years. It will take a lot less time to 
merely double one's fortune rather than tenfolding it, but it is still millions of years. For 
Case A it will take 4.6, 2.6 and 82.3 million years for full, half, and quarter Kelly, 
respectively. For Case B it will take 0.792, 2.6 and 12.7 million years for full, half and 
quarter Kelly. 

Finally, we investigate the situation for the nonmillionaire wishing to become one. 
First, our aspiring gambler must pool his funds with some colleagues to get enough 
bankroll to proceed since at least $150,000 is needed for Case B and $1 million for Case 
A. Such a tactic is legal in Canada and in fact highly encouraged by the lottery corporations 
who supply legal forms for such an arrangement. For Case A, our player needs a pool 
of $1 million even if the group wagers only $1 per draw. Hence the situation is well 
modeled by Figure 5. Our aspiring millionaire puts up" $100,000 along with nine other 
friends for the $1 million bankroll and when they reach $10 million each share is worth 
$1 million. The pool must play full Kelly and has a chance of success of nearly 50% 
before disbanding if they lose half their stake. Each participant does not need to put up 
the whole $100,000 at the start. Indeed, the cash outflow is easy to bankroll, namely 10 
cents per week per participant. However, to have a 5096 chance of reaching the $1 million 
goal each participant (and his heirs) must have $50,000 at risk. On average it will take 
22 million years to achieve the goal. 

The situation is improved for Case B players. First, the bankroll needed is about 
$154,000 since 65 tickets are purchased per draw for a $10 million wealth level. Suppose 
our aspiring nouveau riche is satisfied with $500,000 and is willing to put all but $25,000/ 
2 or $12,500 of the $154,000 at risk. With one partner he can play half Kelly strategy 
and buy one ticket per Case B type draw. Figure 6 indicates that the probability of success 
is about 0.95. On average with initial wealth of $308,000 and full Kelly it will take 3 
million years to achieve this goal. With half Kelly it will take 2.7 million years and with 
quarter Kelly it will take 300 million years. 

The conclusion then seems to be: millionaires who play lotto games can enhance their 
dynasties’ long-run wealth provided their wagers are sufficiently small and made only 
when carryovers are reasonably large; and it is not possible for non-already-rich people, 
except in pooled syndicates, to use the unpopular numbers in a scientific way to beat the 
lotto and have confidence of becoming rich; moreover these aspiring millionaires are 
most likely going to be residing in a cemetery when their distant heir finally reaches 
the goal. 


4.4. Playing the Turn of the Year Effect with Index Futures: (¢, 8) 


Ibbotson Associates (1986) have considered the actual returns received from invest- 
ments in US assets with different levels of risk during the period 1926-1985. While small 
stocks outperformed common stocks by more than 4 to 1, in terms of the cumulative 
wealth levels, their advantage is totally in the last two decades and most of the gains are 
in the 1974+ bull market. These returns are before taxes so that the net return after taxes 
adjusted for inflation for the "riskless" investments and bonds may well be negative for 
many investors. 

One way to invest in this anomaly in light of the Roll (1983) and Ritter (1988) results 
is to hold long positions in a small stock index and short positions in large stock indices, 
because the transaction costs are less than a tenth of that of trading the corresponding 
basket of securities. During the time of this study the March Value Line index was a 
geometric average of the prices of about 1,700 securities and emphasizes the small stocks 
while the S&P 500 is a value weighted index of 500 large stocks. Hence the VL/S&P 
special makes are long in small stocks and short in big stocks at the end of the year. Each 
point change in the index spread is worth $500. By January 15, the biggest gains are over 
and the risks increase. On average, the spread drops 0.92 points in this period with a 
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FiGURE7. Turn of Year Effect: Relative Growth Rate and the Probability of Doubling, Tripling and Tenfolding 
before Halving for Various Fractional Kelly Strategies. 


high variance. The projected gain from a successful trade is 0-5 points and averages 2.85 
points or $1,342.50 per spread, assuming a commission of 1.5 X $55. On average, the 
December 15 to (—1) day gain on the spread, is 0.57 points. However, it was 1.05 in 
1985 and 3.15 in 1986 which may reflect the fact that with the thin trading in the VL 
index, the market can be moved with a reasonably small number of players, who are 
learning about the success of this trade, i.e. the basis was bid up anticipating the January 
move. The average standard deviation of the VL/S&P spread was about 3.0. With a 
mean of 2.85 the following is an approximate return distribution for the trade: 


Gain 7 6 5 4 3 2 l 0 -1 
Probability 0.007 0.024 0.070 0.146 0.217 0.229 0.171 0.091 0.045 


The optimal Kelly investment based on the return distribution is a shocking 74% of 
one’s fortune! Such high wagers are typical for profitable situations with a small probability 
of loss. Given the uncertainty of the estimates involved and the volatility and margin 
requirements of the exchanges a much smaller wager is suggested. 

Figure 7 displays the probability of doubling, tripling and tenfolding one’s fortune 
before losing half of it, as well as the growth rates, for various fractional Kelly strategies. 
At fractional strategies of 25% or less the probability of tenfolding one’s fortune before 
halving it exceeds 90% with a growth rate in excess of 50% of the maximal growth rate. 
Figure 8 gives the probability of reaching the distant goal of $10 million before ruining 
for Kelly, half Kelly and quarter Kelly strategies with wealth levels in the range of $0- 
$10 million. The results indicate that the quarter Kelly strategy seems very safe with a 
99+% chance of achieving this goal. 

These concepts were used in a $100,000 speculative account for a client of CARI Ltd., 
a Canadian investment management company. Five VL/S&P spreads were purchased 
to approximate a slightly less than 25% fractional Kelly strategy. Watching the market 
carefully, these were bought on December 17, 1986 at a spread of —22.18 which was 
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very close to the minimum that the spread traded at around December 15. The spread 
continued to gain and we cashed out at —16.47 on the 14th for a gain of 5.55 points per 
contract or $14,278.50 after transactions costs. 

A more detailed discussion of many of the issues in this section appears in Clark and 
Ziemba (1987), which was updated by Van der Cruyssen and Ziemba (1992a, b).! 
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Abstract 


This paper discusses the allocation of capita! over time with several risky assets. The capital 
growth log utility approach is used with conditions requiring that specific goals are achieved with 
high probability. The stochastic optimization model uses a disjunctive form for the probabilistic 
constraints, which identifies an outer problem of choosing an optimal set of scenarios, and an 
inner (conditional) problem of finding the optimal investment decisions for a givcn scenarios 
set. The multiperiod inner problem is composed of a sequence of conditional one period prob- 
lems. The theory is illustrated for the dynamic allocation of wealth in stocks, bonds and cash 
equivalents. 
© 2003 Elsevier B.V. All rights reserved. 
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1. Introduction 


The problem of capital accumulation under uncertainty has occupied an important 
place in the theory of financial economics. Given a set of risky investment opportuni- 
ties, a decision maker must choose how much of available capital to invest in each assct 
at each point in time. When the criterion for selecting an investment policy 1s maximiz- 
ing the expected value of the logarithm of accumulated capital, the resulting policy, 
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known as the Kelly or capital growth criterion, has many desirable properties. The 
maximum expected logarithm strategy asymptotically maximizes the log run expected 
growth rate of capital (Kelly, 1956; Breiman, 1961). Moreover, the optimal policy is 
myopic and period-by-period optimization can be used to compute the optimal deci- 
sions (Hakansson, 1972). Breiman (1961) has shown that the expected time to reach 
asymptotically large wealth levels is minimized by this strategy. A theoretical exposi- 
tion of the properties of the capital growth strategy in the intertemporally independent 
and weakly dependent cases appears in Algoet and Cover (1988). Rotando and Thorp 
(1992) apply the Kelly strategy to long-term investment in the U.S. stock market and 
demonstrate some of the benefits and liabilities of that strategy. Thorp (1998) up- 
dates the 1992 paper with additional discussions on blackjack and sports betting. Addi- 
tional discussions appear in Aucamp (1993), Ethier and Tavaré (1983), Finkelstein and 
Whitley (1981), Markowitz (1976), and Rubinstein (1977, 1991). MacLean et al. 
(1992) discuss a theory of growth versus security using fractional Kelly strategies which 
are convex combinations of cash and the Kelly fraction and apply this to several spec- 
ulative investment applications; see also MacLean and Ziemba (1991, 1999). MacLean 
et al. (2000) show that the fractional Kelly strategies lie on a growth-security efficient 
frontier if the assets are lognormally distributed. Without lognormality, the tradeoff of 
growth versus security is monotone but not necessarily efficient. Hakansson and Ziemba 
(1995) review the capital growth literature and various applications. 

The optimality properties of the Kelly strategy are related to expected values, 
either of log wealth or first passage times. But the fraction of wealth invested may 
be unacceptably large because the Arrow—Pratt risk aversion index is the reciprocal of 
wealth and is essentially zero for reasonable wealth levels. Furthermore, if uncertainty 
in the return on investment is considerable then the probability of wealth becoming 
negligible at some point is high with the capital growth strategy.' This downside risk 
of investment strategies has led to a growing interest in risk control. 

The traditional approach to risk control has been to include the variance of wealth 
in the decision problem, and solve for mean-variance efficient strategies (Markowitz, 
1959). More recently Value-at-Risk-based risk management has emerged as the standard 
(Jorion, 1997). The VaR is the floor below which wealth can fall in a specified time 
interval, with a prespecified small probability. 

Investment models can be formulated in discrete or continuous time. In continuous 
time the dynamics of asset prices are usually defined by geometric Brownian motion, 
with the asset prices having log-normal distributions (Merton, 1969, 1992). A con- 
tinuous time model with a VaR constraint is studied by Basak and Shaprio (2001), 
where the VaR risk managers' strategies are contrasted with portfolio insurance. The 
discrete time analogue to the Brownian motion model is a random walk. Computational 
approaches to obtaining optimal growth strategies in discrete time without assumptions 


! For example, Ziemba and Hausch (1986) ran a simulation where an investor has initial wealth S1000 
and makes 700 independent wagers, all of which have an expected value of $1.14 per $1 wagered and all 
of which have not small probabilitics of winning. In 166 of 1000 replications the Kelly bettor had a final 
fortune of $1,000,000 or more. However, the minimum final wealth was only $18. 
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on asset price distributions are presented by Cover (1991); see also Helmbold et al. 
(1996). 

In this paper the computation of strategies for investment in discrete time which 
achieve maximal capital growth subject to a VaR constraint is considered. The emphasis 
is on defining a multistage stochastic programming problem where the constraints are 
identified by a selection of scenarios sampled from the space of potential financial 
market outcomes. Using the VaR constraint, the outcomes are classified as critical and 
noncritical, with feasible investment strategies maintaining the critical outcomes at a 
small percentage. An algorithm for classifying scenarios is developed and illustrated 
with the optimal trade-off of cash, bonds and stocks over time. 


2. Capital accumulation model 


Consider an investor with initial wealth Wo and the opportunity to invest in m risky 
securities. The following assumptions are made about capital markets: no transactions 
costs; no taxes; infinite divisibility of assets; assets have limited liability; borrowing and 
lending are allowed at the same rate; and short selling is permitted. These conditions 
will be referred to as the market assumptions. 

The trading price of security i at time ¢ is Pj(£), i = l,...,m. In discrete time the 
rate of return on a unit of capital invested in security i at time £ is 


Pt+ 1) . 
— ——- — Ri(t), = 1,...,m. l 
py TRO Ftv (1) 
It is assumed that the returns follow a log-normal distribution, so consider ¥;(¢) = 
In P(t), i= l,...,m. Whereas P,(t) is a geometric process, Y;(/) is an arithmetic 
random walk with increments Z;(1) 2 In Rj(£), i — 1,...,m, with Z;(£) having a normal 


distribution. Therefore, the increments can be represented as a random effects linear 
model 


Zi(t) = ni + ójej(t), i= l,...,, (2) 
where e(f)' =(4,(t),...,&m(0)) is N(0,7) and x? =(mj,...,73) is N(u T), with p" = 
(40, ..., Hm) and P = (yj). So, yy is the covariance between m; and z;, the random 


expected rates of return of securities i and j, respectively. 

From the arithmetic random walk with normal increments the conditional distribution 
of log-prices at time /, given x and 4 = diag(0j,...,67,) is (Y(tr)|r, A) ~ N(att4). 
The marginal distribution of log-prices is Y(t) ~ N(ut, Z(1)), where X(t) = (A + er. 

Although this log-normal model for securities prices is specialized, the random rates 
of return provide the flexibility required to match the theoretical prices to observations. 
This discrete time model is the analogue to the geometric Brownian models in continu- 
ous time. The dynamics of price movements are clear from the distribution parameters. 
The underlying parameters (4#, P, 4) generate the securities prices. If these parame- 
ters are known or estimated then the price distributions can be specified. Parameter 
estimation is discussed in Section 6. 

From the initial values the forward price process evolves as a random walk with 
intertemporally independent increments defined by (2). Consider the rate of returns 
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process R,(t)' 9 (Ri(t),..., Ry(t)). t=1,...,7, and denote the multivariate rate of re- 
turns distribution at time ¢ by F;. For the stochastic process R,(t)' =(Ri(t),...,Rm(t)), 
a trajectory or realization of the data proccss is associated with an outcome o in thc 
sample space 2 of all outcomes (trajectories). The distributions Fi,...,F7 generate a 
probability measure P on €2 and the associated probability space (Q, B, P). The sample 
space can be represented as Q — Qi x--- x Qr, with œw, € Q,, the data for time t. The 
information available to the investor at time ¢ is the data on the past, and is represented 
by the filtration By :— (0,0) C B, C- C B, := B, where B, := o(w’) is the o-field 
generated by the history « of the data process œ to time f. So the stochastic process 
is adapted to (5, t = 1,..., T), the augmented filtration generated by c. 

In addition to the risky securities defined on (Q, B, (B;), P) there is a riskless asset 
with rate of return Ro(£) = 1 + r(t). Let R(t)’ =(Ro(t), R(t)! ). 

An investment decision at time ¢ is the proportion of wealth to allocate to cach asset, 
given by 


X(t)! =(Xo(t),...,Xm(t))- (3) 


It is assumed that X(f) can depend on the data history œ' but not on unknown 
future retums, so it is B, predictable. The budget constraint at each time requires 
Xo X(t) = |. The proportions of wealth invested in risky assets are unconstrained, 
since the proportion invested in the risk-free asset can always be chosen (with borrow- 
ing or lending) to satisfy the budget constraint. 

An investment strategy is an m+ 1 vector process, X = {X (t), f=1,..., TJ, where T 
is the planning horizon. With initial wealth Wọ, rate of returns process R and investment 
process X, the capital accumulated to time ¢ is 


W(t) = Wo | [R6 X69), t=1,...,7. (4) 
s=] 


The paths of the capital accumulation process are controlled by the investment strat- 
egy, and the investor selects a strategy based on anticipated performance as indicated 
by measures for growth and security (risk). 


3. Downside risk control 


For the geometric growth process of capital accumulation in (4) a natural perfor- 
mance measure is the geometric mean E[W(T ) ^]  E[(Wo []_, R( X (0)"7]. Since 


T 
W(T)/* = Wo (so E Yea) ; 


G(X) = ty In(R(t)'X(t)) is the growth rate of capital. The geometric mean is 
maximized by the optimal expected growth rate strategy from 


l T 
max E[G(X)] = 7 ZERO XII (5) 
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The growth optimal strategy .X* from (5) is called the Kelly (1956) strategy. This 
strategy has been studied extensively. For advanced proofs of optimality properties with 
minimal assumptions see Algoet and Cover (1988). 

The Kelly strategy is very risky. Although it provides optimum growth in the long 
run it is possible to experience negative growth in the medium term and in any period 
experience a substantial loss of capital (drawdown). Discussion of these properties 
appears in Table 1 of MacLean and Ziemba (1999). 

Because of the volatility of financial markets it is prudent (and frequently it is a legal 
requirement) to include downside risk control in the decisions on investment strategy. 
To put risk measurement in context, consider the following definition, an adaptation of 
one provided in Breitmeyer et al. (1999). 


Definition 1 (Downside risk measure). Consider a wealth process (W(t), t — 1,..., T) 
with corresponding distributions (H,, t = 1,..., T). Let q € 9 be an arbitrary num- 
ber which partitions wealth trajectories into acceptable and unacceptable sets, denoted 
by C and C, respectively. If V is the set of probability distributions for the wealth 
process, a downside risk measure $ is a function ¢:V x R — NK satisfying the 
axioms 


1. (non-negativity): $(H,q) = 0, 

2. (normalization): If H(@) — 0 for all o € C, then $(H,q) — 0, 

3. (downside focus): If H and A are distributions over wealth trajectories such that 
H(w)=A(w), for o € C, then $(H, q) = HË, q). 


There are additional axioms which consider properties such as consistency, continuity 
and invariance (Breitmeyer et al., 1999; Artzner et al., 1999), but the properties in our 
definition are basic. 

The standard measure for downside risk in the financial industry is VaR (Jorion, 
1997). It is defined as the loss, which is exceeded with some given probability «, over 
a given time horizon. The intention is to control VaR, so a prespecified minimum value 
w is given and the value at risk with probability « must exceed w. Equivalently, the 
probability that wealth exceeds w at the VaR horizon is at least 1 — a. 

Although VaR is widely used, it has some undesirable properties (see Artzner et al., 
1999; Basak and Shaprio, 2001). Other proposed measures are the period-by-period 
drawdown (Grossman and Zhou, 1993) and the incomplete mean (Basak and Shaprio, 
2001). The drawdown used in this paper considers the potential fraction of wealth 
lost at each period. The incomplete mean is the partial expected value in the lower « 
percentile of the wealth distribution. 


Definition 2 (Risk measures). Consider a wealth process (W(t) t = 1,...,T} with 
corresponding distributions (H,, t= 1,..., T). 


1. The value at risk measure for horizon T and wealth value w is 


di(H,w) = Pr[W(T) 2 w] = 1 — Hr(w). 
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2. The drawdown measure for decay fraction b c [0,1] is 
$3(H, b) = Pr[W (t +1) > bW(t),t=1,..., T]. 


3. The incomplete mean measure for horizon T and specified percentile « is 
Wx 
$3(H, «) = / wdHr(w), 
Jo 


where Pr[W(T) < w,] = a. 


Each of these measures satisfies the basic axioms for downside risk. $1(H,w,) and 
Q3(H,«) have the same unacceptable sets, and $3 is a measure of the expected loss 
on the unacceptable set. With respect to d» it is possible to link 5 to the value w 
in $i. 

Let b(w) = (w/Wo)""", and see that the unacceptable set for œ, is contained in the 
unacceptable set for 2. Therefore $2(H,b(w)) < $i(H,w). So do and $3 are more 
stringent risk measures than $;. 

The purpose in defining a risk measure is to develop an investment strategy which 
achieves capital growth while controlling for risk. Subsequent discussion of risk will 
concentrate on the VaR measure $,. The approach followed is easily adapted to other 
risk measures. 

Consider the general growth with security problem 


sup ELGQO]I EO. v) 21-a)j, (6) 


where H(X) —(Hi(X),..., Hr(X)) is the distribution over wealth generated by invest- 
ment strategy X = {X(t),f=1,...,7},w is a prespecified wealth floor and | — a is a 
confidence level. It is assumed that the measurability conditions previously discussed 
for the return process and the investment process are imposed. 

Let w* =1/Wo. Problem (6) can be written more explicitly in terms of rate of returns 
as (CCP): 


T 
Pr | So nR) X(t) > zd 21l- a}. (7) 


t=l 


T 
sup PILLE 
X 


t—l 


The rate of return process R = (R(t),t = L,.... T} and the investment process X = 
(X(0),t = 1,.... T) are defined on (0, B, (B,), P). For a given trajectory o € Q let 
the associated return path be R(w) = (R(o,t), t= 1,..., T} and the investment path 
be X(w) = {X(a,),f = !,..., T). The risk measure refers to acceptable and unac- 
ceptable sets of wealth trajectories. Consider sets of measure | — « in the probability 
space, given by B, = {A|P(A) > 1 — x). There are associated sets of wealth trajectories 
for A and its complement 4. An equivalent formulation to (7) based on trajectories 
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is (DP) 


T 
E in(R(t)' X(t 
sup E n(RG) X(t) 


tx 


T 
So In(R(@, 0) X(0, 1) zino',ocA4 . (8) 


(zl 


The disjunctive formulation in (8) defines a sequence of stochastic convex dynamic 
programming problems, Rockafeller and Wets (1978) referred to as inner problems, 
with a constraint for each o € A. 

Another reformulation of (7), found by introducing a weighing variable 0(c), c € Q, 
with 0 < 6(@) < 1, is (NCP): 


T 
sup Ix E in(R(t) X (0)) 


(,8) t=1 


T 
(c) |: In(R(t) X(t) 2 In o 


f=1 
20, EKo] 21— a) . (9) 


At the optimal solution to (9) the weighing variable is an indicator function. That is, 
if (X*,0*) is optimal then there is a set A such that 0 *(c) —1 for WEA, O*(w) =0 
for w € A. Hence, for the optimal investment strategy: (CCP) + (DP) © (NCP). 

The growth with security problems (7)-(9) provide a framework for individual port- 
folio choice, where risk is controlled at a specific level. In multidimensional financial 
markets it is common to identify the underlying sources of systematic risk, and to 
define a small set of generating portfolios or mutual funds. These funds serve an inter- 
mediary role for fund managers to create products which satisfy investor preferences. 
It is important that the growth with security problem works within this intermediation 


theory. 
To understand the generating portfolios (mutual funds) recall for the price process 
it is assumed there exist p « m independent latent factors UT —(U,..., Up), Ui — 


N(0,1), i=1,..., p, so that the log-prices are represented as 
Y(t) 2 pt - tAU + Vt£, (10) 


where ut = (Ht... À = (2y) for loadings Aj, i= 1,....m and j= l,..., p, 
and the covariance Cové) = diag(1, ..., 9). Assume that E[£] = 0, and U and č arc 
independent. Y(t) has the distribution N(ut, Z(t)), where X(1)-14-- T and r—AA'. 
The factor model (10) and the associated conditions are referred to as the structural 
model assumptions. 

In this structure imposed on the price process, the factors U Tz(U,...,U p) are the 
standardized log-prices for the mutual funds. 


Theorem 1 (Financial intermediation). Suppose there exist m risky securities and a 
risk-free asset satisfying the market and structural model assumptions. Then there 
exists a risk-free and p « m risky mutual funds, so that growth with security investors 
as defined by (7) are indifferent between choosing portfolios from among the original 
securities and choosing portfolios from the mutual funds. 
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Proof. Consider the return in period £, R(1) X(1) = $7.9 R/()X((0). From the model 
for securities prices R;(f) — exp(ui + Aj U --ó;5;), where A; is the ith row of the m x p 
loading matrix A in (10). Let 4o = (ai), where aj; = A2/(0? — 67), and 


1 0 
A= 
0 Ag 
Define g(t) =A: X(t). From (10), 577, 22, =o? — 6? and 


p pom 
Yreo-x0 3 (cado) 
j-0 


j=i isl 


=Xo(1) + M Yas gy 0 


i=l j=l ô; 
- 0-1 
i=0 


With A`, the generalized inverse of A, let X(t)=A7 q(t). Hence, the return in period 
t is R(t)" X(t) =(R(t)" A- )g(t)= M(t)" qu) = Eio MDa), with M(t) the return 
on risky mutual fund j, j =1,..., p and M(t) = I -+ r(£). Since the returns in each 
period are the same for the mutual funds and the original securities, the statement in 
. the theorem holds. 


The investment process X — [X (1), t—1,..., T) is defined on the space of trajectories 
(Q, B, (B,], P), where X(t) is B, predictable. An important property of the investment 
process is path independence (Cox and Leland, 2000), so that it depends only on the 
reinvested return on the securities, not on the whole price history. This property is 
satisfied by the growth with security investor. 


Theorem 2 (Path independence). The optimal growth with security investment strat- 
egy X* is path independent, i.e. X(t)* depends on the level of wealth at time t, w, 
but not on the particular path to achieve that wealth. 

Proof. Consider the growth with security problem in the disjunctive form 


SO In(R(e, 0) X(,1)) 2 Inw*,o € 4 \ 


f=1 


DP: sup bee [remotam S 


ACB, 


If 7, is the indicator function for the set 4 and A(c) 20 is a multiplier so that 
A€L(Q,B,P), the space of Lebesgue integrable functions on Q, then a Lagrangian 
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for (DP) is 


7 
Y(X,A,A)=E x In(R(o), t) ! X(0,1)) 


f=] 


; 
+A) (> In(R(w, t)! X(w,t)) — In | . (11) 


mm 
If a solution exists for (DP) then there exist elements (4*, 2*) so that the solution 
to (DP) is given by supy (X, 4*,4*). The Lagrangian is 


| 
Z(X,A*,4*) = S FIC + ly ARO | X(0))]. 


i=l 


Consider gg = E[1 + /-A*] and yf = Er (1 +14e4*), where Ep_, is the expectation 
over the data process from time £+ 1 to time 7. Then, 47 = n; ,- BY, where fi; 
depends only on the additional data in period £. Since In n(w!) = p» m,(w,), where 


mos) = Esm) — E, 1 m(o! ), then 


T T 
sup $ ^ ELC + Lj) In(RG) X] = sup XO Elnia : BF RO X (0))] 
X mel X 


m 
T 
= sup S > E(B?» E [9]. REOTA], 
x it 
where £,_; is the expectation with respect to data for periods 1 to ¢—1. With 7", = 
Eriti 


* 
LENT 


E, [nz ROT XED] = f Ei E inCRGOY o) 


)|—1 


S ma dn (e LN = x) 


t—l 


from Jensen’s inequality and B,.., measurability of y7_, and X(¢). But R,(¢)=exp(Z,(¢)), 
where the increments Z;(f) are intertemporally independent. With X(t) = E, [G7 4 / 
5; ()X(t)], the problem becomes 


T 
sup » ELBE nao X0). 


t— l 


where Y(t) does not depend on the path «X^!, f£ — 1,..., T. 


The path independence property is within the context of planning for T periods with 
a given distribution over returns. As described in Section 6, at the start of the next 
T period plan the history of prices is used to update the distribution over returns. 
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So decisions depend on trajectories through the estimation (filtration) process, which 
is separated from the optimization. 


4. Scenario selection 


Thc growth with security problem in its disjunctive form (8) or equivalent La- 
grangian form (11) involves complicated multivariate integration. Some form of dis- 
crete approximation to distributions for securities prices is required to proceed with 
computation. 

Consider the distributions F, on the returns R,(t), £ = 1,...,T. Assume that in 
R” a grid is constructured with rectangles so that the probability mass from F, in 
each rectangle is equal. Sample a point from each rectangle, generating the empirical 
distribution 77. Let the space of trajectories corresponding to the empirical distribution 
F”, t — V,..., T, be QU), and the corresponding probability space be (QU), BV), p), 
So QU = (o,,...,0;) and P(o;) ^ VI. 

With this lattice structure, a discrete approximation to the growth with security 
problem is (DP,): 


T 
sup [on {ed neo ro 
tz] 


AU epo) | xX” 


T 
S nR" (o, 0)! X" (o1) 


t=1 


Zu (12) 


The inner problem and the set selection problem are now discrete. For a given A and 
the corresponding A‘) the convergence of the solution of the discrete inner problem to 
the solution of the continuous inner problem has been established and bounds on the 
error for given n developed (Pflug, 1999). The identification of the optimal set AM, 
where P? (40) > 1—«, is required. Assume that the specified target w* at the horizon 
T is such that problem (12) has a solution for A™ = Q°), The process for identifying 
the optimal set of scenarios is backward elimination (Culioli, 1996). If k is the largest 
integer such that k/J <a, then up to k constraints corresponding to scenarios in Qo) 
get eliminated and the remaining constraints yield the optimal AC). The elimination is 
one constraint at a time starting from QU? and working backward. In describing the 
elimination procedure the notation AMG Ls) refers to a set of scenarios in the 
family of sets AM Qj, ...,l) defined by the j — 1 elimination steps, where scenarios 
Oi; + Wij, have been eliminated sequentially. The basis for the algorithm is the inner 
problem 


T T 
P(A): sup P» In(R"(t)  X^(0) |S In(R" (0,0) X" (0,1) 2 In w*, o € AO | 
X^ 


t=1 tz] 


The algorithm proceeds as follows: 


[1] Set / 2 0 and AP? = (QU). 
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[2] (Solution step) For each A”(i;|-) EAP), solve the problem P(A (i,|-)), denot- 
ing the optimal (strategy, value) by (X(A(ij]-)), G*(4*)(ij|-))), and the set of 
scenarios corresponding to the active constraints by C;,;(AU(ij]-)). If j = K or 
Cj, 1 ( AU(I|-)) = 0, then designate the problem P(A (,|-)) as k-reduced and go 
to step [4]. 

[3] (Reduction step) For each AMG | EAPO, generate a family of reduced sets, 
indexed by ist, 


AT) js... i) = (Apa AM 5] ) = AMG (0), o € Ca (I|) 


Retum to step [2] with j updated to j+ 1. 

[4] From the set of all k-reduced problems select the set 4“) with the maximum 
optimal value G*(4"?). The optimal strategy for the growth with security problem 
(DP,) is X*(A)) 


The reduction step in the elimination algorithm is founded on the following simple 
result which establishes the existence of candidate scenarios for elimination. 


Theorem 3 (Scenario elimination). Assume there exists an "optimal" set A™ for the 
problem (DP,) and consider the j step reduced problem P(A™(i;|-)) where j active 
constraints have been sequentially eliminated. That is, the corresponding scenarios 
are in QU 400, If CHACHE) # Ü are the scenarios corresponding to active 
constraints in the solution of the reduced problem, then 


Cj (A (i4 )) n [Q0 49] FO. 


Proof. Jt suffices to consider the first elimination. In the solution to the problem with 
all constraints, P(Q), there are sets of scenarios C, (QU?) and QU? /Ci(QU?) core- 
sponding to active and inactive constraints, respectively, where Ci(QU?) Æ (). Since 
the inactive constraints exceed the goal In w* 


Ci(QU) CAM and C(Q?) n [QU /A?] F O. 


The procedure for identifying 4"? has a tree structure for reduced problems. If, for 
example, the number of active constraints in each reduced problem is r > 2, then the k 
eliminations require solving » r problems. This could be unmanageable. A simple 
heuristic is to solve one reduced problem at each stage corresponding to eliminating 
the active constraint in the previous stage with the best result (highest expected log 
wealth). With this heuristic method at most rk problems are solved. The heuristic is 
equivalent to the exact method if there is at most one active constraint at each stage. 


5. Application to the fundamental problem of asset allocation over time 


The computation of growth with security strategies is now illustrated with the de- 
termination of optimal fractions over time in cash, bonds and stocks. Table 1 has 
information on the yearly asset returns of the S&P500, the Salomon Brothers Bond 
index and U.S. T-bills for 1980—1990 with data from Data Resources, Inc. Without 
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Table | 
Yearly rate of return on assets relative to cash (96) 


Parameter Stocks Bonds Cash 
Mean: je 108.75 103.75 100 

Standard deviation: o 12.36 5.97 0 

Correlation: p 0.32 

Table 2 

Rates of retum scenarios 

Scenarios Stocks Bonds Cash Probability 
l 95.00 101.50 100 0.25 

2 106.50 110.00 100 0.25 

3 108.50 96.50 100 0.25 

4 125.00 107.00 100 0.25 


loss of generality cash returns are set to one in each period and the mean returns for 
other assets are adjusted for this shift. The standard deviation for cash is small and is 
set to 0 for convenience. 

A simple grid was constructed from the assumed lognormal distribution for stocks 
and bonds by partitioning 93? at the centroid along the principal axes. A sample point 
was selected from each quadrant with the goal of approximating the parameter values. 
The sample points are shown in Table 2. 

The planning horizon is T = 3, so that there are 64 scenarios each with probability 
1/64. Problems are solved with the VaR constraint and then for comparison, with the 
stronger drawdown constraint. 


(1) VaR control with w* — a. 
Consider the problem 


3 3 
max EN In (RG) X(Q))| Pr [V In(RG) X(0)) 23lna| 21— a} 


tx] t=] 


With initial wealth J/(0) = 1, the value at risk is a>. 

Table 3 presents the optimal investment decisions and optimal growth rate for several 
values of a, the secured average annual growth rate and 1 — a, the security level. 

The heuristic was used to determine A, the set of scenarios for the security constraint. 
Since only a single constraint was active at each stage the solution is optimal. The 
mean return structure for stocks is quite favorable in this example, as is typical over 
long horizons (see Keim and Ziemba, 2000), and the Kelly strategy, not surprisingly, is 
to invest all capital in stock most of the time. It is only when security requirements are 
high that some capital is in bonds. As the requirements increase the fraction invested in 
the more secure bonds increases. The three-period investment decisions become more 
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Table 3 

Growth with secured rate 

Secured Secured Period Optimal 

growth level growth 

rate a |1-x rate (96) 
l 2 3 


Stocks Bonds Cash Stocks Bonds Cash Stocks Bonds Cash 


0.95 0 l 0 0 ] 0 0 ] 0 0 23.7 
0.85 I 0 0 ! 0 0 | 0 0 23.7 
0.9 li 0 0 l 0 0 ! 0 0 23.7 
0.95 | 0 0 ] 0 0 ] 0 0 23.7 
0.99 l 0 0 0.492 0.508 0 0.492 0.508 0 19.6 
0.97 0 l 0 0 i 0 0 l 0 0 23.7 
0.85 l 0 0 ! 0 0 l 0 0 23.7 
0.9 l 0 0 l 0 0 ! 0 0 23.7 
0.95 l 0 0 l 0 0 l 0 0 23.7 
0.99 | 0 0 0.333 0.667 0 0.333 0.667 0 18.2 
0.99 0 | 0 0 I 0 0 l 0 0 23.7 
0.85 l 0 0 1 0 0 l 0 0 23.7 
0.9 l 0 0 ] 0 0 I 0 0 23.7 
0.95 i 0 0 0.867 0.133 0 0.867 0.133 0 19.4 
0.99 0.456 0.544 0 0.27 0.73 0 0.27 0.73 0 12.7 
0.995 0 i 0 0 l 0 0 i 0 0 23.7 
0.85 l 0 0 0.996 0.004 0 0.996 0.004 0 23.7 
0.9 ] 0 0 0.996 0.000 0 0.996 0.004 0 23.7 
0.95 I 0 0 0.511. 0.489 0 0.442 0.558 0 19.4 
0.99 0.27 0.73 0 0.219 0.59 0.191 0.218 0.59 0.192 12.7 
0.999 0 I 0 0 l 0 0 ] 0 0 23.7 
0.85 ] 0 0 0.956 0.044 0 0.956 0.044 0 23.4 
0.9 l 0 0 0.956 0.044 0 0.956 0.044 0 23.4 
0.95 l 0 0 0.381 0.619 0 0.51 0.49 0 19.1 
0.99 0.27 0.73 0 0.008 | 0.02 0.972 0.008 0.02 0.972 5.27 


conservative as the horizon approaches. Although this example is simplified the patterns 
observed illustrate the effect of security constraints on decisions and growth. 


(ii) Secured annual drawdown: 5. 

The VaR condition only controls loss at thc horizon. At intermediate timcs the 
investor could experience substantial loss, and in practice be unable to continue. The 
more stringent risk control constraint, referred to as drawdown, considers the loss in 
each period. Consider, then, the problem 

3 
max EN IRD  X(0) Pr[In(R(t) X(0) > Ind, t= 1,2,3) 21- a 


tz] 


368 L. C. MacLean, R. Sanegre, Y. Zhao and W. T. Ziemba 


950 L.C. MacLean et ul. | Journal of Economic Dynamics & Control 28 (2004) 937—954 

Table 4 

Growth with secured maximum drawdown 

Draw- Secured Period Optimal 

down level growth 

b 1—-x rate (96) 
l 2 3 


Stocks Bonds Cash Stocks Bonds Cash Stocks Bonds Cash 


0.96 0 I 0 0 | 0 0 ] 0 0 23.7 
50 l 0 0 l 0 0 0.846 0.154 0 23.1 
75 l 0 0 0.846 0.154 0 0.846 0.154 0 22.5 
100 0.846 0.154 0 0.846 0.154 0 0.846 0.154 0 21.9 
0.97 0 | 0 0 | 0 0 l 0 0 23.7 
50 1 0 0 | 0 0 0.692 0308 0 2215 
75 l 0 0 0.602 0308 0 0.602 0.308 0 213 
100 0.6902 0.308 0 0.602 0.308 0 0.692 0.308 0 20.) 
0.98 0 l 0 0 l 0 0 1 0 0 23.7 
50 l 0 0 ] 0 0 0.538 0.462 0 21.2 
75 l 0 0 0.538 0462 0 0.538 0462 0 18.6 
100 0.538 0.462 0 0.538 0462 0 0.538 0.462 0 16.1 
0.99 0 I 0 0 l 0 0 | 0 0 23.7 
50 I 0 0 | 0 0 0.385 0.615 0 21.2 
75 1 0 0 0.385 0.615 0 0.385 0615 0 18.6 
100 0.385 0.615 0 0.385 0615 0 0.385 0615 0 16.1 
0.999 0 1 0 0 l 0 0 I 0 0 23.7 
50 l 0 0 l 0 0 0.105 0.284 0.611 17.7 
75 | 0 0 0.105 0.284 0.61) 0.105 0.284 Á 0.611 11.8 
100 0.105 0.284 0.611 0.105 0.284 0611 0.105 0.284 0.611 5.84 


The simple form of the constraint follows from the arithmetic random walk In W(r), 
where 


Pr[W(t 4-1) 2 bW(r), (=0,1,2] = Pr(In W(t + 1) - In W(t) 2 Inb, 1—0,1,2] 
= Pr[In (RG)  X(1)) 2 Inb, £= 1,2,3]. 


In Table 4 the optimal investment decisions and growth rate for several values of 
b, the drawdown and 1 — o, the security level are presented. The heuristic is used in 
determining scenarios in the solution. The security levels are different in the table since 
constraints are active at different probability levels in this discretized problem. 

As with the VaR constraint, investment in the more secure bonds and cash increases 
as the drawdown rate and/or the security level increases. Also the strategy is more 
conservative as the horizon approaches. For similar requirements (compare a — 0.97, 
1—a=0.85 and b=0.97, 1 — 4 — 0.75), the drawdown condition is more stringent, with 
the Kelly strategy (all stock) optimal for VaR constraint, but the drawdown constraint 
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requires substantial investment in bonds in the second and third periods. In general, 
consideration of drawdown requires a heavier investment in secure assets and at an 
earlier time point. It is not a feature of this aggregate example, but both the VaR and 
drawdown constraints are insensitive to large losses, which occur with small probabil- 
ity. Control of that effect would require the lower partial mean violations condition or 
a model with a convex risk measure that penalizes more and more as larger constraint 
violations occur, see e.g. Cariño and Ziemba (1998). These results can be compared 
with those of Grauer and Hakansson (1997) who do calculations with the standard 
capital growth-Kelly model and Brennan and Schwartz (1998) who use a Merton, con- 
tinuous time model with the instantaneous mean returns dependent upon fundamental 
factors. These models have hair trigger type behavior that is very sensitive to small 
changes in mean values (see Chopra and Ziemba, 1993). 


6. Portfolio rebalancing 


The approach to investment planning with downside risk control is to develop an 
optimal strategy over a T period planning horizon using projections of the multivariate 
returns distributions on securities. Although securities prices are dynamic, the changes 
are generated from a pricing model with seed parameters (4, I, A). 

It is anticipated that the planning horizon is short, and the values of the seed pa- 
rameters will be reconsidered at the end of the horizon. An important feature of the 
proposed prieing model is the ability to revise the seed parameters using data collected 
during the planning period. 

Consider the observations (Y(1),..., Y(T)). Since (Y(T)|n) ~ N(aT, TA) and zx ~ 
N(u I), from Bayes Theorem 


(n(T)| Y(T)) ~ N(nr, P7), (13) 
where 


ny = wt  —- Arr YYr — n), 


l de 
Fr — gg U - ArZr Mr, 


r= YT), 
Ay — TA 
and 
Sp = T+ TT. (14) 
Furthermore, from the increments Z(t) = Yi(t + 1) — Y(t), ¢=1,...,7 — 1, the 


covariance matrix Sr can be computed. With the number of factors (mutual funds) 
set at p < m, perform a factor analysis on Sy obtaining a loading matrix Lr and the 
specific variance matrix Dr = diag(D?,..., D2). Let St — LrL} + Dr. Then $7 is an 
estimate of Zr, and Dy is an estimate of Ay. If there is a common mean, jt! =(f,..., 4) 
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and Yr — 15 7 , YT), then Yr is an estimate of js. All parameters in (13) and (14) 
are now estimated and the rate of return distribution for the next T period planning 
cycle can be specified. 

The revision of the rate of return distributions produces a rebalanced portfolio in the 
next planning cycle in light of new information on securities prices. The formula for 
mr in (14) displays reversion to the grand mean. Considering the impact of errors in 
estimating mean values (Chopra and Ziemba, 1993), the improved estimates lead to 
more reliable investment decisions. 

An alternative to blending the mean estimate with a grand mean would be to 
blend the mean with the prior Bayes estimate. That is, for successive planning pe- 
riods (1,..., T1, Ty -F 1,..., Ti + Tz} the revised estimate for the mean rate of return is 
Xp, =r, + — An Eg XY r, — nr). This approach provides smoothed estimates where 
the full history of prices is considered with the past being weighted in the manner of 
exponential smoothing. 


7. Conclusion 


This paper considers the problem of investment in risky securities with the objective 
of achieving maximal capital growth while controlling for downside risk. Working in 
discrete time, a geometric random walk model for asset prices was developed. The 
model has two important features. The increments in the random walk have a Bayes 
framework, so that the asset prices depend on hyper parameters. In addition the corre- 
lation in the asset price distributions was related to a structural model and thereby the 
hyper parameters were identified from data. 

A variety of risk measures were defined and corresponding capital growth with 
security problems were presented. The emphasis was on the Value at Risk, but control 
of period-by-period drawdown was also considered in the application. 

An algorithm for the computation of growth with security strategies was presented 
for the problems using discrete approximations to the distributions on asset returns. 
The computational procedure is general and applies to any price distributions, although 
it was presented in the context of the geometric random walk and log normal asset 
prices. 

The methods were applied to an example where investment capital is allocated to 
stocks, bonds and cash over time. At low levels of risk control the capital growth or 
Kelly strategy is optimal. As the risk control requirements are tightened the strategy 
becomes more conservative, particularly close to the planning horizon. The solved 
problems are discrete in time and state, and in contrast to the continuous time lognormal 
model (MacLean et al., 2000) a fractional Kelly strategy is generally not optimal. 
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Ac’ portfolio management is concerned with objectives related to the outperformance 
of the return of a target benchmark portfolio. In this paper, we consider a dynamic ac- 
tive portfolio management problem where the objective is related to the tradeoff between the 
achievement of performance goals and the risk of a shortfall. Specifically, we consider an ob- 
jective that relates the probability of achieving a given performance objective to the time it 
takes to achieve the objective. This allows a new direct quantitative analysis of the risk/return 
tradeoff, with risk defined directly in terms of probability of shortfall relative to the bench- 
mark, and return defined in terms of the expected time to reach investment goals relative to 
the benchmark. The resulting optimal policy is a state-dependent policy that provides new 
insights. As a special case, our analysis includes the case where the investor wants to mini- 
mize the expected time until a given performance goal is reached subject to a constraint on the 
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shortfall probability. 


(Portfolio Theory; Benchmarking; Active Portfolio Management; Stochastic Control) 


1. Introduction 

In this paper we analyze an optimal dynamic port- 
folio and asset allocation policy for an investor who 
is concerned about the performance of a portfolio 
relative to the performance of a given benchmark. 
We take as our setting the standard continuous-time 
framework pioneered by Merton (1971) and others. 
Portfolio problems where the objective is to exceed 
the performance of a selected target benchmark is 
sometimes referred to as active portfolio management, 
whereas passive portfolio management just tries to track 
a benchmark, see, for example, Sharpe et al. (1995). 
Many professional and institutional investors in fact 
follow this benchmarking procedure: For example, 
many mutual funds take the Standard and Poors (S&P) 
500 Index as a benchmark; commodity funds seek 
to beat the Goldman Sachs Commodity Index; bond 
funds try to beat the Lehman Brothers Bond Index, 
etc. Moreover, benchmarking is not specific to profes- 
sional investors, as many ordinary investors implicitly 
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follow a benchmarking procedure, for example, by 
trying to beat inflation, exchange rates, or other in- 
dices. In other applications, such as pension funds, 
the benchmark might be a liability. See Litterman and 
Winkelman (1996) for more detail on these and other 
benchmarks. For a treatment of active portfolio man- 
agement in a static setting, see Grinold and Kahn 
(1995). 

This paper extends the earlier analysis in Browne 
(1999a) of active portfolio management problems 
with objectives related to the achievement of relative 
performance goals and shortfalls (see Browne 2000 
for related stochastic differential games). Specifically, 
Browne (1999a) considered a general problem in an 
incomplete market where the benchmark was only 
partially correlated with the active investor's invest- 
ment opportunities and with investment objectives 
related to the achievement of investment goals and 
shortfalls relative to the benchmark. The specific ob- 
jectives that were explicitly solved for there include: 
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maximizing the probability that the investor's wealth 
achieves a certain performance goal relative to the 
benchmark before falling below it to a predetermined 
shortfall; minimizing the expected time to reach the 
performance goal; and maximizing the expected re- 
ward obtained upon reaching the goal, as well as 
minimizing the expected penalty paid upon falling to 
the shortfall level. The corresponding optimal policies 
obtained there are all constant proportion, or constant 
mix, portfolio allocation strategies, whereby the port- 
folio is continuously rebalanced so as to always keep 
a constant proportion of wealth in the various asset 
classes, regardless of the level of wealth. (Observe 
that if the proportion associated with an asset class 
is positive, then this rebalancing requires selling an 
asset when its price rises relative to the other prices, 
and conversely, buying the asset when its price drops 
relative to the others.) It is well-known that such 
policies have a variety of optimality properties asso- 
ciated with them for the ordinary portfolio problem 
(see, e.g., Merton 1990 or Browne 1998 for surveys) 
and are widely used in asset allocation practice (see 
Perold and Sharpe 1988 and Black and Perold 1992). 
Nevertheless, some investors object to using constant 
proportion strategies in that they dictate the same 
strategy for every wealth level, while their individual 
intuition would suggest otherwise. In this paper we 
address some of these issues for the complete market 
case where the investor is allowed to invest in all the 
individual components of the benchmark. From an an- 
alytic point of view, the problem addressed here is 
solvable only in the complete market case, which is a 
somewhat more restrictive setting than that of Browne 
(1999a). However, it is in fact the complete market case 
that is of most interest to active portfolio practition- 
ers. Our analysis allows us to extend the domain of 
goal/shortfall-related objectives with known explicit 
solutions to a case that allows for a very interest- 
ing and intuitive state-dependent optimal policy. (A 
continuous-time active portfolio management prob- 
lem with a finite-horizon probability-maximizing ob- 
jective in a complete market setting was studied in 
Browne (1999b, 1999c). The optimal portfolio policy in 
that case turns out to be intimately related to hedging 
strategies for certain options, and as such is both time 
and state dependent.) 
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An outline of the remainder of the paper, and a 
summary of our main results are as follows: In the 
next section, we provide a description of the model 
and the problems studied. For the objectives con- 
sidered here, the relevant state variable is the ratio 
of the investor's wealth to the benchmark. We then 
state for reference a general theorem in stochastic 
control for our model, which contains the specific 
goal-related objectives considered in the sequel as a 
special case. The upshot of this theorem is that it 
shows how the optimal value function and associated 
optimal control function for a general control problem 
can be obtained as the solution to a particular nonlin- 
ear Dirichlet problem. The theorem is a special case of 
the more general result in Browne (19992), and so it is 
stated without proof. Because the specific goal-related 
problems considered in the sequel are special cases, 
we need only identify and then solve the appropriate 
nonlinear Dirichlet problem. 

In $3 we apply the theorem to show that the ordi- 
nary optimal-growth portfolio policy for the case of an 
investor without a benchmark is once again optima] in 
our extended model, in that regardless of the underly- 
ing benchmark, the ordinary optimal-growth strategy 
will minimize the expected time until that benchmark 
is outperformed by any given percentage. While this 
result is not new, it is quite important for the sequel. 
In particular, it highlights the rather disturbing prop- 
erty of this optimal-growth policy in that it yields no 
insight for the portfolio manager as to how the bench- 
mark affects the investment decision, because for this 
objective the benchmark is in fact irrelevant. Moreover, 
as we show below, not only is the policy independent 
of the benchmark, but the probability that the active 
manager using the optimal-growth policy reaches an 
investment goal before falling to a shortfall level rela- 
tive to the benchmark is also independent of the bench- 
mark as well as any other parameters of the assets. 
Given these disturbing results, we move on in $4 to 
consider a fractional objective that relates the time to 
beat the benchmark to the probability of a shortfall rel- 
ative to the benchmark. For this objective, the optimal 
strategy is no longer a constant proportion, but rather 
a state-dependent amount that modulates the amount 
invested in the risky assets in inverse proportion to 
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wealth. As a special case, our results are then applied 
to the objective of minimizing the expected time to 
reach the goal subject to a constraint on the shortfall 
probability. This objective is motivated by the interest- 
ing gambling model of Gottlieb (1985). 


2. The Model 


The model under consideration here consists of k + | 
underlying processes: k (correlated) risky assets or 
stocks 50),... , S% and a riskless asset B called a bond. 
The investor may invest in the risky stocks and the 
bond, whose price pi processes will be denoted, respec- 
tively, by (5212 0}, and (B, t2 0). 

The probabilistic Setting is as follows: We are 
given a filtered probability space (9, F, {7}, P) 
supporting k independent standard Brownian mo- 
tions, (W(),...,W(?), where Z, is the P-augmentation 
of the natural filtration £/! = c(W(U, WE», ,w(^: 
0 < s < t} (see e.g., Duffie 1996 for a brief review of 
the relevant terminology). 

It is assumed that these k Brownian motions generate 
the prices of the k risky stocks. Specifically, following 
Merton (1971) and many others, we will assume that 
the risky stock prices are correlated geometric Brown- 
ian motions, i.e., s? satisfies the stochastic differential 
equation 


n 3 k " à 
dS) = siat + 3 oj Paw, fori=l,..,k (0 
j=l 


where {pj :i=1,...,k} and {oj:i,j=1,...,k} are con- 
stants. The price of the risk-free asset is assumed to 
evolve according to 


dB, =r B, dt, (2) 


where r > 0. We assume that p; >r for all i= 1,...,k. 
An investment policy is a (column) vector control 
process f= (fi: e in R* with individual! compo- 
nents fi, i= 1,..., k, where f} ? is the fraction (we use 
f for fraction) or proportion of the investor's wealth 
invested in the risky stock i at time t, for i= l,...,k, 
with the remainder invested in the risk-free bond. It 
is assumed that {f;, t > 0} is a suitable, admissible F~ 
adapted control process, ie., f; is a nonanticipative 
function that satisfies FR idt « oo a.s. for every T < oo. 
We place no other restrictions on f, for example, we 
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allow vf Us whereby the investor is leveraged 
and has borrowed to purchase the stocks, as well as 
1? «0, whereby the investor is selling stock i short. 

Let xf denote the wenlth of the investor at time t 
under policy f, with X; — x. Since any amount not 
invested in the risky stock is held in the bond, this 
process then evolves as 


ky as) »\ dB 
aX} = (0 Fs xí ( = »j \ T 
= t 


ll 


k ` 
x{(+ +P- r)a 
i=} 


koeko is " 
t XY Yi fo;awt? (3) 
i=l j=l 
upon substituting from (1) and (2). This is the wealth 
equation first studied by Merton (1971). 

If we introduce now the matrix o=(o)y and the 
column vectors =(hy,...,$a) 1-(1...,1Y, and 
W, = (WY, wy, we can rewrite the wealth pro- 
cess of (3) as 


aX! = Xr + filu- 3))dt + flo aW]. (4) 


For the sequel, we will also need the matrix X = o6". 
It is assumed for the sequel that the square matrix c 
is of full rank, hence o^! (and X^!) exists. 


24. The Benchmark Portfolio 

As described above, our interest [ies in determining 
investment strategies that are optimal relative to the 
performance of a benchmark. The benchmark we work 
with here is the wealth associated with another portfo- 
lio strategy x — (n(1),..., n(k)' where n(i) denotes the 
fraction of the benchmark wealth invested in the i-th 
stock. Accordingly, the benchmark portfolio evolves 
similarly to (4), as 


dX =X r +n (p 


— r1))dt + nodW). (5) 

For example, if n(i)=0 for each i=1,...,k, then the 
benchmark is simply "cash," which is the relevant 
benchmark in a variety of situations (see Litterman and 
Winkelman 1996) Alternatively, if n(i)=1, with 
1(j) 2 0 for all j z i, then the benchmark is just the 
i-th stock or asset. We note that while the problems 
studied in this paper can in fact be treated with more 
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general benchmarks (and in more general settings), 
here we only consider the constant coefficients case 
for analytical and economic simplicity. 


22. Optimal Growth 
Let r^ be the constant vector defined by 


nm =>7!(u~r1). (6) 


The vector x* plays a fundamental role in the theory 
of finance (see Merton 1990, Ch. 6) and will also play 
a fundamental role in the sequel. Following Merton 
(1990), we refer to the vector n” as the optimal-growth 
portfolio strategy. The reason for this is the policy 
fi =n", for all t, has many optimality properties asso- 
ciated with it in an ordinary portfolio setting (where 
there is no benchmark) that are relevant for growth- 
related objectives. In particular, for an investor whose 
wealth evolves according to (4), and who is not con- 
cerned with performance relative to any benchmark, 
(i) x” maximizes the expected logarithm of terminal 
wealth, for any fixed terminal time T, hence (ii) x” max- 
imizes the (actual and expected} rate at which wealth 
compounds. More interesting, perhaps, and certainly 
more relevant to our concerns here is the property (iii): 
n” minimizes the expected time until any given level of 
wealth is achieved (needless to say, so long as that level 
is greater than the initial wealth). Merton (1990, Ch. 6) 
contains a comprehensive review of these properties 
(see also Browne 1998 for further optimality proper- 
ties). Given these results, it is not surprising that the 
policy n* has extended optimality properties in our 
benchmark-based model as well, as we show below. 
Most relevant to our concerns is the fact that indeed n” 
is the policy that minimizes the expected time until the 
benchmark portfolio strategy is beaten by any given 
percentage (see Corollary 1 below). The fact that this 
holds for any benchmark, however, severely limits the 
applicability of this result in that it does not provide 
any insight for the active portfolio manager as to the 
role the benchmark plays in the investment decision, 
and as such might not be a reasonable objective for an 
active portfolio manager. 

We note also that the ratio of the wealth process of 
any (admissible) portfolio strategy to the wealth pro- 
cess determined by the optimal growth strategy is a 
supermartingale. This fact has important consequences 


MANAGEMENT SCIENCE/ Vol. 46, No. 9, September 2000 


for pricing contingent claims, as described, for exam- 
ple, in Merton (1990, Ch. 6). 


23. Active Portfolio Management 

There are of course many possible objectives related 
to outperforming a benchmark. Here, as in Browne 
(1999), we consider objectives related solely to the 
achievement of relative performance goals and short- 
falls or drawdowns. Specifically, for numbers I, u with 
IXt< Xf cu X, we say that performance goal u is 
reached (relative to the benchmark asset allocation 
Strategy n) if X  zuxg, for some f>0, and that per- 
formance shortfall level } occurs if xf =1X/ for some 
t>0. The active portfolio management problems con- 
sidered for an incomplete market (i.e., where there are 
more sources of risk than there are traded securities) 
in Browne (19992) are: (i) maximizing the probability 
that performance goal u is reached before shortfall 
I occurs; (ii) minimizing the expected time until the 
performance goal u is reached; (iii) maximizing the 
expected time until shortfall / is reached; (iv) maximiz- 
ing the expected discounted reward obtained upon 
achieving goal u; and (v) minimizing the expected 
discounted penalty paid upon falling to shortfall level 
l. Among other scenarios, these objectives are relevant 
to institutional money managers, whose performance 
is typically judged by the return on their managed 
portfolio relative to the return of a benchmark. Browne 
(19992) showed that the optimal strategy for each of 
these objectives was in fact a constant proportions asset 
allocation strategy. While constant proportion strate- 
gies are optimal in a variety of other settings as well, 
many professional investors object to them on the 
grounds that they do not take into account the wealth 
level of the investor. 

In this paper we consider an extended fractional ob- 
jective that relates the probability in Objective (i) to the 
expected time in Objective (ii). It turns out that for this 
objective the optimal strategy is no longer constant, 
but rather state dependent. In particular, the policy is 
hyperbolic in the state variable, where the relevant 
state variable is the ratio of the wealth process to the bench- 
mark. We use standard techniques of stochastic control 
theory (e.g., Krylov 1980) to establish our results since 
this ratio is a controlled diffusion. 
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In particular, because x is a controlled geometric 
Brownian motion, and X7 is another geometric Brown- 
ian motion, it follows directly that the ratio process, 
Z/, where zi = XIX? is also a controlled geometric 
Brownian motion. Specifically, a direct application to 
Ito's formula gives: 


Prorosmon |. For x, Xf defined by (4) and (5), let 
ZÍ be defined by ZI — XÍ/XP. Then, using the definition of 
the vector n* as given in (6), we have 


d2{=Zf (fr = ny Xs - n)di e Z[(fu - nY odW,. (7) 


Alternatively, in integral form we have 
t 
z{ =z | (f-m E (i — Af. + 0) ds 
Jo 
] 
+ [domm mm). e 
0 


Next, we provide a general theorem in stochastic 
optimal control for the process {Z/, t > 0) of (7) that 
covers the specific problems treated here as special 
cases. 


2.4. Optimal Control 

The active portfolio management problems considered 
in this paper are special cases of optimal control prob- 
lems of the following (Dirichlet-type) form: For the 


ratio process (zi, t> 0) given by (7), let 
af = inf{t>0:Z/ =x} (9) 
denote the first hitting time to the point x under a spe- 
cific policy f = (fi, t > 0). For given numbers l, u, with 
1 « Zy <u, let t= min{t/,c{} denote the first escape 
time from the interval (I, u), under this policy f. 
For a given real bounded continuous function g(z) 
and a function h(z) given for z =l, z =u, with h(u) < oo, 
let v/(z) be the reward function under policy f, defined 


by 
Tí 
v(z) - E, ([ g(Z! dt Lp a0) 
0 


with 


v(z)= supv/(z), and f'(z)-argsupv/(z) (1D 
fee fev 
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denoting, respectively, the optimal value function and 
associated optimal control function, where 4 denotes 
the set of admissible controls. (Here and in the sequel, 
we use the notations P,(-) and E;(-) as shorthand for 
P(-|Zy) =z) and E(-|Zo =z).) We note at the outset that 
we are only interested in controls (and initial values z) 
for which v/(z) « oo. 


Remark 1. Observe that the reward functional in 
(10) is sufficiently general to cover a variety of goal- 
related objectives. For example, the probability of beat- 
ing the benchmark before being beaten by it, following 
a given strategy (fi), i.e. P, «d, is a special case 
with e(:) 2 0, h(u) = | and h(I) = 0. Similarly, by taking 
g(-)=1 and h(u) 20 — h(D), we obtain E;(1^). Related 
optimal control problems have been treated previously 
in various forms for a variety of models. In particular 
see Pestien and Sudderth (1985), Heath et al. (1987), 
and Browne (1995, 1997, 19992). Related stochastic dif- 
ferential games are treated in Browne (2000). 

As a matter of notation, we note first that here, and 
throughout the remainder of the paper, the parameter 
y will be defined by 


y— y) s (wm - 2) X(n* — 1/2, 02) 


where x” is the optimal-growth policy of (6) and x is 
the benchmark under consideration. 

The following theorem, which is a special case of 
the more general Theorem 1 in Browne (19992), shows 
that the optimal value function is the solution to a par- 
ticular nonlinear ordinary differential equation with 
Dirichlet boundary conditions, and that the optimal 
policy is given in terms of the first two derivatives of 
this solution. 


THEOREM |. Suppose that w(z) is twice continuously dif- 
ferentiable with the first two derivatives given by w, and 
Wz, and is the strictly concave increasing (i.€., 0; >0 and 
W2z « 0) solution to the nonlinear Dirichlet problem 


ui) 
to;(z) 


-y +9(z)=0, forl«z«u, (13) 
with 
w(l)-h(l), and w(u)-h(u), (14) 


and satisfies the following three conditions: 
(i) (w2(z)/tw22(z)) is bounded for all z in (l, u); 
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(ii) for every t > 0, and every admissible policy f , we have 
t 

ef (Zw (ZÍ)? ds «oc; and (15) 
0 


(ili) (w,(Z)/t0,2(z)) is locally Lipschitz-continuous. Then 
w(z) is the optimal value function, i.e., w(z) = v(z), and, 
moreover, the optimal control vector, f*, can then be written 
as 

ft) n- (r - »a( 22), (16) 


ZW(z) 


where n* is the vector defined in (6). 


The utility of Theorem 1 for our purposes is that for 
various choices of the functions g(-) and h(-), it ad- 
dresses the objective problems discussed earlier. More- 
over, it shows that for each of these problems, all we 
need do is solve the ordinary differential Equation (13) 
and then take the appropriate derivatives to determine 
the optimal control by (16). Conditions (i), (ii), and (iii) 
are just technical conditions that ensure integrability 
of certain functionals, which in turn ensure optimal- 
ity. We will not discuss them further here, but the in- 
terested reader should see Browne (19992). Conditions 
(i) and (iii) are easy to check, and while Condition (ii) 
seems potentially hard to verify, for the cases consid- 
ered here it is in fact easy, as demonstrated below. 


Remark 2. Observe that the representation of the 
optimal control vector f*(z) of (16) demonstrates that 
the optimal portfolio strategy consists of two distinct 
parts: (i) the tracking component 7 and (ii) the active 
component, —(n" — 1)wz/(zw;;). Because w is increas- 
ing and concave in z, the active component associated 
with asset i is positive if n*(i)>mn(1), and negative if 
n*(1) « n(i). That is, the active manager will invest more 
heavily in asset i than the benchmark if the benchmark 
is underinvested in asset i relative to the vector x*, and 
vice versa. The extent to which this occurs depends on 
the specifics of the value function w(z). 


Remark 3. As noted earlier, Theorem 1 is a special 
case of a more general result in Browne 19992, and as 
such we do not provide a formal proof. However, to 
provide some insight into the result, observe that the 
Hamilton-Jacobi-Bellman (HJB) optimality equation 
of dynamic programming for maximizing v/(z) of (10) 
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over control policies f, to be solved for an optimal value 
function v, is 


suba EQ -n)zw-Hf-x)y X(f -1)z^»g)-0, 


(17) 


subject to the Dirichlet boundary conditions v(!) = h(1) 
and w(u) — h(u) (see, e.g., Krylov 1980, Theorem 1.4.5, 
or Fleming and Soner 1993, §IV.5). 

Assuming now that (17) admits a classical solution 
with v; >0 and vz; <0, we may then use standard cal- 
culus to optimize with respect to f in (17) to obtain the 
optimal control function f}(x) of (16), with v = w. When 
(16) is then substituted back into (17) and simplified, 
we obtain the nonlinear Dirichlet problem of (13) (with 
v— w). To complete the proof now, one only needs to 
verify that the solution to the HJB equation is indeed 
the optimal value function, and hence that the policy 
f: is indeed optimal. A verification argument based on 
the martingale optimality principle given in Browne 
(19992) covers the case at hand, provided that Con- 
ditions (i), (ii), and (iii) hold. A stochastic differential 
game-theoretic version of the verification argument 
and martingale principle appears in Browne (2000). 


3. Minimizing the Expected Time 
to Beat the Benchmark 


3.1. Optimality of x* 

We can use Theorem 1 to show that, as claimed ear- 
lier, the ordinary optimal growth portfolio policy, n* 
of (6), is indeed also optimal for minimizing the ex- 
pected time to beat the benchmark by any predeter- 
mined amount, regardless of the underlying benchmark 
strategy n. Indeed, we state this formally in the follow- 
ing corollary to Theorem 1. 


Coroitary 1. Let G*(z)= inf E.(th) with optimizer 
f(z) = arg inf; E] ).. Then for any nz m^, and y as 
defined in (12), we have 


G'(z)- : mÒ), with =r, for allz < u. (18) 
Proor. Observe first that while Theorem 1 is stated 


in terms of a maximization problem, it obviously 
contains the minimization case, as we can apply 
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Theorem 1 to G(z)= sup {-E.(th )}, and then recog- 
nize that G* = —G. As such, Theorem 1 applied with 
g(z)=1 and h(u)=0 shows that G" must solve the 
ordinary differential equation 


Giz) , 

Geel 
together with the boundary condition G*(u)=0. 
Moreover, G* must be convex decreasing (since it is the 
solution to a minimization problem). It is easy to sub- 
stitute the claimed values from (18) into (19) to verify 
that in fact that is the case. Furthermore, we have 
Gz/2G;, =~1, and as such (16) of Theorem 1 shows 
that the optimal contro] for this case reduces to n“. 

It remains to verify whether the Conditions (i) 
(ii) and (iii) of Theorem 1 hold: It is clear that (i) and 
(iii) hold. Condition (ii) is seen to hold for this case 
since we have dG"(z)/dz = —l/(zy), and as such Re- 
quirement (15) reduces here to fi vds «oo, which 
holds trivially. o 


x 


Remark 4. We have just shown that the ordinary 
optimal growth policy, 1", minimizes the expected 
time to a goal in the presence of a benchmark. To 
see that this same policy maximizes logarithmic util- 
ity of the ratio for the investor, simply observe that 


sup, (E[In(Z£.)) = sups(E[In(X7.)]) — E[In(Xz)]. 


3.2. Properties of the Growth-Optimal Ratio 
When n™ is substituted back into (7), we obtain the 
following stochastic differential equation for the ratio 
of the growth-optimal wealth to the benchmark 


dZ (17, n) — Z(n', n)ydt + (8 — nyodW|, (20) 


which implies that under z*, this ratio process is the 
geometric Brownian motion given by 


Zin, n) 2 Zoexp(yt + (i^ — ny oW,). (21) 


While we did not consider a lower shortfall barrier, 
l, in the development above proving optimality of the 
standard growth-optimal portfolio policy, it is of im- 
portance in many applications to consider one, since 
many investors indeed are interested in avoiding sub- 
stantial shortfalls. In the following proposition we give 
two fundamental results for the wealth process for an 
investor following the optimal-growth strategy n*: (i) 
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the probability that the investment goal u is reached 
before a shortfall of size } occurs; and (ii) the expected 
time to escape the interval (I, u) (which is not the same 
as the expected time to the goal). 


ProrosiTion 2. For the process Z,(n*, m) of (21), 
let x denote the first escape time from the interval 
(Lu), and let 0(z:1,u) denote the probability of '"'suc- 
cessful” escape, ie, t= inf(t:Zim,n)€(Lu)), and 
Q(z: Lu) = P(Z,(n*, n) =u). Then 8(z) is given by 


9e:Lo- 1 (221). (22) 


u-l 


Also, the expected time of first escape from the interval is 


given by 
Esa, m) 17 [02:1 u)In(5) - n(2)]. (23) 


Thus, we see that while n^ is the policy under which 
any given investment goal will be reached in mini- 
mal expected time, and hence in a sense maximizes 
expected return, it does so with a risk of a shortfall of 
size | occurring with probability 1 — 9(z:1,u), where 
8(z :1,u) is the probability given in (22). 

REMARK 5. It is important to note that the probability 
Q(z: Lu) is independent of any of the underlying parameters 
associated with the underlying model. In particular, 
this probability is independent of the benchmark policy 
n. (For example, the probability of the ratio doubling 
(u —2z) before being halved (!=0.5z) is always 2) 
This limits the usefulness of the optimal-growth pol- 
icy in the active portfolio management setting, since it 
provides no guide to the manager in how to choose 
a benchmark. Of course, the expected time to escape 
the interval (l, u), given by (23), does depend on the 
underlying parameters, but only through the para- 
meter y. 

Remark 6. Observe that for /= 0, we obtain 0(z :0,u) 
=1, and the expected escape time from the inter- 
val E,(t(n*,m)) reduces, as it should, to the opti- 
mal expected first passage time to the upper barrier 
E,(t,(n",m)). Of course, for 1» 0, the expected hitting 
time of the optimal ratio process to the lower shortfall 
level, E;(t(1^, 1)), is infinite due to the fact that the 
drift in (20) is positive. 
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Remark 7. The probability in (22) and the expected 
hitting time in (23) can be established directly via a 
variety of different ways. Most directly we have the 
following lemma for geometric Brownian motion: 


LEMMA l. Let X, denote a geometric Brownian motion 
that satisfies 
dX, — mX,dt  V2s XdW, withXo=x — Q4) 


and let t, = inf(t» 0: X, =z}. For 0Xazx& b, define 
now t= min{ta, xy), and 


K(x) = Py(ta < Ta) = Py(t =) (25) 
A(x) = E(t). (26) 


Then for m x: s we have 


m 


VooQv 
K(x)= > where v=1— (27) 


av 


Hoy=— (kon (2) -«(1)). (28) 


while for m =s we have 


. In(x/a) 
K(x)= In(b/a) (29) 
NES "T 2, In(x/a) 
H(x)= 2m (tn) (Ina) laa) 
- [(Inxy? - any). (30) 


Proor. Recognize first that (Xi) is a diffusion pro- 
cess with drift function u(x) ^ mx and diffusion func- 
tion o?(x) = 2sx. Therefore, if follows from elementary 
results about one-dimensional diffusions that K and H 
are the unique solutions to the respective (Dirichlet) 
problems: 


mxK, +sx?Kyx =0: K(a)=0, K(b) 1 (31) 


mxH, +sx Hys +1 =0: H(a)=0, H(b)=0 (32) 


(see e.g., Karlin and Taylor 1981, pp. 192~193). The gen- 
eral solution to the second order differential equation 
in the left-hand side of (31), for mz s, is K(x) = Cx” + 
C,, and the boundary conditions determine the 
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constants C;, C2, as in (27). Form =s, the general solu- 
tion is C; Inx + Cz, and the boundary conditions give 
Q9). 

The general solution to the left-hand side of (32) 
is C, + Cx? — (m — s)^!Inx for ms, and from 
the boundary conditions we determine C, and C;, 
giving 


l l NN" 
veda) RE 
+(x? — a")Inb 4 (b — x”) Ina], 


which is equivalent to (28) above when we simplify 
using the definition of K(x). For m =s, the general so- 
lution of (32) is C, — (1m)(In x! + C; Inx, which gives 
(30) after applying the boundary conditions. o 


Using this lemma for our purposes, we first note that 
(20) is distributionally equivalent to a diffusion that 
evolves according to the stochastic differential equa- 
tion 


dX, =X; (2ydt + /2ydW,), 


where W is an independent one-dimensional Brown- 
ian motion. As such, identify m —2y, s = y, and hence 
v — —l, and then using b=u and a=], (22) and (23) 
follow upon substitution. o 


4. Risk-Related Objectives 


41. Objective Function and Optimal Policy 

The results of the previous section indicate that the 
optimal growth strategy, n*, regardless of its many 
optimality properties in the ordinary portfolio setting, 
may not be appropriate for an active portfolio man- 
ager who cares about downside risk as well as upside 
growth relative to a benchmark. 

As such, in this section we treat an objective that is 
perhaps more appropriate in that it allows the active 
portfolio manager to incorporate the shortfall prob- 
ability directly in the risk/return tradeoff relative to 
expected growth. In particular, we consider a linear 
tradeoff between the shortfall probability and the ex- 
pected time to get to the surplus level. More specifi- 
cally, we now treat the following objective: For given 
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nonnegative constants a and 8, let 


T 


V) = sip[aP Zt) - BE). (33) 


As we will show, the optimal dynamic portfolio strat- 
egy for this objective is no longer a constant asset al- 
location strategy. Rather, as we will establish below, 
the optimal strategy hyperbolically modulates the frac- 
tions invested in the risky assets by the level of the 
ratio process. 


THEOREM 2. Let V*(z) denote the optimal value function 
in (33), and let fy (z) denote ihe associated optimal control 
function. Then, V*(z) is given by 


" zt+b urb 
V'(z)2a "(25 )/ (145) , forizz&u, 

(34) 

where the scalar b= b(a, D : u,l, y) is given by 

ue] 

i 1—e-vwB ' 65) 

The optimal portfolio policy, fọ(z), is given by 
fi(z) on - (n -n) ( } :) EX EC s ne, (36) 


where b is given by (35), n* is the ordinary optimal growth 
vector given earlier in (6), 


w= E7\(n—71), 
and n is the benchmark strategy. 


Remark 8. Observe that the portfolio strategy f(z) 
is a state-dependent policy that is inversely modulated 
by the level of the ratio process Z. The final repre- 
sentation in (36) shows that the policy is composed 
of two parts: First it just uses the optimal-growth pol- 
icy n^, and then multiplies the difference between the 
optimal-growth policy n* and the tracking portfolio 
n by the correction term b/z. The sign of the correc- 
tion factor is determined by the sign of b. Some direct 
manipulations on (35) shows that the sign of b is the 


sign of 
0-8 


Comparison now with (18) reveals that we can write 
this quantity as G* (ID) — «/8, where G*(I) is the minimal 
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possible expected time to get from the shortfall level / 
to the surplus goal u. Thus, b is positive (negative) if 
the ratio o/f is less (more) than this minimal expected 
time. 

Observe further that if b >0, then the active manager 
invests more heavily in the i-th stock than does n* so 
long as the benchmark is underinvested in that stock 
relative to the optimal-growth policy, i.e., so long as 
0n" ()) » n(i). 

Finally, note that (35) shows that we must always 
have b> — 1. 

Proor. Theorem 1 applies directly to this case with 
9(z) = —B, with h(u) 2 « and h(1) 20. As such, we re- 
quire that V* be the concave increasing solution to the 
nonlinear Dirichlet problem: 


2 
ME —B=0 for 


V. l<z<u, (37) 


and that V* satisfy the boundary conditions V*(u)=« 
and V*(1) - 0. 

The general form of the solution to the nonlinear 
ordinary differential equation in (37) is of the form 
V(z) 2 (B/y)In(z + Cy) + C2, where C, and C; are ar- 
bitrary constants which we can choose to match the 
boundary conditions. Observe that this function is con- 
cave. The boundary condition V*(u)=« determines 
that C; =a — (B/g)In(u + C1), and the boundary con- 
dition V*(I) = 0 determines that C; = b, where b is the 
constant in (35). As such, the value function is given 
explicitly by 


— erp 
el eres eae Ap 


= (38) 
Y u-l 


V*(z)=a + 


Observe now that we can invert the relation in (35) 
to write B/y — o/In[(u + b)/(I + b)], which when placed 
into (38) and then simplified, gives the value function 
in the form that it is given in (34). 

For this value function, the conditions given in 
Theorem 1: (i) and (iii) are seen to hold directly, 
and (ü) can be established by first noting that 
(zw; Y = (/vy (z/[z + b])*, and then by using the fact 
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that since z/[u + b] € z/[z + b] € z/{I + b], we have 


I B rf af 7 
ef (Ziw (ZL) P'ds = (£) E E ds 
0 Y 0 zí +b 


zwi fa 
<| — E(Z5y d. , 
(Em A (Z,) ds «oo 


where the final inequality holds by the assumption of 
admissibility. 

Because we now have an explicit form for the op- 
timal policy, we may place it into (16) to obtain the 
optimal control vector given by f}(z) of (36). a 


42. The Optimal Process 

Observe that when we place the optimal control fi (Z,) 
of (36) back into (7), we obtain an optimal-wealth pro- 
cess Z(fy, n) which we will denote by Z^, that follows 
the stochastic differentia! equation, using the defini- 
tion of y from (12), 


dZr =2y(Zf + b)dt + (Zr + b) —x)y'oaw, (39) 


The unique strong solution to (39) is given by 
Zi = (Zo + b)exp(yt + (t — x) oW) — b. (40) 


Comparison with the results on the optimal-growth 
policy of the last section shows that we can write this 
in terms of the optimal-growth ratio as 


z-(i ez). dc (41) 
Zo 


where Z,(1^, n) is the ratio of the ordinary optimal- 
growth wealth to the benchmark, as given in (21). 

In the next proposition we list the following two 
properties of the optimal ratio process Z*: (i) the prob- 
ability of reaching the upper surplus goal u before the 
lower shortfall level /; and (ii) the expected time it takes 
to escape from the interval (J, u). Recall that the scalar 
b depends on the benchmark through the parameter y, 
and that we always have b > —I. 


Proposition 3. Let (Z?, £20} denote the optimal- 
wealth process associated with the control function f(z) 
of (36), and let «* denote the associated first escape time 
from the interval (l, u), i.e., ^ — inf (t: Zf € (1, w)). Also, 
let $(z : b, l, u) denote the probability of successful escape 
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from the interval, ie., d(z :b, 1, u) - PAZ. =u). Then 


(z — D(u + b) 


oz: b, Ip ES Y 


(42) 


The expected time of the first escape is given by 
J u+b z+b 
E(t) =- ib, l, u)l —Inf => e 
tong [seh ola) (22) 


REMARK 9. Observe that the probability in (42) is 
now dependent on the benchmark policy n through 
the parameter b. Note further that as intuition would 
suggest, the probability in (42) is larger than the as- 
sociated probability for the optimal-growth strategy, 0 
obtained earlier in (22), if b<0, and smaller if b>0. 

Proor. The results above can be established di- 
rectly from the earlier Lemma 1 applied to the process 
Z? + b, because as (41) exhibits, Z7 + b is a geometric 
Brownian motion, with initial state Zo + b. In fact, (41) 
implies that Z; + b is distributionally equivalent to a 
multiple of the optimal-growth ratio described in the 
last section, i.e., Z; + b O  b/Zo)Zi (^, n). As such, 
we have 


$(z:b, l, u) = B(Zi- =u)= B(Zi. +b=u +b) 


= »(( + hze, m=u+b) (44) 


where in the latter t denotes the first escape time of the 
process (1 +/z)Z,(n*, n) from the interval (I 4- b, u +b). 
As such, the results of the previous section allows us to 
evaluate this latter probability in terms of the function 
9(.) defined earlier in (22). Specifically, the argument 
in (44) shows that 


[+b ut+b 
qi 0 e(s: 1 b I1) 
=O(z+b:1+b,ut+b), (45) 


and indeed (42) is obtained when we substitute appro- 
priately into (22). 

The optimal expected hitting time, E(t") of (43) is 
derived directly from the fact that under the optimal 
policy fọ, the value function is 


V"(z)  aP;(Zz. =u) - PEAT), 
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and therefore 
] 
Eye peri =u) -— V*(z)}. (46) 


Substituting now for BR(Z}. =u) from (42) and for 
V*(z) from (34), and using 1/fi = In((u c b)/(E4- b)]/(a) 
gives (43). o 


43. Risk-Constrained Minimal Time 
The results of the previous section can now be applied 
directly to the active portfolio management case where 
the shortfall probability is prespecified. Specifically, 
suppose that the shortfall probability is prespecified 
to the active manager to be no more than 1 — p, where 
p is a given number between 0 and 1, i.e., the active 
manager is told that he must have P.I, -l)&l-p, 
or equivalently, that he must have P,ZI, =u)> p. The 
risk-constrained active portfolio management problem 
is now to minimize the expected time to beat the bench- 
mark subject to a constraint on the shortfall proba- 
bility, specifically, to find the strategy (/f7, #> 0} that 
minimizes E(t/) subject to P(zf, =u)> p, where pisa 
given number in (0, !). This is now related to the gam- 
bling problem first solved in Gottlieb (1985). Follow- 
ing Gottlieb, we observe first that the dual of the risk- 
constrained active portfolio management problem is 
to maximize the probability that P(Z/, =u) subject to 
a constraint on E(1/ ). Moreover, observe that should a 
solution exist, then the constraint will be met at equal- 
ity, and so we would have P(ZE, -u)-p. 

Let us write the solution to the dual problem, should 
it exist, as 


U(z) = sup[P(Z!, =u) — BE], (47) 
f 


where fj is now the value of a Lagrangian multiplier. 

The control problem in (47) is a special case of the 
problem treated above with x= 1, and as such, from 
(34) we know that the solution is given by 


z4b ub 
l z l =}, 4 
Y(z)- (z) (e (48) 


where b is the value of b in (35) evaluated at a— 1, 
i.e, b= b(1, B). The value of f, the Lagrangian mul- 
tiplier, will be determined from the risk constraint 
P(ZÍ, =u) =p. 
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We also know from (36) that the associated risk- 
constrained optimal portfolio strategy is given by 


p= +(x -mt (49) 


Observe now that we may invert the identity for b 
in (35) in this case to write the unknown f in terms of 
the unknown b as 


TEN 
zu] . (50) 


Because we require B» 0, this implies that we require 
b» -1 (we always need b > —1, to ensure that the prob- 
ability $(z: -,-) does not exceed 1, i.e., to keep < 1). 
To determine the value of b, we can use the risk 
constraint evaluated at the initial time 0, i.e., set 


(Zo; b) p. 
Using (42) with b= b, this gives us 


(Zo - ub) _ 
(Zo + b)(u — 1) 
which in turn now can be solved for 6, giving 
pZo(u — 1) — u(Zy - 1) 
Zg-l-p(u-l) ` 


b= Hp, Zo, u,l) (50 


Remark 9. Observe that 

7, Zo -l 
inf $(Zo :b)- 

È> F9 u-l 
and as such, the risk-constrained problem is feasible 
only for an initial probability level p that satisfies 

Zo -1 

u-l 


p> 
Observe too that for p=], b reduces to È= —!, which 
makes the lower barrier } unattainable as in many 


“portfolio insurance'' models (see Browne 1997). The 
insurance level b in (51) is positive for values of p sat- 


isfying 
Zo —1 u fZo-l 
u-l SE (25, 
and P is negative for larger values in the region 


Hu Zo - 1 = 
p> x (2-1) «o, 
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where 6(Z9) is the initial probability that the opti- 
mal growth wealth/ratio hits u before l, as given in 
(22). Thus, as intuition suggests, to have a higher 
""success'' probability than the optimal-growth strat- 
egy, the active portfolio manager must take less risk 
and invest less (since 5 «0) than the ordinary optimal- 
growth investor. 

The optimal expected hitting time, E,(t*) can be 
obtained directly as 


E(t") I zal utb In ute In z+b 
Y|zrb Nu -! Ib +b 


62). 


5. Conclusions 

We have studied a goal-related objective for the prob- 
lem of outperforming a benchmark. The objective 
relates the time to outperform the benchmark to the 
shortfall probability. The resulting optimal policy 
is state-dependent in an intuitive way not captured 
by previous studies where the optimal policy was 
of a constant proportions type. Moreover, this opti- 
mal policy directly addresses and alleviates one of 
the undesirable features, in the benchmark outper- 
formance problem, of the ordinary optimal-growth 
policy, whose shortfall probability was shown to be 
independent of the benchmark and any other model- 
specific parameter. As a special case of this objective, 
we studied the problem of minimizing the expected 
first passage time subject to a constrained probability 
of successful escape. 
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Abstract 


In this paper, we extend the definition of fractional Kelly strategies to the case 
where the investor’s objective is to outperform an investment benchmark. These 
benchmarked fractional Kelly strategies are efficient portfolios even when asset 
returns are not Jognormally distributed. We deduce the benchmarked fractional 
Kelly strategies for various types of benchmarks and explore the interconnection 
between an investor’s risk-aversion and the appropriateness of their investment 
benchmarks. 


1 Introduction 


Classically, a fractional Kelly strategy with fraction f consists in investing a pro- 
portion f of one’s wealth in the Kelly criterion, or log utility, optimal portfolio and 
a proportion 1 — f in the risk-free asset. Fractional Kelly strategies play an impor- 
tant role in active investment management in the asset only case, that is when the 
investor’s objective is to maximize the terminal value of his/her wealth, without any 
benchmark to track or liability to pay (see for example Thorp (2006) and Ziemba 
(2003) for discussion and additional references). In this chapter, we analyze the 
role of fractional Kelly strategies in the asset allocation of a benchmarked investor, 
that is an investor whose objective is to outperform a given investment benchmarks, 
such as the S&P 500 or the Salomon Smith Barney World Government Bond Index. 

The argument developed in this chapter builds on and applies the stochastic 
control-based model proposed by Davis and Lleo (2008a). Their methodology is 
founded on the theory of risk-sensitive stochastic control, rather than on the classical 
stochastic control theory-based approach proposed by Merton (e.g., Merton (1969, 
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1971, 1992)). In the asset only case, this choice has the benefit of being consistent 
with both the Merton model and with the mean-variance analysis while allowing for 
the explicit inclusion of underlying valuation factors and while admitting a simpler 
analytical solution than the Merton model. An added advantage of this model 
is the relative ease with which the asset only problem as formalized by Bielecki 
and Pliska (1999) can be extended to include a benchmark, as in Davis and Lleo 
(20082), or a liability, as in Davis and Lleo (2008b). In in these paper, it is seen 
that fractional Kelly strategies as defined above are not necessarily optimal, with the 
notable exception of the Merton model where they arise naturally as a consequence 
of the assumption that asset prices are lognormally distributed. As a result, to be 
able to interpret the solution to our benchmarked asset allocation problem in terms 
of fractional Kelly strategies and therefore guarantee their optimality, we will find 
it necessary to expand slightly their definition. 

In Section 2, we introduce risk-sensitive asset management from the perspective 
of an asset only investor. The interpretation of the resulting optimal asset allo- 
cation formula is then formalized in Section 3 both as a Mutual Fund Theorem 
and in terms of a redefinition of fractional Kelly strategies. Section 4 presents a 
comparison of the classical Merton model with the risk-sensitive asset management 
approach from the perspective of fractional Kelly strategies. Then, in Section 5, we 
present the analytical solution to the risk-sensitive benchmarked asset allocation 
problem, before interpreting this solution in a Mutual Fund theorem and in terms 
of benchmarked fractional Kelly strategies in Section 6. Finally, Section 7 contains 
a number of case studies showing applications of these ideas to specific types of 
benchmarks and to Kelly criterion investors. 


2 Risk Sensitive Asset Management 


2.1 Risk sensitive control 


Risk-sensitive control is most simply defined as a generalization of classical stochas- 
tic control in which the degree of risk aversion or risk tolerance of the optimizing 
agent is explicitly parameterized in the objective criterion and influences directly 
the outcome of the optimization. Risk sensitive control was introduced by Jacobson 
(1973) and has been developed by many authors, notably Whittle (1990) and Ben- 
soussan and van Schuppen (1985) before being applied to finance by Lefebvre and 
Montulet (1994) and to asset management by Bielecki and Pliska (1999). 

While in classical stochastic control the objective of the decision maker is to 
maximize E[F], the expected value of some performance criterion F, in risk-sensitive 
control the decision maker's objective is to select a control policy h(t) maximizing 
the criterion 


J(t, 2, h; 0) — -5 InE Daa (1) 
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where 


e t and x are the time and the state variable; 

e F is a reward function; 

e the risk sensitivity Ó € (—1,0) U (0,00) represents the decision maker's 
degree of risk aversion. 


A formal Taylor expansion of J around 0 = 0 evidences the vital role played by 
the risk sensitivity: 


J(2,t, 0) = E [F(t £, h)] — f Var [F(t,2, h)] + O(82) (2) 


e 0 — 0 corresponds to the “risk-null” case and to classical stochastic control; 

e when 0 < 0, we have the “risk-seeking” case that is a maximization of the 
expectation of a convex decreasing function of F(t, x, h); 

e finally, 0 > 0 is the “risk-averse” case that is a minimization of the expec- 
tation of a convex increasing function of F(t, x, h). 


To summarize, risk-sensitive control differs from traditional stochastic control in 
that it explicitly models the risk-aversion of the decision maker as an integral part 
of the control framework, rather than importing it in the problem via an externally 
defined utility function. 


2.2 Risk sensitive asset management 


Bielecki and Pliska (1999) pioneered the application of risk-sensitive control to asset 
management. They proposed to take the logarithm of the investor’s wealth V as 
the reward function, i.e. 


F(t,z,h) =InV(t, 2, h) (3) 


The natural interpretation of this choice is that the investor’s objective is to max- 
imize the risk-sensitive (log) return of the his/her portfolio. With this choice of 
reward function, the control criterion is 


1 
J(t2, hs) = -3 mE MCI (4) 
and interpret the expectation 
E MCI = E [V (t,x, h)-9] =: Ue(Vi) (5) 


as the expected utility of time t wealth under the power utility (HARA) function. 
The investor’s objective is to maximize the utility of terminal wealth. The Taylor 
expansion becomes 


J(t, 2;0) = E [In V(t, z, h)] — f Var (In V (t, x, h)] + O(0?) (6) 


Ignoring higher order terms, we recover the mean-variance optimization criterion 
and the log utility or Kelly criterion portfolio in the limit as 0 — 0. 
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2.8 The risk sensitive asset management model 
2.3.1 Asset and factor dynamics 


Embedding the investor's risk-sensitivity in the control criterion provides more lee- 
way in the specification of the asset market than would be obtained in the classical 
stochastic control of the Merton approach. Bielecki and Pliska (1999), in particular, 
propose a factor model in which the prices of the m risky assets follow a SDE of 
the form 


ol) (a+ AX (t)),dt + 5 dW, (t) 
= (a ; 0; 
S(t à £4 ik k 
$;,(0)— s; t=1,...,m (7) 
where W(t) is a N := n + m-dimensional Brownian motion and the market pa- 
rameters a, A, X := [ø], ¢=1,...,.m, j =1,...,N are matrices of appropriate 


dimensions. To avoid redundancy, we assume XY > 0. To these m risky securities, 
we add à money market asset with dynamics 
dSo(t) 
So(t) 
Finally, the asset prices drift depends on n valuation factors modelled as affine 
stochastic processes with constant diffusion 


= (ao + ALX(t))dt, ^ So(0) = so (8) 


dX (t) = (b + BX(t))dt + AdW (t), X(0)=2 (9) 
where X(t) is the IR"-valued factor process with components X(t) and the pa- 
rameters b, B, A :— [Ai], i = 1,...,n, j = 1,...,N are matrices of appropri- 


ate dimensions. These valuation factors must be specified, but they could include 
macroeconomic, microeconomic or abstract statistical variables. 

Under these conditions, the logarithm of the investor’s wealth is given by the 
SDE 


In V(t) 2 nv 4 J i (ao + AlX ()) + R(8Y (à T AX(s)) ds 
0 


t t 
-5 | h (sy EX'h(s)ds + | h(s} EdW (s) (10) 
where V (0) = v, h is the m-dimensional vector of portfolio weights and we used the 
notation à :— a — agl, Â :— A — 1Af, and 1 is a n-element column vector with all 
entries set to 1. 

The equation for V solely depends on the valuation factors (the state process) 
and is independent of the asset prices. This implies that the effective dimension 
of the risk-sensitive control problem will be n, the number of factors, rather than 
m, the number of risky assets. The limited impact of the number of assets is 
significant since for practical applications we would typically use only à few factors 
(possibly 3 to 5) to parametrize a large cohort of assets and asset classes (possibly 
several dozens). The risk-sensitive asset management model is therefore particularly 
efficient from a computational perspective. 
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2.3.2 The associated linear exponential-of-quadratic Gaussian control problem 


The next step in the analysis is due to Kuroda and Nagai (2002) who ingeniously 
observed that under an appropriately chosen change of probability measure (via the 
Girsanov theorem), the risk-sensitive criterion can be expressed as 


UE (11) 


where the expectation E? [| is taken with respect to a newly-defined measure pe 
depending on the investment strategy h, the functional g is 


1 ^ 
g(z, h; 8) = 5 (0 4- 1) j/X:X/h — ag oz — h' (â + Ax) (12) 


1 
I(v,z; h;t, T) = lnv — gn ER 


and the factor dynamics under the new measure p? is 
dX, = (b+ BX, — 0AX/'h(s)) ds + AdW? (13) 
(see Davis and Lleo (20082) for details). 

In this formulation, the problem is a standard Linear Exponential-of-Quadratic 
Gaussian (LEQG) control problem which can be solved exactly, up to the resolution 
of a system of Riccati equations. But before solving this control problem and deriv- 
ing the optimal asset allocation, we will first develop some intuition by considering 
the simple case in which the security and factor risk are uncorrelated, i.e., when 
AX! — Q0. 


2.3.3 Special case: Uncorrelated assets and factors 


When A’ = 0, security risk and factor risk are uncorrelated and the evolution of 
X, under the measure IP? given in equation (13) simplifies to 
dX, = (b+ BX,) ds + AaW? (14) 
The evolution of the state is therefore independent of the control variable h and, 
as a result, the control problem can be solved through a pointwise maximisation of 
the auxiliary criterion function I(v, x; h;t, T). 
In this case, the optimal control h* is simply the maximizer of the function 
g(x; h;t, T) given by 
ie ey (a+ Ax) (15) 
which represents a position of Gat in the Kelly criterion portfolio. 
Let (t, x) be the value function corresponding to the auxiliary criterion function 
I(v,z;h;t, T). Substituting the value of h* in the equation for g yields 
$(5,2)— sup I(wvz;h;t,T) 
he A(T) 


1 
-3 nE}, 


ofo [aterert t (16) 


The PDE for 9 can now be obtained directly via an exponential transformation and 
an application of Feynman-Kac. 
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2.3.4 The general case 


In the general case, the value function ® for the auxiliary criterion function 
I(v,z;h;t, T), defined as 


(t,x) = sup I(v,z; h; t, T) (17) 
A(T) 
satisfies the Hamilton-Jacobi-Bellman Partial Differential Equation (HJB PDE) 
oo 
— + sup LP®(X(t)) =0 (18) 
ot heR™ 
where 
h I 4 l !T32 
Lié(t,r)-— (b+ Bx—0AXh(s) DË + jt (AA D $) 
0 
— ;(D9) AM De — g(x, h; 0) (19) 
/ 
where D® is the gradient vector defined as D® :— (2 ye) A TE 22) and 
D? is the Hessian matrix defined as D?® :— E23 „i,j =1,...,n. The value 


function ® satisfies the terminal condition ®(T, x) = In v. 
Solving the optimization problem gives the optimal investment policy A*(t) 


, 1 _ ^ 
O = gu Ox) ! [à + AX(t) - 0X A' Dà (t, X(0)| (20) 
Moreover, the solution of the PDE is of the form 
(t,x) = z'Q(t)z + z'q(t) + k(t) (21) 


where Q(t) solves à n-dimensional matrix Riccati equation and q(t) solves a n- 
dimensional linear ordinary differential equation depending on Q (see Kuroda and 
Nagai (2002) for details). 


3 Fractional Kelly Strategies in the Risk-Sensitive Asset 
Management Model 


3.1 A mutual fund theorem 


Theorem 1 (Mutual Fund Theorem (Davis and Lleo, 2008a)). Any port- 
folio can be expressed as a linear combination of investments into two “mutual 
funds" with respective risky asset allocations: 


AK (t) — (X) (à + Ax (t)) 


(22) 
h(t) = -(ZE/) TEA’ (g(t) + Q(0X(t)) 
and respective allocation to the money market account given by 
AK (t) — 1—1'(XX)-! (44+ ÀX(t 
KO ŒE) (a+ AX(t) (23) 


hg (t) = 1+ (EENT EN (g(t) + Q(t)X (0) 
Moreover, if an investor has a risk sensitivity 0, then the respective weights of each 


mutual fund in the investor’s portfolio equal z and s respectively. 
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The main implication of this theorem is that the allocation between the two 
funds is a sole function of the investor's risk sensitivity 0. As 0 — 0, the investor's 
wealth gets invested in the Kelly criterion portfolio (portfolio K}. On the other 
hand, as 0 — oo, the investor’s wealth gets invested in the correction portfolio 
C. The investment strategy of this portfolio can be interpreted as a large position 
in the short-term rate and a set of positions trading on the comovement of assets 
and valuation factors. In financial economics, portfolio C is referred to as the 
‘intertemporal hedging term’. 

When we assume that there are no underlying valuation factors, the risky secu- 
rities follow geometric Brownian motions with drift vector u and the money market 
account becomes the risk-free asset (i.e., ay = r and Ao = 0). In this case, NA’ = 0 
and we can then easily see that fund C is fully invested in the risk-free asset. As a 
result, we recover Merton’s Mutual Fund Theorem for m risky assets and a risk-free 
asset (e.g., Theorem 15.1, p. 489 in Merton (1992)). 


3.2 Fractional Kelly strategies in the risk-sensitive asset 
management model 


Fractional Kelly strategies arise naturally in the Merton investment model, since 
Merton's Mutual Fund Theorems guarantees that the optimal investment strategy 
can be split in an allocation to the risk-free asset and an allocation to a mutual fund 
investing in the Kelly criterion portfolio. However, the optimality of fractional Kelly 
strategies is the exception rather than the rule: in the Merton model, fractional 
Kelly strategies are optimal as a result of the assumption that asset prices are 
lognormally distributed. In the factor-based risk-sensitive asset management model, 
we cannot expect either that the ‘classical’ definition of fractional Kelly strategies 
to yield optimal or near optimal asset allocations as soon as 0 Æ 0. 

To address this difficulty, Davis and Lleo (2008a) proposed a generalization of the 
concept of fractional Kelly strategy based on the findings expressed in the Mutual 
Fund Theorem 1. Rather than regarding the fractional Kelly strategy as a split 
between the Kelly portfolio and the short-term rate, Davis and Lleo propose to 
define it as a split between the Kelly portfolio and the portfolio C, as defined in 
the Mutual Fund Theorem 1. In this case, the Kelly fraction, which represents the 
proportion of wealth invested in the Kelly portfolio, is inversely proportional to the 
investor's risk sensitivity and is equal to áp 

This redefinition of fractional Kelly strategies has two important consequences: 


e The fractional Kelly portfolios are always optimal portfolios; 

e In the lognormal case (i.e. when n — 0), the generalized definition of frac- 
tional Kelly strategies reverts to the ‘classical’ definition. This can be ver- 
ified from the fact that in the lognormal case, the Mutual Fund Theorem 1 
simplifies into Merton's Mutual Fund Theorem for m risky assets and a 
risk-free asset. 
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4 Fractional Kelly Strategies, Risk-Sensitive Asset Management 
and the Merton Model with Power Utility 


4.1 Objective 


In this section, we go further in our analysis by drawing a comparison between risk- 
sensitive asset management and the Merton model from the perspective of fractional 
Kelly strategies. 

For comparison purpose, we consider: 


e a fractional Kelly strategy with fraction f; 

e the Merton model maximizing utility of terminal wealth, with the utility 
function chosen to be the homogeneous power utility or hyperbolic absolute 
risk aversion (HARA) function; and 

e the diffusion risk-sensitive asset management model without any underlying 
valuation factors. 


In order to perform a comparison between the risk-sensitive asset allocation 
model and the original Merton model, we assume that the market is comprised of 
m risky assets and that the investor's objective is to find an optimal m-dimensional 
portfolio allocation vector h(t), where h;(t) represents the proportion of the portfolio 
invested in risky security i. We also assume that there are no valuation factors and 
hence n — 0. We will show that the treatment of fractional Kelly strategies is 
comparable in both cases up to a sign difference in the risk-aversion/risk-sensitive 
coefficient. 


4.2 A brief review of the Merton model 


In the Merton model, the objective of an investor is to maximize the utility of 
terminal wealth represented by the criterion: 

I(t,s,h;0):— E[U(V (t, s, h))] (24) 
where 


e V (t, z,h) is the investor's wealth at time t in response to a securities market 
with price vector s and an investment policy h; 

e U is the homogeneous power utility or hyperbolic absolute risk aversion 
(HARA) function, defined as 


U(z) = > (25) 
Y 
where y € (—00,0) U (0, 1) is the risk-aversion coefficient. 


The dynamics of the prices of the m risky assets follows a geometric Brownian 
motion of the form 


dSi(t m l 
E = pa dt + S > oindW;(t), S,(0) = s; t=1,...,m (26) 
‘ k=l 
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where W(t) is a m-dimensional Brownian motion. The price of the money market 
asset satisfies 


dSo(t) 
= rdt So(0) = 27 
So(t) rat, of ) $0 ( ) 
In this setting, the optimal asset allocation is given by 
1 
h*(t) = ——(XX)ü 28 
()- ;— "a (28) 


with £ :— p —r1 where 1 is the m-element unit vector and X is the diffusion matrix, 
defined as X = [ci;]. 


4.3 Observations and conclusions 


From a fractional Kelly perspective, the two models are strikingly similar. Taking 
the case of an investor allocating a fraction f of his/her wealth to the Kelly portfolio, 
we see that 


e in the Merton model, such strategy would be optimal for an investor with 
a level of risk aversion y equal to 
1 
yal- f (29) 


e in the risk-sensitive model, such strategy would be optimal for an investor 
with a level of risk-sensitivity 0 equal to 
1 
0 f 1 (30) 
Looking solely at the Kelly component of the optimal investment policy, this 
would imply that 

ü = -y (31) 
This similarity is in not surprising. Indeed, when we restrict the risk-sensitive 
approach to a 0 factor model, we get the same optimal asset allocation as in the 
Merton model with homogeneous power utility, but for the fact that y has been 

replaced by —6, i.e. 

1 
041 
This observation is confirmed by both the range of 0 and y and by the functional 
form of the utility function associated with the risk-sensitive approach. 

These findings are summarized in Table 1. 


h^ (t) = (Zr) (32) 


Key Points: 


e The situation between the Merton model with homogeneous power utility 
and the risk-sensitive asset management model is parallel, and we can use 
the guideline correspondence 9 = —y = 1/f —1 to link them with the 
fractional Kelly approach; 
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Table 1 Comparison of the Merton Model and of the Risk-Sensitive Asset Management Model from 
a Kelly Perspective 


Merton model with 
Risk-Sensitive Asset Management | homogeneous power utility 


Risk-sensitive parameter /Risk 6 € (—1, 0) U (0, co) ~y € (—o0, 0) U (0,1) 
aversion coefficient 


Range of the risk-sensitive pa- (0, co) (—oo, 0) 
rameter/risk aversion coeffi- 
cient for risk-averse investors 


Y 
Form of Utility Function U(z) —z-? U(z) = Z 
y 
1 1 
Optimal asset allocation h*(t) zp ee (&CX(t)) + ZA’ DO) 1 (ZXxy)-!ü 
-y 
Value of the risk aversion 1/f—1 1—1/f 
coefficient /risk-sensitive 
parameter corresponding to a 
Kelly fraction f 
Range of Kelly fractions for (0, 1) (0, 1) 


risk-averse investors 


e The risk-sensitive approach is therefore consistent with the Merton model 
with homogeneous power utility; 

e In the case when there are no factors (i.e., n = 0), the optimal asset alloca- 
tion obtained in the risk-sensitive approach reverts to that associated with 
the Merton model; 

e In the general case (n > 0), the definition of fractional Kelly strategies in 
the risk-sensitive asset management model can be extended as proposed 
above while remaining consistent with the ‘classical’ definition of fractional 
Kelly strategies as a split bewteen the Kelly portfolio and the risk-free asset. 


5 Adding an Investment Benchmark: Risk Sensitive Benchmarked 
Asset Management Model 


So far, we have introduced risk sensitive asset management in the classical asset 
only context of an investor attempting to maximize the terminal utility of his/her 
wealth. We will now examine the related benchmarked asset management problem 
in which the investor selects an asset allocation to outperform a given investment 
benchmark. 

In the benchmarked case, Davis and Lleo (2008a) propose that the reward func- 
tion F(t,x;h) be defined as the (log) excess return of the investor's portfolio over 
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the return of the benchmark, i.e. 


V(t, x, Rh) 
F(t,z, h) :2 In = 33 
E LOIS 39) 
where L is the level of the benchmark. 
Furthermore, the dynamics of the benchmark is modelled by the SDE: 
aL(t 
Po = (c+ C' X (t))dt + s'dW (t), L(0) =1 (34) 


where C is a scalar constant, C is a n-element column vector, and ¢ is a N-element 
column vector. 
This formulation is wide enough to encompass a multitude of situations such as: 


e the single benchmark case, where the benchmark is, e.g. an equity index 
such as the S&P500 or the FTSE 100. 

e the single benchmark plus alpha, where, for example, a hedge fund has for 
benchmark a target based on a short-term interest rate plus alpha. 

e the composite benchmark case, e.g., a benchmark constituted of 5% cash, 
35% Citigroup World Government Bond Index, 25% S&P 500 and 35% 
MSCI EAFE. 

e the composite benchmark plus alpha that is a combination of the previous 
two cases. 


By Itó's lemma, the log of the excess return in response to a strategy h is 


t t 
Fh) - n? + f ain vis)— | dln L(s) 
0 0 


= In ; + i (ao + Ap X(s) + h(s) (à t ÀX(s))) ds 


a : ^ h(s)'EX/h(s)ds + ji h(s'EdW(s) 


= g cdW(s) (35) 
J0 


F(0,2; h) = fo :— In 7 


Following an appropriate change of measure, the criterion function can be 


expressed as 
1 Tt 
I(fo, x; h;t, T) = ln fo — 5 In Es exp a | g( Xs, h(s); 0)ds (36) 
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where 


g(x,h;0) = = (0 - 1) /EX/h — ag — Aba — h/ (à + Ax 
0 


NI = 


Óh Xxx + (6 + C'x) 4 ; (0 I's (37) 


Once again, the control problem simplifies into a LEQG problem. 
The value function ® for the auxiliary criterion function J(fo,z; h;t, T). Then 
® is defined as 


(t,x) = sup I(fo, z; h; t, T) (38) 
A(T) 
and it satisfies the HJB PDE 
Oo 
— + sup L% =0 (39) 
Ot — neR™ 
where 
1 
Lh$ = (b+ Bx — ON(X/h — c)) Do + P (AA D?8) 
0 
-3(D9) AN De — g(x,h;@) (40) 
Solving the optimization problem gives the optimal investment policy A*(t) 
* 1 W\-1 fo Aion L / 
M = 5 (BP) (a+ Ae - EA Dà + 05s) (41) 
The solution of the PDE is still of the form 
$(t,2) = 2'Q(t)x + z'q(t) + k(t) (42) 


where Q(t) solves a n-dimensional matrix Riccati equation and g(t) solves a n- 
dimensional linear ordinary differential equation. 


6 What About the Kelly Criterion? 


6.1 A mutual fund theorem 


In the benchmarked case, we will follow the same logic as in the asset only and 
rewrite the optimal asset allocation as an allocation between two funds. We will 
then use this result to define benchmarked optimal fractional Kelly strategies, that is 
fractional Kelly strategies, which are also optimal investment policies in the bench- 
marked asset management model. All we will need to do after that is to verify that 
this new definition is consistent with the definition we gave earlier in the asset only 
case, and thus, with the ‘classical’ definition of fractional Kelly strategies in the 
limit as our model converges to the Merton model. 

The following benchmarked mutual fund theorem is due to Davis and Lleo 
(2008a): 
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Theorem 2 (Benchmarked Mutual Fund Theorem). Given a time t and a 
state vector X(t), any portfolio can be expressed as a linear combination of invest- 
ments into two “mutual funds” with respective risky asset allocations 


hK(t) = (XX) (a+ Ax (t)) 


(43) 

h(t) = (2/7)! [c — EN (g(t) + Q(0)X (0)] 

and respective allocation to the money market account given by 
AK (t) 21—-1'(xX»)- (à 4 Ax(t)) (44) 


hg (t) = 1— (EEN) [Us — UA’ (g(t) + HOXH) 
Moreover, if an investor has a risk sensitivity 0, then the respective weights of each 
mutual fund in the investor’s portfolio equal vH and zu respectively. 


There are two main differences between Theorems 1 and 2: the definition of 
portfolio C and the role played by the risk sensitivity 0. In the asset only case of 
Theorem 1, portfolio C is comprised of the money market asset and of a strategy 
trading the comovement of assets and valuation factors. In the benchmark case 
of Theorem 2, portfolio C still includes an allocation to the money market asset 
and the asset-factor co-movement strategy, but it also contains an allocation to a 
strategy designed to replicate the risk profile of the benchmark. Indeed, the term 
ù := (33/7) 1X represents an unbiased estimator of a linear relationship between 
asset risks and benchmark risk ç = Xu. In financial economics, the interpretation 
of û is as the vector of asset systematic exposure, or “betas”, computed with respect 
to the benchmark. 

Hence, when @ is low, the investor will take more active risk by investing larger 
amounts into the log-utility or Kelly portfolio. On the other hand, when 6 is high, 
the investor will divert most of his/her wealth to the correction fund, which is 
dominated by the term (3X) 1X and designed to track the index. In short, while 
an investor with 0 — 0 is a Kelly criterion investor, an investor with extremely high 
0 is a passive investor. 

This first observation leads us to reconsider the role and definition of the risk 
sensitive parameter 0. Indeed, while in the asset only case, 6 represents the sen- 
sitivity of an investor to total risk, in the benchmark case, 6 corresponds to the 
investor's sensitivity to active risk. To some extent, in the benchmark case the 
investor already takes the benchmark risk as granted. The main unknown is there- 
fore how much additional risk the investor is willing to take in order to outperform 
the benchmark. This amount of risk is directly quantified by the risk sensitive 
parameter 0. 


6.2 Fractional Kelly strategies revisited 


In the benchmark case, as in the asset only case, we can expand the classical def- 
inition of fractional Kelly strategies by defining them as a split between the Kelly 
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portfolio K and the benchmark-tracking portfolio C, as per the Mutual Fund The- 
orem 2 and which is includes a benchmark replicating strategy. 

We can quickly check that the extended definition in the benchmarked case is 
consistent with the definition given earlier in the asset only case by setting the 
benchmark to 0, which de facto implies the absence of a benchmark. We see then 
that the definitions of fund C given in Theorems 2 and 1 are identical: the two 
theorems now coincide. As expected, the asset only problem can be viewed as a 
special case of the benchmarked allocation. 

Thus, if we set » = 0 so that no valuation factor is considered and set the 
benchmark to 0, the risk-sensitive benchmarked asset management model is the 
Merton model and the definition of optimal benchmarked fractional Kelly strategies 
reverts to the classical definition of fractional Kelly strategies. 


7 Case Studies 


We will now apply the model to study some specific benchmark structures and 
assess the appropriateness of benchmarks for Kelly criterion investors. 


7.1 Benchmark as a portfolio of traded assets 


First, we revisit two examples considered by Davis and Lleo (2008a) in which the 
benchmark is a portfolio of traded assets respectively with or without the money 
market asset. 


7.1.1 Benchmark as a portfolio of traded assets and the money market asset 


Developing the ideas from the previous paragraph, we consider a benchmark whose 
dynamics is given by 
dL, j / A A I 
LU (ao + Ao X (£)) + v (t) (â + AX (t))dt + v/(t)ZdW, 
i 
= [(1 — 7/1) ao + v'a + (1 — v/1)A$ + v/ A) X (t)] dt 
t v'(t)»ndW, (45) 
where v is an m-element allocation vector satisfying the budget equation 


1l'»21- hy (46) 


and AP is the allocation left in the money market account. The process L(t) repre- 
sents the (log) return of a constant proportion portfolio with risky allocation vector 
v and the remainder (i.e., 1 — 1'v) invested in the money market account. 


Corollary 3. (Fund Separation Theorem with a Constant Proportion Benchmark 
(II)). Given a time t and a state vector X(t), any portfolio can be expressed as 
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a linear combination of investments into a “mutual fund", an index fund and a 
“long-short hedge fund” with respective risky asset allocations 


hK (t) = (XX) (à + Ax(t)) 
h!(t) =v (47) 
h” (t) = (EE) !xN (g(t) + Q(t) X E) 

and respective allocation to the money market account given by 


hK (t) 21-1 (XX)! (à n AX(t)) 


Ad(t) 21—1'v (48) 
hi (t) =0 
Moreover, if an investor has a risk sensitivity 0, then the respective weights of each 
fund in the investors portfolio equal ED EST and "Sm respectively. 


We can interpret this Corollary in terms of optimal fractional Kelly strategies 
in a similar way as we did for Corollary 4. The second component of the strategy, 
the correction fund C, can again be split into: 


1. a portfolio J with risky asset allocation given by v which is designed to strictly 
replicate the risk exposure of the index; 

2. a “long-short hedge fund” H with risky asset allocation given by A (t) and whose 
sole purpose is to trade the comovement of assets and factors. 


Here again, we could show that H is a zero net weight strategy, i.e. 


Uh? =0 (49) 


7.1.2 Benchmark as a portfolio of traded assets only 


We assume that the benchmark follows a constant proportion strategy invested 
in a combination of traded assets. Its dynamics is given by the equation 


dL, 


LU v'(a + AX(t))dt + v/ZdW, (50) 
t 


where v is a m-element allocation vector satisfying the budget equation 
Vv=1 (51) 
In this setting, Corollary 3 can be expressed as: 


Corollary 4. (Fund Separation Theorem with a Constant Proportion Benchmark 
(I)). Given a time t and a state vector X(t), any portfolio can be expressed as 
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a linear combination of investments into a "mutual fund", an index fund and a 
“long-short hedge fund" with respective risky asset allocations 


hK (t) = (xx) (à n ÀX()) 
hi(t) —-v (52) 
hË (t) = —(Xx/)! xA (g(t) + QOX (0) 


and respective allocation to the money market account given by 


hÉ(t) = 1-1 (Xx)! (à n ÀX(0) 
ha(t) 


t)=1 
t 
hg (t) = (EENEN (q(t) + QU) X (2) 


(53) 


Moreover, if an investor has a risk sensitivity 0, then the respective weights of each 


mutual fund in the investor’s portfolio equal ERE z and z respectively. 


From the perspective of optimal fractional Kelly strategies, the implication of 
this corollary is that the second component of the strategy, the correction fund C, 
can now be explicitly split into two sub portfolios: 


1. a portfolio J with asset allocation given by v which is designed to strictly replicate 
the risk exposure of the index. Observe that the linear estimator à :— (XX) 1X 
has vanished and been replaced by the actual asset allocation v: since the bench- 
mark is now comprised of traded assets it can be replicated by a direct investment 
in the appropriate asset allocation and does not require any further estimation; 

2. a “long-short hedge fund" H with risky asset allocation given by 


(EENEN (q(t) + Q(t)X (t) (54) 
and whose sole purpose is to trade the comovement of assets and factors. 


The “long-short hedge fund” H is particularly interesting because it has zero 
net weight in the sense that the sum of all the long positions in the portfolio is 
exactly matched by the sum of short positions. As a result, fund H satisfies the 
budget equation 


Uh" +h —0 (55) 


This “long-short hedge fund” can therefore be viewed as a macro-oriented overlay 
strategy within the asset allocation. 


7.2 Benchmark with alpha target 


We now consider two new examples involving a benchmark based on an interest 
rate plus some measure of risk-adjusted excess return known as alpha. 
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7.2.1 Money market rate plus alpha benchmark 


Money market rate plus some predetermined alpha has been adopted as a bench- 
mark by a large number of hedge funds. What are the implications of this choice 
in terms of the optimal fractional Kelly strategies? 
The dynamics of such a money market rate plus alpha benchmark can be ex- 
pressed as 
dL 
= = (ag + Ao X (t))dt + adt (56) 
t 
where a represents the instantaneous level of risk-adjusted excess return, or alpha, 
required of the fund manager. 
The optimal asset allocation in this case is 
1 


h* = 
041 


(5x)! (à + (À — 9X:A'Q)z — 65N'q) (57) 
which is simply the asset only asset allocation as given in (20). As a result, the con- 
clusion of the Mutual Fund Theorem 1 hold and optimal fractional Kelly strategies 
can be defined accordingly. 

So, what has happened to the benchmark? In our risk-sensitive framework, the 
benchmark is tracked through its risk profile rather than its return profile. Since 
a money market plus alpha benchmark does not have any direct risk, there cannot 
be any comovement between the assets and the benchmark. Thus, the benchmark 
risk profile cannot be replicated, and as a result, the asset allocation is established 
without any regard to the benchmark. 

The money market plus alpha benchmark will have an impact on the value 
function ®. Indeed, in the benchmarked case, the auxiliary criterion function 
I( fo, £; h;t, T) reflects excess return over the benchmark, rather than total return as 
in the asset only case. The value function for the benchmarked problem is therefore 
consistent with excess return rather than total return, resulting in different value 
functions for the benchmark problem and for the asset only case. 

The main conclusion from this case study is that from the perspective of a ratio- 
nal risk-sensitive investor with no investment constraints, money market plus alpha 
benchmarks results in the same investment strategy as not having any benchmark 
at all. To some extent, in a risk-sensitive setting, money market plus alpha is a 
benchmark-free benchmark. In particular, and somehow counter intuitively, the 
choice a higher value for alpha does not result in a the design of a riskier asset 
allocation. In fact, the value of alpha has no impact at all! 


7.2.2 Bill rate or bond yield plus alpha benchmark 


The situation would however be slightly different if the benchmark was for example 
3 month Treasury bill rate plus alpha or a 5 year Treasury note yield plus alpha. 
To generalize slightly, the benchmark dynamics could be expressed as an extension 
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of the model considered above in the “Benchmark as a Portfolio of Traded Assets 
Only? case (in Section 7.1.2) !: 


a = V (a + AX(t))dt + adt + v/EdW, (58) 
t 


where v is a m-element allocation vector satisfying the budget equation 
l5»-1 (59) 


As expected, the level of alpha does not influence the optimal asset allocation 
and the conclusions stemming from Corollary 4 are still valid in this case. 


7.2.3 Solving the alpha puzzle 


Why is the alpha not producing any impact on the asset allocation? The reason is 
that the risk-sensitive benchmarked asset management model does not penalize for 
a non-achievement of the benchmark return. As we have seen above, risk sensitivity 
implies that the risk of the asset portfolio relative to the benchmark matters, but 
not the expected return. Since an alpha target is in essence a “pure return" target, 
it does not change the behaviour of the risk sensitive investor. 

Should the investor be penalized for a non-achievement of the benchmark return? 
Not in our opinion. For an investor who decides of their own asset allocation, 
the benchmark is à measure of acceptable risk rather than an minimum return 
objective. Indeed, the portfolio optimization process will always select, for a given 
risk sensitivity, the asset allocation which maximizes the relative return of the asset 
portfolio with respect to its benchmark. The issue of an alpha target is irrelevant 
in this problem. 

The situation might however be different if the investor hires a manager to 
perform the asset allocation. The alpha imperative may in this case be understood 
as a control imposed by the investor to force the asset manager to select an optimal 
(or near optimal) asset allocation. The investor should still not be penalized for 
a non achievement of the benchmark, but it is conceivable that the investor may 
want to penalize the manager for non achievement of the benchmark return. Here, 
the penalization would occur at the level of the fee earned by the manager and not 
at the level of the investor's terminal wealth. This would result in a completely 
different control problem in which we would need to take the perspective of the 
manager whose objective is to allocate the investor's assets in order to maximize 
the management fee received and subject to a non-achievement penalty. 


l'This slight generalization is required in the event the benchmark is based on a bond. Indeed, 
in the risk-sensitive model, zero coupon bonds are the representatives of the fixed income asset 
class in the investment universe. Any coupon bond must therefore be “recreated” as a linear 
combination of zero coupon bonds. In the event, the benchmark is à Treasury bill, then we have 
the degenerate case in which v is a vector with the element corresponding to the Treasury bill set 
to 1 and all other elements equal to 0 
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7.3  Alpha-omega targets 


An alternative to the pure alpha target is an alpha-omega target, where omega 
represents the variability of alpha (see Grinold and Kahn (1999) for à view of the 
role of alpha and omega in active management). This approach implictly recognises 
that alpha generation is not only uncertain but also risky. The introduction of the 
omega terms changes only slightly the asset allocation problem. The dynamics of 
8 benchmark with alpha - omega targets can be modelled as 
a(t) ; ; ; 
T(t) = [(e - C' X(t))dt + e'aW (t) + [adt + w'aW (t)] 
= (c a 4 C'X (t))dt + (s - w)'dW(t), L(0) =1 (60) 


where the scalar constant c, the n-element column vector C, and ¢ is a N-element 
column vector are related to the benchmark itself and the scalar a and vector w 
refer to the excess return demanded by the investor. The last element of ¢ is set to 
0 while the first n + m element of the vector w are set to 0. This condition ensures 
that the variability of alpha is uncorrelated with the factors and assets, as active 
asset management theory suggests. 


7.3.1 The optimal investment policy 


Extending the reasoning in Davis and Lleo (2008a), we would find that the optimal 
investment policy A* is 


* 1 lfa Â e 1 
h* = 5 (SE!) (a+ Âz — EA D9 + 0Els +w)) (61) 
The solution of the PDE is still of the form 
$(t,z) — z'Q(t)z + z'a(t) + k(t) (62) 


where Q(t) solves a n-dimensional matrix Riccati equation and q(t) solves a n- 
dimensional linear ordinary differential equation. Specifically, 


Olt) - Q(0KsQU) + KH + QU). + AEE) À- 0 — (63) 


0+ 
for t € [0, T], with terminal condition Q(T) = 0 and with 
0 
Ko =6|A | I— —— Y (ENEA 64 
ep r-zREemPÁz)vl (64) 
K,=B- Ê AS'(EX))-1À (65) 
* 041 


q(t) + (K1 — Q(t) Ko) a(t) + Q(t)b + OQ HAC +w) + Ao - C 
+ 4 (24' — 6Q'()x/) (ZX/) (â 4- 0X(s + w)) 
9 (66) 
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with terminal condition q(T’) = 0. 
T 
k(s) = fo + J l(t)dt (67) 
3 
for 0 < s < T and where 


KO = Str (AM QU)) — za Ma) + a(t) 


1 PU /Y—l5 1 2! 1 /4—1 / 
+ oe Y) â+ 3 L1? 4 (0AE (x) EAN q(t) 
TAA HAEE) tâ — E HAD (hE) HEY 
Fat AE (ED) — a (AEN 
1 0 
+ 6(s + wy A'q(t) 7 (0 — 1) (c - w)'(s * w)4 jr j£ (EX) "EG + w) 
1 
4 Faye YE) Bs +w) 4- ag —(e+ a) (68) 


7.3.2 Mutual fund theorem 


We can now restate the optimal asset allocation in terms of the following Mutual 
Fund Theorem: 


Theorem 5 (Alpha-Omega Benchmarked Mutual Fund Theorem). 

Given a time t and a state vector X(t), any portfolio can be expressed as a lin- 
ear combination of investments into two "mutual funds" with respective risky asset 
allocations 


hK (t) = (xx) (à + Ax(t)) 


(69) 
AC (t) = (EX)! [X(c + w) — VA‘ (a(t) + ROXO 
and respective allocation to the money market account given by 
nf (E) = 1-1 (G2) (a + ÂX (8) i 


hg (t) = 1—1(X7)7! [Z(s +w) — EA’ (alt) + QQ X (t))] 
Moreover, if an investor has a risk sensitivity 0, then the respective weights of each 


mutual fund in the investor's portfolio equal $u and ao respectively. 


Proof. This Corollary can be proved in a similar fashion to Theorem 3 in Davis 
and Lleo (2008a). m 


7.3.3 Kelly strategies 


In terms of Kelly strategies, this Mutual Fund Theorem represents a split between 
a fraction aa invested in the Kelly portfolio K, and a fraction zh invested in 


the correction fund C. As in Theorem 2, portfolio C includes an allocation to the 
money market asset and the asset-factor co-movement strategy and an allocation 
to a strategy designed to replicate the risk profile of the benchmark. In addition, 
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the allocation to portfolio C features a new term related to the variability of alpha, 
defined as à :— (XX) 1Xiw and which represents an unbiased estimator of a linear 
relationship between asset risks and alpha risk w = X'u. 

When 9 is high, the investor will divert most of the wealth to the correction fund 
in order to replicate the risk profile of the benchmark and the appropriate level of 
active risk given by w. On the other hand, when 0 is low, the investor will take 
more active risk by investing larger amounts into the Kelly portfolio and will not 
attempt to stay within the bounds of active risk imposed by w. 


7.4 Benchmarks and Kelly investors: No benchmark for buffett! 


Our last application of the idea of benchmarked fractional Kelly strategies takes 
the form of an anecdote concerning the appropriateness of benchmarks for Kelly 
investors. A few years ago, a controversy shook the North American investment 
industry: what would be a proper investment benchmark to assess the performance 
of legendary investor Warren Buffettt's Berkshire Hathaway? Although this ques- 
tion may appear, at first glance, trivial, it is in fact the reflection of a wider concern 
shared by asset managers and investors alike on what constitutes an appropriate 
investment benchmark. This question has received an increasing deal of attention 
in the past two decades and it is considered so important in the investment man- 
agement industry that the Research Foundation of the CFA Institute (then called 
AIMR), a leading professional association in the investment management indus- 
try, has recently devoted a monograph to the question (see Siegel (2003) for more 
details). 

Although we do not intend on entering the benchmark design and specification 
debate?, the results we derived reveal a new dimension to the problem: the impact 
of the investment benchmark on a fund manager’s investment strategy depends 
to a significant extent on the risk-aversion of the manager or investor. Indeed, 
we have shown that in the risk-sensitive benchmarked asset management model, 
the importance of the benchmark in the investment decision increases as the risk 
aversion of the investor increases. In fact, the risk-sensitivity 0 is a direct indication 
of the amount of active risk that an investor is willing to take. Thus, the exercise 
consisting in setting an investment benchmark becomes increasingly irrelevant as 
risk-aversion gets nearer to 0: Kelly criterion investors invest in the log-utility 
portfolio and have no regards for a benchmark. 

Before setting a benchmark, it is imperative to know the investor's risk prefer- 
ences as this will influences his/her investment style. A very rough correspondence 
would be to identify extremely high risk aversion investors to passive portfolio man- 
agers whose mandate is to track an index, low risk aversion investors to purely active 
managers, whose mandate is to generate the best possible risk-adjusted return in 


2We have, after all, conveniently assumed throughout that an appropriate care had been taken 
beforehand to select the benchmark. 
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a specific market, and medium risk-aversion investors to so called “core plus" or 
“index plus" managers who track closely and index while trying to generate some 
incremental extra return or “alpha”. The only time these three broad categories of 
investors will coincide is when their benchmark is the Kelly portfolio. 

Going back to our original question, Warren Buffet focuses on long-term growth 
maximization rather than the avoidance of short-term losses (see Ziemba (2005) for 
a discussion of Berkshire Hathaway’s risk-adjusted performance and Thorp (2006) 
for a Kelly criterion perspective). In this sense, Mr. Buffet behaves similarly to a 
Kelly criterion bettor. The RSBAM model then demonstrates that benchmarks are 
irrelevant to Mr. Buffet’s investment strategy. 

As a concluding note, since at least 1995, the Berkshire Hathaway annual report 
features a table presenting a proxy for the performance of the Berkshire Hathaway 
share against the return of the S&P500 going back to 1965. Warren Buffet, however, 
does not recognize the S&P500 as his benchmark. He made it clear that this 
comparative table had been included in the annual reports at the request of investors 
but that he personally sees little value in it. 


8 Conclusion 


Historically, Kelly and fractional Kelly strategies have played an important part 
in asset-only investment management. But the essential role of fractional Kelly 
strategies does not stop at this level: it also extends to benchmarks and even to 
asset and libaility management (see Davis and Lleo (2008b)). In benchmarked 
asset allocation problems, fractional Kelly strategies highlight the fundamental split 
between a purely active growth maximizing strategy (the Kelly criterion portfolio) 
and pure benchmark replication. Moreover, the Kelly fraction, which represent the 
fraction of one's wealth invested in the Kelly portfolio, is a sole function of the 
investor's degree of risk-sensitivity. 

This has profound implications. A Kelly investor, with 0 risk-sensitivity, will 
invest solely in the purely active portfolio and as a result, benchmarks are irrelevant 
to Kelly investors. On the other end of the spectrum, investors with extremely 
large risk-sensitivity will tend to opt for full replication of the benchmark: they 
are passive investors. Somewhere in the middle, we find investors with an average 
risk-sensitivity, who adopt core-plus strategies: a basic replication of the benchmark 
with some departure in order to generate excess returns. The only time these three 
broad categories of investors will coincide is when their benchmark is the Kelly 
portfolio. 
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Abstract 


This paper introduces a general market modeling framework, the benchmark 
approach, which assumes the existence of the numéraire portfolio. This is the 
strictly positive portfolio that when used as benchmark makes all benchmarked 
non-negative portfolios supermartingales, that is intuitively speaking downward 
trending or trendless. It can be shown to equal the Kelly portfolio, which maxi- 
mizes expected logarithmic utility. In several ways, the Kelly or numéraire portfo- 
lio is the “best” performing portfolio and cannot be outperformed systematically 
by any other non-negative portfolio. Its use in pricing as numéraire leads di- 
rectly to the real world pricing formula, which employs the real world probability 
when calculating conditional expectations. In a large regular financial market, 
the Kelly portfolio is shown to be approximated by well-diversified portfolios. 


JEL Classification: G10, G13 


1991 Mathematics Subject Classification: primary 90A12; secondary 60G30, 
62P20. 


Keywords and phrases: Kelly portfolio, real world pricing, numéraire portfolio, 
strong arbitrage, diversification. 


Introduction 


The classical asset pricing theories, as developed in Debreu (1959), Sharpe (1964), 
Lintner (1965), Merton (1973a,b), Ross (1976), Harrison and Kreps (1979), Con- 
stantinides (1992), and Cochrane (2001) represent forms of relative pricing, since 
the existence of an equivalent risk neutral probability measure is typically requested. 
However, relative pricing ignores in the long run the presence of the equity premium. 
This paper presents an extension of classical risk neutral pricing in a general model- 
ing framework, the benchmark approach, where a form of absolute pricing naturally 
emerges, see Platen (2002) Platen (2002, 2006) and Platen and Heath (2006). The 
benchmark represents here the “best” performing, strictly positive, tradable port- 
folio, which turns out to be the Kelly portfolio, see Kelly (1956). Most results we 
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present can be stated in a model independent manner. The benchmark approach 
only requires the existence of the benchmark, playing the role of the numéraire 
portfolio, originally discovered in Long (1990). The benchmark approach with the 
Kelly portfolio as benchmark is directly applicable in portfolio optimization and 
covers a wider modeling world than classical theories allow. 

We will see that non-negative portfolios, when denominated in units of the 
benchmark are supermartingales. This supermartingale property has been already 
observed for particular settings, for instance, in Long (1990), Bajeux-Besnainou 
and Portait (1997), Becherer (2001), Platen (2002), Bühlmann and Platen (2003), 
Platen and Heath (2006), Karatzas and Kardaras (2007), and Kardaras and Platen 
(2008b). For the purpose of pricing, the inverse of the benchmark plays the role 
of the stochastic discount factor in the language of Cochrane (2001). Within this 
paper, there will be no major additional assumption made beyond the request on the 
existence of a numéraire portfolio. A series of fundamental results follows from this 
assumption by a few basic arguments. Some of these describe "best" performance 
properties of the Kelly portfolio which underline its fundamental importance in 
investing. 

In Platen and Heath (2006), the benchmark approach has been described for 
jump-diffusion markets. Probably the most striking feature of the rich benchmark 
framework is the possible co-existence of several self-financing portfolios that may 
perfectly replicate one and the same payoff. The presence of different replicating 
portfolios is not consistent with the classical Law of One Price. The proposed 
Law of the Minimal Price identifies for a given contingent claim the corresponding 
minimal replicating portfolio process, which characterizes also in an incomplete 
market the economically correct price process. The Law of the Minimal Price 
yields directly the real world pricing formula, where the expectation is taken with 
respect to the real world probability measure and the Kelly portfolio appears as the 
numéraire. No change of probability measure is performed under real world pricing, 
which extends the classical risk neutral approach. Consequently, the request on the 
existence of an equivalent risk neutral probability measure is here avoided and a 
much wider modeling world is available. The real world pricing formula generalizes 
the risk neutral pricing formula as well as the actuarial pricing formula. These 
formulae represent the central pricing rules in their respective streams of literature 
and nothing else is here requested than the existence of the Kelly portfolio. The 
fact that the Law of One Price does not hold does not create strong arbitrage 
opportunities in the sense of this paper. We will see that there is no economic 
reason to exclude any weaker form of arbitrage as under the classical no-arbitrage 
approach. 

A Diversification Theorem will be derived along the lines of Platen (2005). It 
states that any diversified portfolio in a regular financial market approximates the 
Kelly portfolio. This makes the benchmark approach rather practical and allows 
one to interpret a well-diversified portfolio as proxy for the Kelly portfolio. For 
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further results on the benchmark approach the reader is referred to Platen and 
Heath (2006). 

The remainder of the paper is organized as follows: It introduces the numéraire 
portfolio in Section 2. Section 3 derives various manifestations of "best" perfor- 
mance for the Kelly portfolio. The Law of the Minimal Price is proposed in Sec- 
tion 4. In Section 5 we discuss the concept of real world pricing. Section 6 introduces 
a strong form of arbitrage. Finally, Section 7 provides a version of the Diversification 
Theorem. 


2 Benchmark Approach 


Along the lines of Platen (2002) (Platen, 2002, 2006) and Platen and Heath (2006), 
we consider a general financial market in continuous time with d risky, non-negative, 
primary securities, d € {1,2,...}. These securities could be, for instance, shares, 
currencies or other traded securities. Denote by SÍ the value of the corresponding 
j'^ primary security account, j € {0,1,...,d}, at time t> 0. This non-negative 
account holds units of the j*^ primary security together with all dividends or interest 
payments reinvested. The 0*^ primary security account S? denotes the value of the 
locally riskless savings account at time t > 0. The dynamics of the primary security 
accounts need not be specified when formulating the main statements. 

The market participants can form self-financing portfolios with primary security 
accounts as constituents. A portfolio value S? at time t is described by the number 
g of units held in the jth primary security account S for all j € {0,1,...,d}, 
t> 0. For simplicity, assume that the units of the primary security accounts are 
perfectly divisible, and that for all t € [0, co) the values 6?, 6},..., 6%, for any given 
strategy 5 = (64 = (6?,6},...,6%)', t > 0}, depend only on information available 
at time t. The portfolio value at this time is given by the sum 


d 
S cO ADS 
j=0 


We consider only self-financing portfolios where changes in their value are only due 
to changes in the values of the primary security accounts. We neglect any market 
frictions or liquidity effects. 

By V; denote the set of all strictly positive, finite, self-financing portfolios with 
initial capital x > 0. The benchmark approach employs a very special strictly 
positive portfolio as benchmark, which we denote by S°* c Vi. Later it will 
become apparent that it is the Kelly portfolio, see Kelly (1956), which is in several 
ways the “best” performing strictly positive tradable portfolio. On the other hand, 
it will turn out that it is also the natural numéraire , see Long (1990), for pricing 
any type of claim when employing the real world probability for calculating the 
respective conditional expectation in the resulting pricing formula. 
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Let E(X} denote the conditional expectation under the real world probability 
measure P given the information available at time t > 0. 


Definition 2.1. For given x > 0, a strictly positive, finite, self-financing portfolio 
S^ c Vi is called a numéraire portfolio if all non-negative portfolios S, when 
denominated in units of S°*, are supermartingales. 


The notion of a numéraire portfolio was originally introduced by Long (1990) in 
a rather special setting. Later it was generalized in Bajeux-Besnainou and Portait 
(1997) and Becherer (2001). These authors worked under classical no-arbitrage 
assumptions that are consistent with the existence of an equivalent risk neutral 
probability measure, see Delbaen and Schachermayer (1998). More recently, Platen 
(2002), Bühlmann and Platen (2003), Platen and Heath (2006), and Platen (2006) 
emphasized that in a more general setting one still obtains a viable financial market 
model, as long as a numéraire portfolio exists. Also Fernholz and Karatzas (2005), 
Karatzas and Kardaras (2007), and Kardaras and Platen (2008b) consider financial 
market models beyond the classical no-arbitrage framework. 

To provide a basis, let us formulate the only major assumption of the paper: 


Assumption 2.2. For given x > 0, there exists a numéraire portfolio S?* € vy. 


This assumption is satisfied for a wide range of financial market models used in 
practice. For instance, in Platen and Heath (2006) it has been verified for jump- 
diffusion markets. Karatzas and Kardaras (2007) and Kardaras and Platen (2008b) 
confirm the validity of Assumption 2.2 for a wide range of semi-martingale markets. 

Now, under the benchmark approach we choose the numéraire portfolio as 
benchmark. The benchmarked value Ey of a portfolio S? is given by the ratio 
8-5 

S» 

for all t 2 0. Definition 2.1 leads by Assumption 2.2 directly to the following 
conclusion: 


Corollary 2.3. The benchmarked value E of any non-negative portfolio S? sat- 
isfies the supermartingale property 


ô? > (82) (2.1) 
for all 0 € t € s « co. 


Consequently, the currently observed benchmarked value of à non-negative port- 
folio is always greater than or equal to its expected future benchmarked value. This 
means intuitively, if there were any trend in a benchmarked non-negative portfolio, 
then this trend could only point downward. 

The supermartingale property (2.1) is the fundamental property of a financial 
market. For instance, it yields easily the uniqueness of the benchmark by the 
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following argument: Consider two strictly positive portfolios that are supposed 
to be numéraire portfolios. According to Corollary 2.3, the first portfolio, when 
expressed in units of the second one, must satisfy the supermartingale property 
(2.1). By the same argument, the second portfolio, when expressed in units of the 
first one, must also satisfy the supermartingale property. Consequently, by Jensen's 
inequality the portfolios have to be identical, and the value process S?* c V of a 
numératre portfolio is unique. Note that the stated uniqueness does not imply that 
the number of units invested has to be unique, which is due to potential redundancies 
in primary security accounts. 


3 Best Performance of the Benchmark 


In this section we list several manifestations of the fact that our benchmark S?* 
is the “best” performing, strictly positive, tradable portfolio and equals the Kelly 
portfolio: 


3.1 Numéraire portfolio 


First, the definition of the numéraire portfolio S?* itself, given by Definition 2.1, 
expresses the fact that this portfolio performs “best” in the sense that the expected 
returns of benchmarked non-negative portfolios never become strictly positive. 


3.2 Kelly portfolio 


As a second manifestation of “best” performance, we derive the growth optimality 
of the benchmark, which identifies it very generally as the Kelly portfolio, a fact 
that has been documented for special models in the literature, for instance, in Long 
(1990). The expected growth dis of a strictly positive portfolio S? over the time 
period (t,t +h] for t, h > 0 is given by the conditional expectation 


gi, = Ex (In (A? n)) 


of the logarithm of the portfolio ratio 


5 
AS a= SU Lh 
t, ó 
St 


To identify the strictly positive portfolio that maximizes the expected growth, let us 
perturb at time t > 0, the investment in a given strictly positive portfolio SÈ € VW, 
X > 0, by some small fraction e € (0,2 
analyzing the changes in the expected growth of the perturbed portfolio S* define 
the derivative of expected growth in the direction of S? as the right hand limit 


) of some non-negative portfolio Sê. For 


gs, 
Oe 


c def ó 
Je lim = (ois = Gen) (3.1) 


e= 
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for t,h > 0. Obviously, if the portfolio that maximizes expected growth coincides 
in (3.1) with the portfolio SŽ, then the resulting derivative of expected growth will 
never be greater than zero for all non-negative portfolios S?. This leads to the 
following definition of growth optimality: 


Definition 3.1. A strictly positive portfolio S? is called growth optimal if the 
corresponding derivative of expected growth is less than or equal to zero for all 
non-negative portfolios S?, that is, 


era 
ðe |..g9 


for all t, h > 0. 


Note that this definition is different to the classical characterization of the Kelly 
portfolio or growth optimal portfolio. In the literature, it is typically based on the 
maximization of expected logarithmic utility from terminal wealth, as used in Kelly 
(1956) and later also employed in a stream of literature, including Latané (1959), 
Breiman (1960), Hakansson (1971), Merton (1973a), Roll (1973) and Markowitz 
(1976), among many others. It is clear from Definition 3.1 and the standard log- 
utility definition of the Kelly portfolio, see for instance Kelly (1956) and Thorp 
(1972), that the above growth optimal portfolio equals the Kelly portfolio. The 
following result provides a convenient method for the identification of the benchmark 
or numéraire portfolio in à given investment universe by searching for its Kelly 
portfolio. 


Theorem 3.2. The numéraire portfolio is growth optimal. 


Proof: Fore c (0, 2) two consecutive times t and t+ h with h > 0, and a non- 
negative portfolio SÊ, with S? > 0, consider the perturbed portfolio S°: under the 
choice S? = S?" in (3.1), yielding a portfolio ratio A? = £ Ad, — (1 — ©) AP > 
0. One then obtains by the well-known inequality ln(x) < z—1lforz» 0, 
the relations 


Gh =- In| 4] < thi] = th 4 (3.2) 
moe E E AT Ati 
and 
- 1 A* 1 A* A — A* 
Gin = n( 2| 2 is a an (3.3) 
E Arh E Arh Arh 


Since ASS, > 0 one obtains from (3.3) for EUM — Aes, > 0 the inequality 


Gh, > 0 (3.4) 
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and for 42, — Aj*, < 0, because of e € (0, 1) and A2, > 0, the relation 
: AT 1 1 
GU > -t = > >—2 (3.5) 
th Ave AT, 1 —£ 
th l-ete— 
AGS 


Summarizing (3.2)-(3.5) yields the upper and lower bounds 
A 
-2 < G, x T -1 (3.6) 


where by Definition 2.1 one has 


Aŝ, 
R | =+] <1 (3.7) 
($ 
By using (3.6) and (3.7) it follows by the Dominated Convergence Theorem, see 
Shiryaev (1984), that 
99j5, 
OE 


| 
B 
m 
E S 
Q 
T 
— 
Il 
i 
A cC 
B 
R 
LO 
Nr 


e=0 


Ave A 
E, 2 In | A =F; |" )=1 
£ Arh j le=0 Ath 


This proves by Definitions 2.1 and 3.1 that the numéraire portfolio S?* is growth 
optimal. 


3.3 Long term growth 


To formulate a third manifestation of “best” performance for the benchmark, define 
the long term growth rate g° of a strictly positive portfolio S? € V+ as the upper 
limit 
: 1 Se ) 
6}: t 
= limsup - ln | — 3.8 
ge mip ( Si (3.8) 
The long term growth rate (3.8) is defined pathwise almost surely and does not 
involve any expectation. By exploiting the supermartingale property (2.1), the 
following fascinating property of the Kelly portfolio will be shown very generally 
below. 


Theorem 3.3. The numéraire portfolio S9?* e€ Vi achieves the maximum long 
term growth rate. This means, when compared with any other strictly positive port- 
folio S? € V+, one has 


g? < g” (3.9) 
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Proof: Similar as in Karatzas and Shreve (1998) consider a strictly positive port- 
folio S? € Vi, x > 0, with the same initial capital as the numéraire portfolio, 
that is, S = S$* = x > 0. By Corollary 2.3, we can use the following maximal 
inequality, derived in Doob (1953), where for any k € (1,2,...) and £ € (0,1) one 
has 


exp{e k} P ( sup 5S? > expf{e 2 < Eo (St) «$$ -1 


k<t<oo 
One finds for fixed £ € (0,1) that 
oo . [o9] 
»( sup m (82) > eh) < So exp{-ek} < oo 
k=1 K<t<oo k=1 


By the Borel-Cantelli lemma, see Shiryaev (1984), there exists a random variable 
ke such that for all k > kẹ and t > k it holds that 


In (82) <ek<et 
Therefore, it follows for all k > kę the estimate 


1 a 
sup — In (3?) <e 
t>k É 


which implies that 


ô 1 Sgó* 
limsup 1 In Si <limsup—In{f— | +e 3.10 
ô Ox 
So So 


too t too t 


Since the inequality (3.10) holds for all £ € (0, 1) one obtains with (3.8) the relation 
(3.9). 


According to Theorem 3.3, the trajectory of the Kelly portfolio outperforms in 
the long run those of all other strictly positive portfolios that start with the same 
initial capital. This property is independent of any model choice and, therefore, 
very robust. An investor, who is aiming in the long run for the highest possible 
wealth, has to invest her or his total tradable wealth into the Kelly portfolio. 


3.4 Systematic outperformance 


Over short and medium time horizons, almost any strictly positive portfolio can gen- 
erate larger returns than those exhibited by the Kelly portfolio. However, the fourth 
manifestation of “best” performance will show that such short term outperformance 
cannot be achieved systematically. To formulate a corresponding statement prop- 
erly, we will employ the following definition: 


Definition 3.4. A non-negative portfolio S? systematically outperforms a strictly 
positive portfolio S? if 


(i) both portfolios start with the same initial capital SÈ, = sË ; 
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(ii) at a later time t the portfolio value S? is at least equal to S$, that is P(S? > 
S?) = 1, and | 

(iii) the probability for SÊ being strictly greater than S? is strictly positive so that 
p (s? > sj) >0. 


Systematic outperformance of one portfolio by another one is possible under the 
benchmark approach, see Platen and Heath (2006). The above notion of systematic 
outperformance was introduced in Platen (2004) and was motivated by the super- 
martingale property (2.1). It relates in some sense to the notion of relative arbitrage, 
later studied in Fernholz and Karatzas (2005), and also to the notion of a maxi- 
mal element earlier employed in Delbaen and Schachermayer (1998). The following 
result presents a fourth manifestation of “best” performance of the benchmark: 


Theorem 3.5. The numéraire portfolio cannot be systematically outperformed 
by any non-negative portfolio. 


Proof: Consider a non-negative portfolio S? with benchmarked value Ey =lata 
given time t > 0, where $? > 1 almost surely at some later time s € [t,00). Then 
it follows by the supermartingale property given in Corollary 2.3 that 


0» E, ($3 - 8°) = E (83 - 1) 


Since one has $$ > 1 almost surely and E;($2) < 1, it can only follow that $9 = 1. 
This means that one has at time s the equality S? = S?* almost surely. Therefore, 
according to Definition 3.4 the portfolio S? does not systematically outperform the 
numéraire portfolio. 


As a consequence of Theorem 3.5, one can conclude that in the given very 
general modeling setting, no active fund manager can systematically outperform 
the benchmark. On the other hand, if the market portfolio is not the numéraire 
portfolio, which is most likely the case, and a fund approximates well the Kelly 
portfolio, then this fund will not be outperformed systematically by the market 
portfolio or any other significantly different portfolio. In the long run its path will 
by property (3.8) beat the path of the market portfolio almost surely. 

There are further manifestations of “best” performance of the Kelly portfolio. 
For instance, in Kardaras and Platen (20082), it is shown that the Kelly portfolio 
minimizes some expected market time to reach a given wealth level. Results in this 
direction can be found, for instance, in Browne (1998). 


4 The Law of the Minimal Price 


The fundamental supermartingale property (2.1) ensures that the maximum ex- 
pected return of a benchmarked non-negative portfolio can at most equal zero. In 
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the case when it equals zero for a given benchmarked price process for all time in- 
stances, then the current benchmarked value of the price is always the best forecast 
of its future benchmarked values. In this case, one has equality in relation (2.1) 
and we call such a price process fair. A benchmarked fair price process forms a 
martingale. In general, not all primary security accounts and portfolios need to be 
fair under the benchmark approach. This is the key property that differentiates this 
approach from the classical approaches. For instance, there may exist benchmarked 
portfolios that form local martingales but are not true martingales, as is exten- 
sively discussed in Platen and Heath (2006). Furthermore, note that non-negative 
fair price processes are somehow minimal, as we shall see below. 

It may be puzzling to some readers that discounting with another discount fac- 
tor than the savings account ought to lead to "fair" prices or could be particularly 
meaningful. Below we will give a valid reason why the Kelly portfolio is the “uni- 
versal currency" that should be used for discounting when pricing under the real 
world probability. It stems from the fact that the Kelly portfolio represents the 
best performing portfolio and, thus, the natural numéraire for valuation when the 
real world probability measure for calculating expectations. 

The following Law of the Minimal Price substitutes the widely postulated clas- 
sical Law of One Price, which no longer holds in our general setting: 


Theorem 4.1. Law of the Minimal Price. If a fair portfolio process replicates 
a given non-negative payoff at a given maturity date, then this portfolio represents 
the minimal replicating portfolio among all non-negative portfolios that replicate this 
payoff. 


Proof: A stochastic process $6 , which satisfies relation (2.1) is a supermartingale, 
see Shiryaev (1984). When equality holds in (2.1) then the process is fair and its 
benchmarked value process is a martingale. Within a family of non-negative super- 
martingales with the same value at a given future payoff date, it is the martingale 
among these supermartingales which attains almost surely the minimal possible 
value at all times before the maturity date. This fundamental fact about the opti- 
mality of martingales in a family of supermartingales proves directly Theorem 4.1. 


'The existence of unfair price processes under the benchmark approach creates 
new realistic effects that can be modeled and are not captured by any classical 
no-arbitrage framework. For a given replicable payoff, it follows by Theorem 4.1 
that the corresponding fair replicating portfolio describes the least expensive hedge 
portfolio. This is also the economically correct price process in a competitive mar- 
ket. We emphasize that the Law of the Minimal Price generates a unique price 
system for contracts and derivatives under the benchmark approach which only re- 
lies on the existence of a numéraire portfolio. As shown in Platen (2009), pricing 
purely based on hedging or classical no-arbitrage arguments, as employed under the 
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classical Arbitrage Pricing Theory, see Ross (1976), can lead in our general setting 
to significantly more expensive prices than provided by fair price processes. 


5 Real World Pricing 


Define à contingent claim Hr as a non-negative payoff delivered at maturity 
T € (0,00), which is expressed in units of the domestic currency and has finite 


expectation 
H 
Ep (Sc) « oo. (5.1) 


By the Law of the Minimal Price one can identify the corresponding fair price 
process via the following real world pricing formula: 


Corollary 5.1. If for a contingent claim Hr, T € (0,00), there exists a fair 
portfolio S?" that replicates this claim at maturity T such that Hr = Sat, then 
its minimal replicating price at time t € |0, T] is given by the real world pricing 


formula 
si = s? B, (S) (5.2) 


Relation (5.2) is called the real world pricing formula because it involves the 
conditional expectation E, with respect to the real world probability measure. This 
formula can be interpreted in the sense of Cochrane (2001) as a pricing formula 
that uses the stochastic discount factor (S?*)-!. Also the closely related use of the 
corresponding state price density, pricing kernel and deflator are consistent with 
the real world pricing formula (5.2) under appropriate assumptions. However, all 
the classical pricing approaches exclude classical arbitrage, which is equivalent to 
the existence of an equivalent risk neutral probability measure, see Delbaen and 
Schachermayer (1998). In this manner, they postulate the Law of One Price, see 
for instance Ingersoll (1987), Long (1990), Constantinides (1992), Duffie (2001), and 
Cochrane (2001) for classical no-arbitrage settings. The benchmark approach with 
its real world pricing concept does not require a risk neutral probability measure 
to exist or equivalent classical no-arbitrage constraints and goes beyond the Law 
of One Price. The real world pricing formula (5.2) only requires the existence of 
the numéraire portfolio and the finiteness of the expectation in (5.1). No measure 
transformation needs to be applied. 

An important special case of the real world pricing formula (5.2) arises when 
Hr is independent of S2. In this case one obtains the actuarial pricing formula 


S)" = P(t, T) E,(Hr) (5.3) 
with the fair zero coupon bond price 


P(t, T) = Sb E (55) 3) (5.4) 
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that pays one monetary unit at maturity T. The fair zero coupon bond price 
P(t,T) provides the discount factor in (5.3) in a way as it has been used intuitively 
by actuaries for obtaining the, so called, net present value of Hr. The real world 
pricing formula provides a rigorous and general derivation of the actuarial pricing 
formula that has been in use for centuries. 

Let us now derive risk neutral pricing as a special case of real world pricing. For 
this purpose we rewrite for t — 0 the real world pricing formula (5.2) in the form 


Sot = F ICE. Hr) (5.5) 


0 
while employing the benchmarked normalized savings account Ap = a By the 


supermartingale ae (2.1) of the normalized benchmarked savings account pro- 


cess A= {Ay = = Ao > Eo(Ar). Together with equation (5.5) 
SQ" 
this yields the T 
0 
‘ (Ar E Hr) 
Si" < z (5.6) 
Eo(Ar) 


If the benchmarked savings account is not a martingale, then equality does not 
hold in relation (5.6). To ensure equality in (5.6), one needs to impose the strong 
assumption that the savings account is fair, that is, intuitively its benchmarked 
value has no trend and is a martingale. In this particular case, the expression on 
the right hand side of (5.6) can be interpreted by Bayes’ formula as the conditional 
expectation of the discounted contingent claim under the, in this case existing, 
equivalent risk neutral probability measure Q with Radon-Nikodym derivative Ap = 
2$. Only in this case when A is a martingale the relation (5.6) yields, in general, 
for a contingent claim H7 the classical risk neutral pricing formula 


ste = 39 (Sh) 


see, for instance, Harrison and Kreps (1979) or Karatzas and Shreve (1998). Here 
Eg denotes the conditional expectation at time t — 0 under the equivalent risk 
neutral probability measure Q. By inequality (5.6) it follows that the fair derivative 
price is never more expensive than the price obtained under some formal application 
of the standard risk neutral pricing rule. 

Finally, it shall be noted for not perfectly hedgable contingent claims that utility 
indifference pricing, in the sense of Davis (1997), leads in the case of incomplete 
jump-diffusion markets again to the real world pricing formula (5.2), see Platen and 
Heath (2006). Under real world pricing, the hedgable part of the claim is replicated 
via the minimal possible hedge portfolio and the benchmarked unhedgable part 
remains untouched. In some sense, the real world pricing formula provides the least 
squares projection of a given unhedgable benchmarked contingent claim into the set 
of current benchmarked prices. The benchmarked hedge error has zero mean and 
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its variance is minimized. This generalizes also the notion of local-risk minimization 
for pricing in incomplete markets, as advocated in Fóllmer and Schweizer (1991), 
Hofmann et al. (1992), and Schweizer (1995). In practice, when benchmarked hedge 
errors can be diversified in the large book of a well diversified bank or insurance 
company, then the total benchmarked hedge error vanishes by the Law of Large 
Numbers asymptotically from the bank's trading book and the market becomes 
asymptotically complete from this perspective. This shows that in practice real 
world pricing makes perfect sense. Any systematically more expensive pricing would 
make an institution less competitive. On the other hand, any lower prices would 
make it unsustainable. 


6 Strong Arbitrage 


Since the benchmark approach goes significantly beyond the classical no-arbitrage 
world, it is of importance to clarify the potential existence of arbitrage opportu- 
nities. In the literature, there exist many different mathematical definitions of 
arbitrage, and one has to ensure that the given modeling framework is economi- 
cally viable. Obviously, arbitrage opportunities can only be exploited by market 
participants. These have to use their portfolios of total tradable wealth when trying 
to exploit potential arbitrage opportunities. Due to the established legal concept 
of limited liability, only non-negative total tradable wealth processes have to be 
considered when studying the exploitation of potential arbitrage. Therefore, any 
realistic arbitrage concept has to focus on non-negative, self-financing portfolios. 

An obvious, strong form of arbitrage arises when a market participant can gen- 
erate strictly positive wealth from zero initial capital. This leads to the following 
definition of strong arbitrage, which was introduced in Platen (2002) motivated by 
the supermartingale property (2.1): 


Definition 6.1. A non-negative portfolio S? is a strong arbitrage if it starts with 
zero initial capital, that is SÈ = 0, and generates strictly positive wealth with strictly 
positive probability at a later time t € (0,00), that is, P(S? > 0) > 0. 


The exclusion of the above form of arbitrage has been independently argued 
for on purely economic grounds by Loewenstein and Willard (2000). Important is 
that if only strong arbitrage is excluded, then weaker forms of arbitrage may still 
exist. However, this does not harm the economic viability of the market model. 
For instance, there may exist, so-called, free snacks and cheap thrills, in the sense 
of Loewenstein and Willard (2000), which usually represent situations of system- 
atic outperformance in the sense of Definition 3.4. Also free lunches with vanishing 
risk, as excluded in Delbaen and Schachermayer (1998), can exist without creating 
complications from an economic point of view. Furthermore, by looking at any of 
the weaker forms of arbitrage one realizes that these cannot be exploited in prac- 
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tice without providing adequate collateral, see Platen and Heath (2006). This re- 
quest, however, makes the corresponding theoretical notions of weaker forms of arbi- 
trage questionable from their practical relevance. By exploiting the supermartingale 
property (2.1), the following result can be established: 


Theorem 6.2. There does not exist any non-negative portfolio that is a strong 
arbitrage. 


Proof: For a non-negative portfolio S$, which starts with zero initial capital, it 
follows by the supermartingale property given in Corollary 2.3 that 


0 = SÊ = z SÈ > x Eo (82) = z E(S?) > 0 


for t > 0, where E(-) denotes expectation under the real world probability. By the 
nonnegativity of S? and the strict positivity of 52*, the event S? > 0 can only have 
zero probability, that is 


P(S >0) =0 


This leads to the conclusion that S? equals zero for all t > 0, which proves by 
Definition 6.1 the Theorem 6.2. 


Theorem 6.2 states that strong arbitrage is automatically excluded in the given 
general benchmark framework. Therefore, different to the classical no-arbitrage ap- 
proaches, pricing by excluding strong arbitrage does not make any sense. Instead, 
one should use real world pricing which is economically and also theoretically mean- 
ingful, as we have seen. 


7 Diversification 


To conclude the paper, we consider the practical problem of identifying or approx- 
imating the Kelly portfolio of a given market. We will indicate how to construct 
proxies for the Kelly portfolio that can be used for portfolio optimization and val- 
uation under the benchmark approach. For simplicity, let us consider a continuous 
financial market with its benchmarked jt" primary security account value Si at 
time t satisfying the driftless stochastic differential equation (SDE) 


j 
dS} = 5} X` ob" aw? (7.1) 
k=1 
for t € [0,00), with $2 > 0, see Platen and Heath (2006). The more general setting of 
jump diffusion markets has been considered in Platen (2005) and forthcoming work 
will generalize the results presented below, avoiding any particular assumptions. In 
(7.1) W* = (WF, t € [0,co)} denotes an independent standard Wiener process, 
k € {1,2,...}. A benchmarked portfolio S94 with fractions Tl invested at time 
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t in the jth primary security account, where j € {1,2,...,d} with d € {1,2,...}, 
satisfies the SDE 


d j 
aS? = Si Soak) M ott aW? (7.2) 
j=l k=1 


for t € [0, 00), with S4 > 0. 

It is clear that the benchmarked Kelly portfolio $; equals the constant one and 
its diffusion coefficients vanish. This leads us to define a sequence (S92)46 6.5... of 
benchmarked approximate Kelly portfolios as a sequence of strictly positive portfolios 
such that for all € > 0 we have 


2 
d 


d 
lim P| 3 |S riat 2e|=0 (7.3) 


doo c 
k=1 \j=l 


for t € [0, oo). 

Furthermore, a sequence (S92)4e 6.3... of benchmarked portfolios is called a 
sequence of benchmarked diversified portfolios if some constants Kı, Ke € (0,00) 
and K3 € {1,2,...} exist independently of d, such that for d € (Ka, Ka + L,...] 
each fraction UM is bounded in the form 


Ke 


<q < —* 
O < 7m 4S di^ 


(7.4) 
almost surely for all j € {1,2,...,d} and t € [0, oo). 

If most benchmarked primary security accounts would have the same driving 
Wiener process, then it may become difficult to form a benchmarked portfolio with 
vanishing volatility. To avoid such a situation we assume that the given continuous 
financial market is regular, which means that for all t € |0,oo) and k € {1,2,...} 
there exists a finite adapted stochastic process C = (Ci, t € [0,00)} such that 


2 


" 
e| <a (7.5) 


almost surely. 
This allows us to prove, similar as in Platen (2005), and Platen and Heath 
(2006), the following Diversification Theorem: 


Theorem 7.1. In a regular continuous financial market each sequence of diver- 
sified portfolios is a sequence of approximate Kelly portfolios. 


Proof: For a sequence of diversified portfolios it follows from (7.4) and (7.5) that 
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for each d € {1,2,...} one has 


2 


which proves (7.3). 


8 Conclusion 


A general financial modeling and pricing framework, the benchmark approach, has 
been presented, which only assumes the existence of the numéraire portfolio. It 
turns out that this portfolio coincides with the Kelly portfolio, which is in several 
ways the “best” performing strictly positive portfolio. It can be used as benchmark 
in the traditional sense of portfolio optimization but also as numéraire in deriva- 
tive pricing. Under the benchmark approach, the classical Law of One Price does 
generally not hold. It has been replaced by the Law of the Minimal Price, accord- 
ing to which the minimal replicating price process for a given contingent claim is 
trendless when expressed in units of the Kelly portfolio. By exploiting this fact, the 
real world pricing concept emerges with the Kelly portfolio as numéraire and the 
real world probability measure as pricing measure. Real world pricing turns out to 
be the natural pricing concept for nonhedgable contingent claims in an incomplete 
market. Certain weak forms of arbitrage may exist under the benchmark approach 
that are excluded under the classical no-arbitrage approach. However, this does 
not harm the economic viability of the resulting general modeling framework. It 
has been shown that diversified portfolios approximate asymptotically in a regular 
market the Kelly portfolio, as the number of constituents increases. 
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Abstract 


This chapter surveys theoretical research on the long-term performance of fixed- 
mix investment strategies. These self-financing strategies rebalance the portfolio 
over time so as to keep constant the proportions of wealth invested in various 
assets. The main result is that wealth can be grown from volatility. Our findings 
demonstrate the benefits of active portfolio management and the potential of 
financial engineering. 


1 Introduction 


Investment advice is usually based on some optimality principle such as the maxi- 
mization of expected utility or the growth rate. The present chapter takes a broader 
view by studying generic features of an investment style based on constant propor- 
tions, or fixed-mix, strategies. These self-financing strategies aim to maintain fixed 
proportions between the value of portfolio positions by trading in the market at 
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specific points in time. It is an active portfolio management that rebalances posi- 
tions by selling assets, whose portfolio value exceed the given benchmark to finance 
the purchase of those assets with a too low weight in the portfolio. 

'The significance of constant proportions strategies for investment science was 
established in Kelly (1956)'s work on information theory and its application to 
betting markets. A detailed account is provided in Part I of this volume. Kelly's 
research was inspired by Claude Shannon's lectures on investment problems in which 
the founder of the mathematical theory of information outlined his pioneering ideas 
in the field of investment science (though he never published in it); see Cover (1998) 
for the history of this approach. 

'The optimality property of the Kelly rule, which maximizes the growth rate of 
capital in the case of independent and identically distributed returns on investment, 
motivated the research on the log-optimum investment principle, Breiman (1961), 
Algoet and Cover (1988), MacLean, Ziemba and Blazenko (1992), Hakansson and 
Ziemba (1995), Browne and Whitt (1996), Cover (1991), Li (1998), and others. Con- 
stant proportions strategies have been studied in many different frameworks, e.g., 
Browne (1998), Luenberger (1998), Aurell et al. (2000) and Aurell and Muratore- 
Ginanneschi (2000), Fernholz (2002), Fernholz and Karatzas (2005), Fernholz et al. 
(2005), Evstigneev, Hens and Schenk-Hoppé (2009a, 2009b), Kuhn and Luenberger 
(forthcoming). This approach has proved quite successful in practical finance as 
witnessed by a considerable body of empirical literature, see Thorp (1971), Perold 
and Sharpe (1988), Ziemba and Mulvey (1998), Mulvey (2001, 2009), Dries et al. 
(2002), Dempster et al. (2003), Mulvey et al. (2007), and others as well as Part VI 
in this volume. 

In contrast to this literature on investment, we will ignore questions of opti- 
mality of trading strategies and will not make use of expected utility or related 
concepts. Our goalis to analyze the performance of arbitrary fixed-mix (constant 
proportions) strategies in markets with different characteristics. We are only in- 
terested in ‘generic’ properties of (not necessarily optimal) fixed-mix strategies and 
their performance relative to buy-and-hold strategies. 

This survey is based on the authors’ research, Evstigneev and Schenk-Hoppé 
(2002) and Dempster, Evstigneev, and Schenk-Hoppé (2003, 2007, 2008). In these 
papers, we study the wealth dynamics of investors employing fixed-mix strategies. 
Very general (and mostly counter-intuitive) results on fixed-mix strategies are ob- 
tained under the assumption that markets will exhibit some degree of stationarity 
either of (relative) prices or returns. This assumption of stationarity of asset returns 
is widely accepted in financial theory (and, thus, the basis for practical investment 
advice) allowing, as it does, expected exponential price growth and mean reversion, 
volatility clustering and very general intertemporal dependence, such as long mem- 
ory effects, of returns. Stationarity of returns for instance is a salient feature of the 
Cox-Ross-Rubinstein binomial asset pricing model. 


1Of course it is well-known that optimally-chosen rebalancing strategies perform at least as well 
as any buy-and-hold portfolio, see Part II in this volume. 
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The remainder of this chapter is organized as follows. Section 1.1 invites the 
readers to test their intuition while Section 1.2 provides a very simple illustrative 
example. The general theory of constant proportions strategies in stationary mar- 
kets is covered in Section 2. A generalization to fixed-mix strategies as well as an 
application to currency market models is given in Section 3. The most intrigu- 
ing case of stock markets with stationary returns, covered in Section 4, delivers 
counter-intuitive results that even experienced researchers find puzzling. Section 5 
discusses several prominent explanations for volatility-induced growth and provides 
an interpretation that agrees with our findings. Section 6 concludes. 


1.1 A test of your intuition 


There is barely any ‘better’ result than one that is (completely) counter-intuitive. 
But before one is tempted to make such a claim on one’s own findings, it seems 
appropriate to test the audience’s intuition. In the following, we ask a few questions 
to which the reader has to promptly guess the answer. Many scientists have been 
asked these questions casually in private, at seminars, and at conferences. The 
following insights were first reported in Dempster, Evstigneev, and Schenk-Hoppé 
(2007). Non-technically minded readers can skip to the next section which provides 
the informal discussion of an intriguing example. 

Stationary markets: puzzles and misconceptions. Consider a market with K as- 
sets whose price process p, = (pl,..., pF) is ergodic and stationary. (It is assumed 
that prices are log-integrable.) The assumption of stationarity of asset prices, per- 
haps after some detrending, seems plausible when modeling currency markets where 
‘prices’ are determined by exchange rates of all the currencies with respect to some 
selected reference currency. 


Question 1. Suppose vectors of asset prices p, = (p},...,p*) fluctuate randomly, 
forming a stationary stochastic process (assume even that the vectors p, are iid (in- 
dependent identically distributed)). Consider a fixed-mix self-financing investment 
strategy prescribing rebalancing one's portfolio at each of the dates t = 1,2,... so 
as to keep equal investment proportions of wealth in all the assets. What is the 
tendency of the portfolio value in the long run, as t > oo? Will the value: (a) 
decrease; (b) increase; or (c) fluctuate randomly, converging (in some sense) to a 
stationary process? 


The audience of our respondents was quite broad and professional, but practi- 
cally nobody succeeded in guessing the correct answer, which is (b). Among those 
with a firm view, nearly all selected (c). There were also a couple of respondents 
who decided to bet on (a). Common intuition suggests that if the market is sta- 
tionary, then the portfolio value for a constant proportions strategy must converge 
in one sense or another to a stationary process. The usual intuitive argument in 
support of this conjecture appeals to the self-financing property. The self-financing 
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constraint seems to exclude possibilities of unbounded growth. This argument is 
also substantiated by the fact that in the deterministic case both the prices and 
the portfolio value are constant. This way of reasoning makes the answer (c) to the 
above question more plausible a priori than the others. 

It might seem surprising that the wrong guess (c) has been put forward even 
by those who have known about examples of volatility pumping for a long time. 
The reason for this might lie in the non-traditional character of the setting where 
not only the asset returns but the prices themselves are stationary. Moreover, 
the phenomenon of volatility-induced growth is more paradoxical in the case of 
stationary prices, where growth emerges "from nothing". In the conventional setting 
of stationary returns, volatility serves as the cause of an acceleration of growth, 
rather than its emergence from prices with zero growth rates. 

A potentially promising attempt to understand the correct answer to Question 
1 might be to refer to the concept of arbitrage. Getting something from nothing 
as a result of an arbitrage opportunity seems to be similar to the emergence of 
growth in a stationary setting where there are no obvious sources for growth. As 
long as we deal with an infinite time horizon, we would have to consider some kind 
of asymptotic arbitrage. All known concepts of this kind? however are much weaker 
than what we would need in the present context. According to our results, growth 
is exponentially fast, unbounded wealth is achieved with probability one, and the 
effect of growth is demonstrated for specific (constant proportions) strategies. None 
of these properties can be directly deduced from asymptotic arbitrage. 

Thus, there are no convincing arguments showing that volatility-induced growth 
in stationary markets can be derived from, or explained by, asymptotic arbitrage 
over an infinite time horizon. But what can be said about relations between station- 
arity and arbitrage over finite time intervals? As is known, there are no arbitrage 
opportunities (over a finite time horizon) if and only if there exists an equivalent 
martingale measure. A stationary process can be viewed as an ‘antipodal concept? 
to the notion of a martingale. This might lead to the conjecture that in a stationary 
market arbitrage is à typical situation. Is this true or not? Formally, the question 
can be stated as follows. 


Question 2. Suppose vectors of asset prices p, — (pt, — oP) form a stationary 
stochastic process (assume even that the vectors p, are iid) Furthermore, suppose 
the first asset k = 1 is riskless with constant price p? = 1. The market is frictionless 
and there are no portfolio constraints (in particular, short selling is allowed). Does 
this market have arbitrage opportunities over a finite time horizon? 


An arbitrage opportunity over a fixed time horizon is understood in the conven- 
tional sense: the existence of a trading strategy that does not require any investment 


2E.g. Ross (1976), Huberman (1982), Kabanov and Kramkov (1994), and Klein and Schachermayer 
(1996). 
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Table 1 Net return of the two assets in the market 


Asset 1 Asset 2 
Heads +50% —5596 
Tails —4096 +100% 


at the initial time, is self-financing, does not incur a loss at the terminal time, and 
makes a gain with strictly positive probability. Again, the answer to this question 
is practically never guessed immediately. The correct answer depends on whether 
the distribution of the price vector of the risky assets is continuous or discrete. 
For example, if (p2,... pry takes on a finite number of values, then an arbitrage 
opportunity exists. But if its distribution is continuous, there are no arbitrage 
opportunities. For details see Evstigneev and Kapoor (2006). 

If your intuition led you astray, the remainder of this chapter will help to offer 
insights into these counter-intuitive results. Even if your intuition led you to the 
correct answers, you might be interested in the precise formulation of the results 
and the ideas behind their proofs. 


1.2 An illustrative example 


The potential of constant proportions investment strategies for financial growth can 
be demonstrated by means of the following example which seems simple enough to 
be discussed at high-school level. A related example is presented in Dempster, 
Evstigneev, and Schenk-Hoppé (2008) (see also Luenberger, 1998, Chapter 15). A 
more demanding illustration, though with a sound empirical background, is given 
in Ziemba (2008). 

Two investment opportunities available at every point in time: 0, 1, 2.... The 
realized returns of the assets between two points in time are determined by flipping 
a fair coin (only one!). Think of the outcome (heads/tails) as a economy-wide event 
that effects the two investments in different ways. The net returns are defined in 
Table 1. 

Investors who buy and hold the first asset will see their wealth decline (expo- 
nentially fast) over time. The grow rate is negative: 

gi :— 0.5 In(1.5) + 0.5 In(0.6) ~ —0.05268 < 0 
Investment in only the second asset does not yield any better performance because 
the growth rate is negative as well and equal to g1: 
go := 0.5 In(0.45) + 0.5 In(2.0) = gi < 0 

Now consider an investor with a constant proportions strategy. Suppose the 
investor divides his wealth 50:50 between the two assets. After each change in the 
value of the two portfolio positions, the investor rebalances his portfolio by trading 
assets to restore the 50:50 split of wealth. The growth rate of his wealth is 


0.5 In(0.5 - 1.5 + 0.5 - 0.45) + 0.5 In(0.5 - 0.6 + 0.5 - 2.0) ~ +0.1185 > 0 
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Figurel Wealth dynamics of the two passive investors (both decreasing) and the active investor 
with 50:50 constant proportions strategy (increasing). Initial wealth is 1.0. Logarithmic scale on 
y-axis. 


Both passive investors (holding only one asset) will see their wealth halve in less than 
14 periods on average. In contrast, the investor following a constant proportions 
strategies with proportions 50:50 will, on average, double his wealth more often 
than every 6'^ period. A typical realization of the wealth dynamics is depicted in 
Figure 1. 

The active investor's choice of the proportions is arbitrary. Indeed, qany con- 
stant proportions investment strategy holding both assets will generate growth in ex- 
cess of either buy-and-hold strategy. The 50:50 proportions create positive growth 
while strategies with proportions close to 100:0 (or 0: 100) entail growth at negative 
rate but one that is higher than g! — g?. 

Examples, examples, examples. The above example is just one of many. Indeed 
most descriptions (and analyses) of the phenomenon of generating financial growth 
from volatility are restricted to examples involving specialized models. Since the 
deduction of general principles from examples is an unreliable route if à thorough 
mathematical analysis is feasible, it might come as no surprise that several terms 
have been coined to describe this phenomenon and many different explanations 
for its origins have been put forward. In the remainder of this chapter, we will 
present mathematical results on the growth-volatility nexus in general (discrete- 
time) models and attempt to de-mystify its origins and explanations. 


1.3 Notation 


All models discussed in this chapter share the basic setup. The definitions and 
concepts are listed and discussed here. An investor observes prices and takes actions 
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only at particular points in time. These time periods are denoted t = 0,1,2,.... 
The price dynamics are driven by random factors. These factors are described 
by a stochastic process s, with f£ = 0,+1,+2,.... The process takes values in a 
measurable space S. The realization of the random parameter s, corresponds to 
the state of the world at time t. Denote by P the probability measure induced by 
the stochastic process s+, t = 0, +1, £2,..., on the space of its paths. There are 
K > 2 assets traded in the market. Asset prices p, = (pl,..., pF) > 0, at the time 
periods ¢ = 0,1,2,... are described as a sequence of strictly positive random vectors 
with values in the K-dimensional linear space RX. We will assume that the price 
vector p, depends on the history of the process s; up to time t: 


pi = pi(s*) E R5, s'=(..., s i8) 


It is supposed, without further mentioning, that all functions of s' will be 
measurable. 

A market is called stationary if the process s, is stationary and the price vectors 
pi do not explicitly depend on t, i.e., p; = p(s*). We will further assume that the 
process s; is ergodic and that E|1np(s*)| < oo for all k — 1,..., K. 

A stochastic process £1,65,... is stationary if, for any m = 0,1,2,... and 
any measurable function $(zo,z1,..., t4), the distribution of the random vari- 
able à :— O(&,&41,-.-,&4m) (t = 0,1,...) does not depend on t. Accord- 
ing to this definition, all probabilistic characteristics of the process 6, are time- 
invariant. If £, is stationary, then for any measurable function $ for which 
E|$(& &t+1,---+&t+m)| < 00, the averages 


bite +b 
t 


(1) 


converge almost surely (a.s.) as t — oo by Birkhoff’s ergodic theorem — see, e.g., 
Billingsley (1965). If the limit of all averages of the form (1) is non-random (equal 
to a constant a.s.), then the process £, is called ergodic. In this case, the above 
limit is equal a.s. to the expectation Ed, which does not depend on t by virtue of 
stationarity of £;. 

The exponential growth rate of a stochastic process €,&2,... with & > 0 is 
defined by 


1 
lim —In& (2) 
too t 
provided the limit exists. Suppose the process € is stationary and ergodic with 


Elném < oo, then its growth rate zero because t^! In£, — 0 a.s. by Proposition 
4.1.3 in Arnold (1998). One also has 


In£ +---+m& 
i t 


E esc &) > Eln Em = const. (a.s.) (3) 


This property is frequently used below. 
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At each time period t, the investor chooses a portfolio h,(s*) = (hl(s*), ..., 
AF (s*)) with a non-negative number of units of asset k, hf(s*) > 0. Short selling of 
assets is ruled out in our model by this assumption of non-negativity. À sequence 
h,(sf), t = 0,1,2,..., of portfolios is called a trading strategy. The value of a 
portfolio A, at time t is pehi = D>, DEAF. 

A trading strategy H = (ho, hi,...) with initial wealth wo > 0 is self-financing 
if pg ho = wo and 


past) hilst) < pils*) hi a(st 1) š f= Li Dee ae. (a.s.) (4) 


The inequalities in (4) are supposed to hold almost surely with respect to the 
probability measure P. The market value of the portfolio after trade does not 
exceed the vale of the yesterday's portfolio at current prices. This budget constraint 
restricts the investor's choice. 

A balanced trading strategy H is of the form 


hist) = y(s!) -..- (sf) R(sf), £=1,2,... (5) 


) 20. ££ 20, we 
(s*)| are integrable 
| are finite. Define 


with a scalar-valued function 7(-) > 0 and a vector function A(- 
assume that ho(s?) = h(s?). It is assumed that In ^(s*) and In |h 
with respect to the measure P, i.e., E| ln y(st)| and E|In |A(s*)| 
|h| = >>, |h*| for a vector h = (h*). 

These strategies are called balanced because they are of balanced growth: (5) 
implies that all proportions between the amounts of different assets in the portfolio 


hi(sf)  A(s*) 
hk(st) — h*(st)" 


k (6) 
form stationary stochastic processes. The random growth rate of the amount of 
each asset k = 1,..., K, in the portfolio 


his") 
hf (st71) 


h^ (s!) 
hk(st-1) 


— (st 


is a stationary process. À balanced strategy is self-financing (4) if and only if 
Y (s*) p(s*) h(s*) < p(s*) h(s*7)  (a.s.) (7) 


Stationarity implies that if (7) holds for some t, it holds for all t. 

Suppose the vector of asset prices is stationary. Then every non-negative vector 
function h(s*) with E|In |h(s*)|| < œ defines a self-financing balanced strategy (5) 
by 


(8) 


since E|In p*(s*)| < oo, and relations (4) and (7) hold as equalities. 
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Balanced trading strategies have the property that the growth rate of wealth 
of any investor employing it is completely determined by the expected value of y. 
Proposition 1 in Evstigneev and Schenk-Hoppé (2002) states that for any balanced 
trading strategy (5) 


1 1 
lim — In(p(s‘) hi(s*)) = lim 2In|h,(sf)) = Elny(s°)  (a.s.) (9) 
i00 tow $ 

This result shows strict positivity of EIny(s°) = Elny(s*) implies exponential 


growth of wealth, i.e., p(s’) h4(s*) — oo a.s. exponentially fast. 


2 Asset Markets with Stationary Prices 


We first discuss the simplest model in which one can analyze generic properties of 
fixed-mix strategies: a market in which prices are stationary processes. It turns out 
that any constant proportions strategy produces growth, though in this stationary 
market the growth rate of each asset price is zero and, therefore, buy-and-hold 
strategies do not yield positive growth. This results might seem, at the first glance, 
counter-intuitive. This section is based on Evstigneev and Schenk-Hoppé (2002). 
All notations are introduced in Section 1.3. 


2.1 The model 


A constant proportions strategy in a market with K assets is characterized by a 
vector A = (A1,...,A«) in the set 


K 
a- [ose n o soci] (10) 
k=1 
To avoid pathological cases, we assume strict positivity of all components. Propor- 
tional investment rules of this kind are sometimes termed completely mixed. 
The trading strategy h; is a constant proportions strategy if 


pË hË = A pula (11) 


for allt =1,2,... and k = 1,..., K. The investor rebalances the portfolio in every 
period in time by investing the constant share A; of the wealth p; h;, into the kth 
asset. The investor’s wealth at the beginning of period t is determined by evaluating 
the portfolio hy, (bought in the previous period) at the current price system p^. 
We assume that all the coordinates of h, are non-negative: short sales are ruled 
out. 

Constant proportions strategies are self-financing because (11) implies 


pi ha = pila 


Given a vector A, the strategy is uniquely defined by its initial portfolio ho(s°). 
Recursive application of (11) determines h,(s*) for every t = 1,2,.... 
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In the study of the growth rate of wealth, the initial portfolio does not matter, 
see Evstigneev and Schenk-Hoppé (2002, pp. 568). Indeed if there are two constant 
proportions strategies h, and hy both generated by the same À € A but with different 
(non-zero) initial portfolios, then there exist c, € > 0 such that chë < hk < ThE for 
all k and all t > 1. Obviously, lim t^! In |h4| > 0 a.s. if and only if lim £^! In |h4| > 0 
a.s. (and exponential growth is at the same speed). 


2.2 The growth rate 


Constant proportions strategies generate balanced portfolios. From the representa- 
tion (5), the growth rate of the investors wealth is obtained as the expected value 
Eln7(s°). The notion of balanced portfolios goes back, though in a somewhat dif 
ferent form, to Radner (1971)’s study of stochastic generalizations of the von Neu- 
mann economic growth model. Arnold, Evstigneev, and Gundlach (1999) study 
this concept of balanced paths in a general setting. The application to financial 
problems is recent. 

The central result on the growth rate of constant proportions strategies in mar- 
kets with stationary asset prices is obtained by showing that they can be written 
as balanced strategies and that Elny(s°) > 0. To have a strictly positive growth 
rate, the price process p(s‘) must be non-degenerate. 

We impose the following assumption: 


(A) With strictly positive probability, the random variable 
p^ (s*)/p'(s-*) 
is not constant with respect to k = 1,2,..., K, i.e., there exist m and n (that might 
depend on s*) for which 
p" (st) p" (s*) 

p(s) 7 p=) 
All results in this Handbook section will be proved under this assumption, though 
it will be convenient to repeat this condition in different disguises-each best suited 
for the particular application. 


Under assumption (A), one has the following result, see Evstigneev and Schenk- 
Hoppé (2002, Theorem 1). 


(12) 


Theorem 1. Fix any X = (X,...,Àk) € A, and let wo be a strictly positive 
number. Then there exists a vector function ho(s°) > 0 such that the constant 
proportions strategy h, generated by A and ho is a balanced strategy with initial 
wealth wo, and we have 


1 1 
lim -ln pihs = lim —1n|Ahi| 2 O0. (a.s.) (13) 
tooo t tooo t 


The construction of the vector function ho(s°) > 0 is as follows. Suppose the 
proportions À € A and initial wealth wo > 0 are given. To show that hy is a balanced 
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trading strategy one needs to define à vector function A. ) > 0 and a scalar-valued 
function y(-) > 0 such that h; coincides with the strategy defined in (5) (and that 
hils?) = h(s9)). Define 


T à wo ÀK wo 
fy | —— l, Se 14 
Bis!) = (Fee ER (14) 
and 
p st) A(st 1) 
y) = moe m »» Ep kr Fe. (15) 
This is a balanced trading strategy which soincides with hy recursively defined by 
(11). 


Strictly positive growth (E ln y(s‘) > 0) holds because Jensen’s inequality (ap- 
plied to the probability measure A; on the set {1,... n implies that 


DITE » a 5 237. et (16) 


with strict inequality on a set of positive orobability by assumption (12), which 
ensures 


e 
3) 


Strict positivity of Elny(s') means that wealth tends to infinity at an expo- 
nential rate. If the non-degeneracy condition (A) is not satisfied, the market is 
essentially deterministic and all prices are constant. In this case, all strategies give 
zero growth. Indeed, (12) is a very weak requirement that is satisfied in virtually 
every market. 

The result presented in Theorem 1 holds under (sufficiently) small proportional 
transaction costs, see Evstigneev and Schenk-Hoppé (2002, Theorem 2). 


Elno(sf) > Yan EEL 


2.3 Interpretation 


Our analysis shows that constant proportions strategies provide growth in a market 
in which buy-and-hold strategies do not deliver any. The intuition behind this result 
is that constant proportions strategies ‘exploit’ the persistent fluctuation of prices. 
When keeping a fixed fraction of wealth invested in each asset, a change in prices 
leads the investor to sell those assets that are expensive relative to the other assets 
and to purchase relatively cheap assets. The stationarity of prices implies that this 
portfolio rule yields a strictly positive expected rate of growth, despite the fact that 
each asset price has growth rate zero. The notion of an asset being ‘cheap’ resp. 
‘expensive’ makes sense when prices are stationary. If the current price is below 
the expected value, then the tomorrow’s price has a higher-than-average chance 
of being larger than today’s price. This asset is cheap today. In other words, if 
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prices are stationary, there is reversion to the median (which is also true for price 
ratios). Rebalancing strategies, on average, buy low and sell high. In the model 
with stationary returns (rather than prices), accepting this interpretation would be 
falling victim to the gambler's fallacy, see Section 4. 

The result highlights the benefit of financial engineering. The implementation 
of a constant proportions strategy requires to invest actively in the available assets 
by rebalancing at discrete time periods — the growth is financially engineered by 
making use of Jensen's inequality. 


3 Fixed-Mix Strategies in Stationary Markets 


The model with constant proportions strategies can be generalized to accommodate 
the transfer of fractions between different portfolio positions. An application to 
currency markets is provided. In this model, the exchange rates fluctuate randomly 
in time as stationary stochastic processes. This aspect of our analysis is inspired, 
in particular, by recent work of Kabanov (1999) and Kabanov and Stricker (2001). 
This section follows Dempster, Evstigneev, and Schenk-Hoppé (2003). Again, the 
main result holds under small proportional transaction costs. 


3.1 The model 


A fixed-mix strategy is determined by a (non-random) matrix Ayj, k, j — 1,..., K, 
such that 
K 
ApoU. 3 t (17) 
k=1 


In each time period, this strategy prescribes the transfer of a fixed share Ay; > 0 
of the jth position of the portfolio to the kth position (k,j € {1,...,A}). The 
dynamics of the wealth invested in the portfolio positions are given by 

K T » 

piht = So ugpihia (18) 

j=l 
For any matrix Ag; satisfying (17)), a strategy H is called a fixed-mix strategy 
associated with the matriz \ = (Ax;), if (18) holds for all k, t and s*. 

Any fixed-mix strategy is self-financing: phe = pih; ,. As in the previous 
section, we are interested in the asymptotic behavior of the portfolio h, (and its 
market value) of a trader employing a fixed-mix investment rule. 

In the deterministic case, the analysis of the long-term dynamics is simple. 
The price p; = p = (pl,...,p^) > 0 is a constant vector. The corresponding 
process h, defined by (18) is deterministic and will always converge to a steady 
state. This claim follows from results on positive matrices, Kemeny and Snell (1960). 
This result, combined with the assumption of stationarity, might seem to rule out 
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unbounded growth and could lead to the (wrong) conjecture of the convergence of 
h; to a stationary distribution in the stochastic case. 

The case studied in Section 2 corresponds to the situation in which A; does not 
depend on j: 


Nee = Ap With àp > 0, and à+- +àÀg=l (19) 
because (18) reduces to (11). 


3.2 Currency markets 


The foreign exchange market is arguably the real-world example closest to a market 
in which prices are stationary. We show to formulate the wealth dynamics of a 
currency trader with a fixed-mix strategy in the above model. 

Currencies k = 1,2,..., K are traded in a frictionless market. The exchange 
rates ahd = qa" (st) > 0 fluctuate randomly in time. The real number ahd denotes 
the amount of currency k which can be purchased by selling one unit of currency 7 
at time t. We assume absence of arbitrage at each point in time t, i.e.z the exchange 
rates must satisfy 


af? m af a (20) 


for all k,m and 7. 

Assume the trader follows any fixed-mix strategy (17) by dividing the holdings 
hj , > 0 of currency j purchased at time t — 1 according to the proportions A; > 0, 
k= 1,...,K at time t. The amount Aj;A? , is exchanged into currency k. After 
execution of all these transactions, the amount of currency k obtained at time t is 
equal to 


K 
TEE 
hi = $? Ay mp hii (21) 


j=l 
The dynamics (21) can be written in the form (18). Take currency 1 as a numeraire 
and define 


pë = m” 
The relation (20) implies xf? = 1/z* and sj = 1. Therefore, af? = pl/pf. 


Multiplying (21) by 11^ and using these relations, yields the formulation (18) of the 
wealth dynamics. 


3.3 The growth rate 


The analysis of the long-term growth of an investor following a fixed-mix strategy is 
conceptually identical to that of constant proportions strategies. First, one shows 
that there is an initial portfolio such that the fixed-mix strategy leads to a bal- 
anced portfolio. Then, one proves that the growth rate is independent of the initial 
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portfolio. Finally, one shows that this growth rate is strictly positive under a non- 
degeneracy condition. The mathematical tools required in the proof however are 
very different. The case of fixed-mix strategies requires considerably more advanced 
methods. 

As in Section 2 we assume that the market is stationary and the state of the 
world s, is ergodic. The price process p, = p(s*) satisfies E|In p*(s*)| < oo for all 
k — 1,..., K. The growth rate of each asset price is zero because t^! In p, — 0 as. 
and, therefore, buy-and-hold strategies do not yield positive growth. 

We assume non-degeneracy of the price process p(s*): 


(B) The vector p(s*) = (p! (st), ..., p (s*)) of normalized prices 


is not constant a.s. with respect to st. 

This assumption says that there is no constant vector c for which p(s‘) = c 
almost surely. By virtue of stationarity of (s;), condition (B) holds for all t if 
is satisfied for some t. If s, is ergodic, the condition (B) is equivalent to the 
following requirement (Dempster, Evstigneev, and Schenk-Hoppé, 2003, pp. 271): 
With positive probability, the ratios p*(s*)/p* (sf!) are not constant with respect 
to k. This is condition (A). 

'The main result on the performance of an investor employing any fixed-mix 
strategy is as follows, Dempster, Evstigneev, and Schenk-Hoppé (2003, Theorem 1). 


Theorem 2. Let h;(sf), t > 0, be a fized-mix strategy associated with the matriz 
A = (A5) satisfying (17). For each k € {1,2,...,K}, the limit 


lim sin ht (22) 


ioo 
exists and is strictly positive almost surely. Furthermore, this limit does not depend 
on k, and 
_ 1 k O1 : 
lim - Inh? = lim -ln pihi > 0 (a.s.) (23) 
too t too t 

The result ensures that the wealth of a fixed-mix investor tends to infinity at 
an exponential rate. All portfolio positions and the investor's wealth grow at the 
same positive exponential rate. This finding generalizes the results in Section 2 to 
the general case of fixed-mix investment strategies. 

The proof of Theorem 2 rests on the observation that one can define a balanced 
fixed-mix strategy for any given matrix A satisfying (17). Recall that a trading 
strategy H is balanced if h;(s*) = (s!) ... y(s‘) h(s*) (a.s.) for t > 1. For t = 0, 
we assume ho(s°) = h(s°). The function 4(-) > 0 is scalar-valued and h(-) > 0 is a 
vector function such that E|1Iny(s*)| < oo and |h(s*)| = 1. The norm |A| of a vector 
h = (h^) is defined as >, |h*|. The assumption |h(s*)| = 1, which was not imposed 
in (5), will be satisfied automatically in the present case. 
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For balanced strategies, all the proportions between the amounts of different 
assets in the portfolio and the random growth rate of the amount of each asset in 
the portfolio are stationary stochastic processes. To show existence of à balanced 
fixed-mix strategy associated with the matrix A = (Ax), one needs to show there are 
appropriate functions y(-) and A(-). Indeed, Theorem 2 in Dempster, Evstigneev, 
and Schenk-Hoppé (2003) ensures that for each matrix A satisfying (17), there is a 
unique balanced fixed-mix strategy with |A(s*)| = 1. 

The construction can be sketched as follows. Denote by Ay = A(s*) = (Axj(s*)) 
the positive random K x K matrix defined by 

F(t 
Au = os EA (24) 
We have E| In Ax; (s*)| < oo. With this definition, the fixed-mix strategy H can be 
represented as 
hy(s") = A(s‘)A(s’*) --- A(s?)ho(s°) (25) 
Stationarity of (s+) implies that functions y(-) and Z(-) (satisfying E|Iny(s*)| < oo 
and |z(s*)| = 1) generate a balanced A-strategy if and only if 
y(s*)z(s*) = A(sf)z(sf^l) (a.s.) (26) 
The existence of a solution to this equation follows from a stochastic version of the 
Perron-Frobenius theorem, Theorem A.1 in Dempster, Evstigneev, and Schenk- 
Hoppé (2003). The problem (26) cannot be solved by applying the conventional 
Perron—Frobenius theorem because the vector Z(s*) on the left-hand side does not 
coincide with the vector Z(sí-!) on the right-hand side as the function Z(s*) is 
obtained by ‘time-shifting’ z(s'-!). 

The independence of the growth rate from the initial portfolio is proved by 
couching the one-step forward portfolio between multiples of the portfolio z(s!), 
analogous to the outline in Section 2. All have the same growth rate. 

Finally, the strict positivity of the fixed-mix strategies' growth rate is asserted 
by proving that Eln^(sf) > 0. This can be seen as follows. Denote by (y(-), Z(-)) 
the balanced strategy corresponding to the fixed mix strategy A. The relation (26) 
implies 


mt et\ ak fet) — pits) y t-1 L. 
y(s*) (sf) = Mgr 8 y edit okt (27) 
The Perron-Frobenius theorem yields existence of a vector r = (ri,...,r«) > 0 


with 


K 
Tk = A Ke {1,...,K} 


j=l 
Put Bey = rz Axjr;. One finds that 


K if et j( t1 
189 = AS E 


3 (s* Yr 
wey, BE Lh KY (28) 
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From this representation of y(s‘) one can conclude (using Jensen's inequality and 
several other arguments, see Dempster, Evstigneev and Schenk-Hoppé, 2003, Sec- 
tion 3) that E1n-y(s*) > 0 under assumption (B). 


3.4 Price processes with trend 


The concept of a stationary market, where asset prices p, fluctuate as stationary 
stochastic processes, is an idealization. The assumption that only the relative pro- 
portions pl /pË are stationary seems more realistic. Following Dempster, Evstigneev, 
and Schenk-Hoppé (2003), let us assume that the prices are of the form 


Pe = Erbe 


where p, = p(s*) is a process satisfying the assumptions we previously imposed on 
pi, and & = &(s*) > 0 is any sequence of strictly positive random variables. The 
factors € represent the dynamics of a price index. The normalized prices f; are free 
of this trend. 

'The above analysis can directly be applied to this more general price process. 
The portfolio value pA, satisfies 


1 1 1 
Pi In piht = 1 In £, + t In jh, 


The growth rate of pihs is determined by that of £, and Pehi. 

Under assumption (B), the process ih, grows exponentially fast almost surely. 
Consequently, if the price index €; grows at an exogenous exponential rate rm, 
then the investor's wealth p,h, will grow almost surely at a rate r^ strictly greater 
than r. 


4 Stock Markets with Stationary Returns 


The observation that rebalancing a portfolio by following any constant proportions 
or, more generally, any fixed-mix strategy leads to a strictly higher growth rate 
of wealth than any buy-and-hold portfolio has been confirmed in markets with 
stationary prices. After seeing the results and following the intuition that a reversion 
to the mean is a powerful source of capital growth, one might wonder about the 
case in which returns (rather than prices) are stationary processes. Indeed, as 
explained in Section 1.1, there is no general agreement that the previous results 
hold for stationary returns. This is quite sensible because the any notion of cheap 
(or expensive) assets based on the argument that reversion to the long run trend 
renders an asset cheap after a series of low returns would mean to fall victim to 
the gambler’s fallacy. This lack of a simple intuition for the wealth dynamics of 
constant proportions strategies in markets where returns (not prices) are stationary 
demands a thorough analysis of this case. How can one rest without having gained 
an understanding of this problem? The presentation follows Dempster, Evstigneev, 
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and Schenk-Hoppé (2007), which also contains a proof of the main result under 
small (proportional) transaction costs. 


4.1 The model 
Denote the (gross) return on asset k between time t — 1 and t by 


k 
Rex Pt k=1,2...,K,t>1 (29) 


Let R, := (R1,..., RK). We impose the assumption: 


(R) The vector stochastic process Ri, t = 1,2,..., is stationary and ergodic. 
The expected values E|In R£|, k =1,2,...,K, are finite. 


The typical example of a stationary ergodic process is a sequence of iid (inde- 
pendent identically distributed) random variables. To avoid misunderstandings, we 
emphasize that Brownian motion and a random walk are not stationary. 

The asset price at time t and the initial price are related as pë = pë RE - ..- 
RF, where the random sequence R^ is stationary by (R). This assumption on the 
structure of the price process is a fundamental hypothesis in finance. Moreover, it is 
quite often assumed that the random variables Rf, t = 1,2,... are independent, i.e., 
the price process p£ forms a geometric random walk. This postulate, which is much 
stronger than the hypothesis of stationarity of RP, lies at the heart of the classical 
theory of asset pricing (Black, Scholes, Merton), see e.g. Luenberger (1998). 

Birkhoff’s ergodic theorem implies 


t 
. 1 1 O1 : 
Jim Fin pi = Jim i dm RE = Eln RË (as) (30) 
n= 
for each k = 1,2,..., K. This means that the price of each asset k has almost 


surely a well-defined and finite (asymptotic, exponential) growth rate, which turns 
out to be equal a.s. to the expectation p, := E ln RE, the drift of this asset’s price. 
The drift can be positive, zero or negative. It does not depend on t in view of the 
stationarity of Ri. 

Consider an investor following a constant proportions strategy with proportions 
(10). Fix any vector A € A. Given an initial portfolio hg > 0, the (self-financing) 
trading strategy H is defined recursively by 


AK = Appehi_1/pk k-—1,2,..,K, t 21 (31) 


This definition is equivalent to (11). Our aim is to study the asymptotic behavior 
of the portfolio value V, = phy as t > co, i.e. the limit lim, 55 £^! In(V;) describing 
the (exponential) growth rate of the strategy. 
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4.2 The growth rate 
We impose the following non-degeneracy condition: 


(C) With strictly positive probability, 


pt(s*) , pals) 


pr(st)' pg!) 

This condition is a very mild assumption on the existence of volatility of the 

price process. Condition (C) does not hold if and only if the relative prices of the 

assets are constant in time (a.s.), i.e., if with probability one, the ratio p/p?” of 

the prices of any two assets k and m does not depend on t. This condition is also 
equivalent to (B), see Dempster, Evstigneev, and Schenk-Hoppé (2003, pp. 271). 

The condition (C) on asset prices has an equivalent formulation for asset returns: 


for some 1 < kım < K andt » 1 


(D) For some t > 1 (and hence, by virtue of stationarity, for each t > 1), the 
probability 


P{R* # R” for some 1 X k,m < K} 
is strictly positive. 


Equivalence can bee seen as follows. Note that p/p” Z pF ,/p?* , if and only 
if p/p; A p/p, ie. RE z R™. Denote by & the random variable that is 
equal to 1 if the event (RF #4 RT" for some 1 < k, m < K} occurs and 0 otherwise. 
Condition (C) means that P(maxi»1ó, = 1} > 0, while (D) states that, for some 
t (and hence for each t), P(ó, = 1) > 0. The latter property is equivalent to the 
former because [maxi»1 6: = 1) = UP, (6, = 1}. 

We can now state the main result on the growth of wealth of investors following 
constant proportions strategies. 


Theorem 3. Fiz any A € A. 

(i) The growth rate of the constant proportions strategy is almost surely equal to 
a constant which is strictly greater than >>, Anpk, where pp is the drift of the price 
of asset k. 

(ii) Suppose all the assets have the same drift (therefore, almost surely the same 
asymptotic growth rate), ie., Eln RE = p for each k = 1,..., K with some real 
number p. Then the growth rate of the constant proportions strategy is almost surely 
strictly greater than the growth rate of each individual asset. 


In Theorem 3, assertion (ii) immediately follows from (i). The result (ii) shows 
that any completely mixed constant proportions strategy grows at a rate strictly 
greater than p, the growth rate of each particular asset. The growth of the investor’s 
wealth is only driven by the volatility of the price process. This result seems to 
contradict conventional finance theory which usually regards the volatility of asset 
prices as an impediment to financial growth. In the present context, volatility serves 
as an endogenous source of its acceleration. Theorem 3 (ii) asserts the validity of 
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the conclusion drawn in the illustrative example discussed in Section 1.2. Indeed, 
the constant proportions strategy (.5,.5) (as well as any other completely mixed 
constant proportions strategy) yields a higher growth rate than the price of each 
asset. 

The first part of Theorem 3 places a floor under the constant proportions strat- 
egy's growth rate. When asset prices grow at different rates, there is no general 
result that constant proportions strategy grow faster than any (completely diversi- 
fied) buy-and-hold strategy. The latter grows at rate max, pk which is not strictly 
dominated by 5^, Axex. The growth-optimal constant proportions strategy however 
will always grow at least as fast as any buy-and-hold in the model. 

The proof of the above result is surprisingly simple and can be presented with 
success to an audience with knowledge of only elementary probability and calculus: 
Fix any vector À € A. The random wealth dynamics of the corresponding constant 
proportions strategy (31) are given by 


K k 
E Pe p 
V = ph = X pEhE a -X3 : "us T -y "m Akpi-iha-a 
- k=1 


K 


XO RIA 


k=1 


Via = (Rà) Vi- (32) 


for each t > 1. This implies 
= (R,A)-...- (E4A)Vo (33) 


for allt > 1. The ergodic theorem ensures 


lim one = = Jim - 2» (R,A)- Eln(R;A) (as.) (34) 


t= 


It remains to show that EIn(R,A) > S% Axpy under the condition (D). In- 
deed, Jensen's inequality and (D) ensure the relation 


K K 
In $ 5 REA > XO Ag (In RP) 
k=1 k=1 


with strictly positive probability, while the non-strict inequality holds always. 
Consequently, 


K K 
Eln(Ryd) > XO  A&E(In RE) = 3 npr (35) 
k=1 k= 


This proves Theorem 3. 
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4.8 Interpretation 


The rigorous proof of the presence of volatility-induced growth in markets where 
asset returns are stationary leaves open the question on the intuition behind this 


result. 
It seems any explanation one can give is nothing but a repetition of the mathe- 
matical reasoning behind this result: If R},..., R are the random returns of the 


K assets, then the asymptotic growth rates of these assets are En RE, while the 
asymptotic growth rate of a constant proportions strategy is E In(5 ^, Ax RE), which 
is strictly greater than $7, AE In(R*) by Jensen's inequality because the logarith- 
mic function is strictly concave. The proof in Section 2 confirms that Jensen’s 
inequality is the central tool used in the proof. Any explanation not based on this 
fact would be flawed. 

It is well-known that a linear combination of assets can produce non-linearity 
in a portfolio's characteristics. Indeed, this feature drives mean-variance portfolio 
choice, cf. Luenberger (1998). In the present model, the rebalancing of the portfolio 
so as to maintain constant proportions causes a non-linear effect in the portfolio's 
growth rate with the feature that Eln(»5, A, RY) — >>, AuEln(R$) > 0 for any 
A= (A,..., Ak) € A provided assumption (C) holds. 

Other attempts to appeal to intuition are discusses (and rejected) in the next 
section. 

An interesting problem is the question under which (general) assumptions 
Em, A RE) > 0 even if max; py < 0 (or, less restrictive, ming pk < 0). The 
example in Section 1.2 possesses this property. In that example, pı = p2 < 0, but 
Eln(\, R} 4- A H2) > 0 for the constant proportions strategy À = (.5,.5). But if A is 
close to (1,0) or (0, 1) the growth rate of wealth becomes negative as well. A partial 
answer to this ‘black swan’ question is provided in Section 5. We discuss a case in 
which the addition of a slower growing asset always enhances the growth of a con- 
stant proportions strategy and raises the growth rate over that of any buy-and-hold 
strategy. 


5 Myths and Misconceptions 


The phenomenon of volatility-induced growth is simply counter-intuitive as con- 
firmed by our test in Section 1.1. Providing à common-sense explanation for the 
mathematica] results can therefore not be a simple task. In this section, we discuss 
some of the more prominent suggestions for intuition put forward in the literature. 


5.1 Volatility pumping 


The term ‘volatility pumping’ appears to have been first used by Luenberger (1998). 
His suggestion is that constant proportions strategies force the investor to ‘buy low 
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and sell high’—the common sense dictum of stock market trading. Those assets 
whose prices have risen from the last rebalance date will be overweighted in the 
portfolio, and their holdings must be reduced to meet the required proportions and 
to be replaced in part by assets whose prices have fallen and whose holdings must 
therefore be increased. This behavior leads to growth if asset returns exhibit some 
stationarity properties. 

In Section 2.3, we argued that this explanation is correct for stationary prices. 
In the present case, accepting this explanation would mean to fall victim to the 
gambler's fallacy. This explanation is not valid because when the price follows a 
geometric random walk, the set of its values is generally unbounded and for every 
value there is a smaller as well as a larger value. Suppose returns are iid, then ‘high’ 
and ‘low’ do not have any meaning. Reversion to the long run trend (as postulated 
by the ergodic theorem) is not an explanation either: arguing that the longer the 
run of black numbers, the higher the odds of red numbers at the next spin of the 
roulette wheel is the gambler's fallacy. When returns are determined by the flip of 
a coin, an asset’s upside and downside potential does not change over time. Such 
an asset is not cheap or expensive at any point in time. 


5.2 The importance of constancy 


A more substantial lacuna in the above reasoning is that it does not reflect the 
assumption of constancy of investment proportions. This leads to the question: 
what will happen if the portfolio is rebalanced so as to sell all those assets that gain 
value and buy only those ones which lose it? (This should lead to an even higher 
growth rate — provided volatility could be ‘pumped’.) 

Assume, for example, that there are two assets, the price pl of the first (riskless) 
is always 1, and the price p? of the second (risky) follows a geometric random walk, 
so that the gross return on it can be either 2 or 1/2 with equal probabilities. Suppose 
the investor sells the second asset and invests all wealth in the first if the price p? 
goes up and performs the converse operation, betting all wealth on the risky asset, 
if p? goes down. Then the sequence A; = (Ai 4,22) of the vectors of investment 
proportions will be iid with values (0, 1) and (1,0) taken on with equal probabilities. 
Furthermore, Az; will be independent of R;. By virtue of (34), the growth rate 
of the portfolio value for this strategy is equal to Eln(R;A, 1) = [In(0- 14-1. 2)4- 
In(0-12- 1: D) In(1: 1-0: 2)- In(1-1 4- 0- 2)]/4 = 0, which is the same as the 
growth rate of each of the two assets k — 1,2. But this growth rate is strictly lower 
than that of any completely mixed constant proportions strategy. 


5.3 Energy-interpretation of volatility 


Our results highlight the importance of volatility because fixed-mix strategies do 
not produce growth beyond buy-and-hold portfolios in any of the above models 
when prices or returns are constant. It is tempting to see volatility as a source of 
energy that can be tapped to generate growth. 
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Indeed, Luenberger (1998) presents an intriguing continuous-time example which 
supports this view. (A closely related observation is made in Fernholz and Shay 
(1982).) There are K assets whose prices follow independent (but identical) geo- 
metric Brownian motions. The constant proportions strategy with equal weights 
on all assets has a higher growth rate than any asset price. The remarkable feature 
however is that increasing the number of assets K leads to a higher growth rate and 
—at the same time- to a reduction in the volatility of the return. It would seem as 
if fixed-mix strategies turn the classical return-volatility tradeoff on its head. 

'The framework for this example is the well-known continuous-time model de- 
veloped by Merton and others, in which the price processes SE, t > 0, of two 
assets k = 1,2 are supposed to be solutions to the stochastic differential equa- 
tions dSF/SF = updt + o,dWF, where the WF are independent (standard) Wiener 
processes and SE = 1. As is well-known, these equations admit explicit solutions 
SE = exp[uxt — (02/2)t-- o WF]. Given some 6 € (0, 1), the value V; of the constant 
proportions portfolio prescribing investing the proportions 0 and 1 — 60 of wealth into 
assets k — 1,2 is the solution to the equation 


dV;/V, = [61 + (1 — 0)uz]dt + 601dW] + (1 — 0)e9dW?2 


Equivalently, V; can be represented as the solution to the equation dV;/V, = pdt + 
dW, where ji :— 6441 + (1 — 0)us, 8? :— (001)? + [(1 — 0)o2]? and W; is a standard 
Wiener process. Thus, V; = exp[ft — (0?/2)t + EW], and so the growth rate and 
the volatility of the portfolio value process V, are given by ñ — (6?/2) and 8. In 
particular, if uj = u2 = u and 91 = og = c, then the growth rate and the volatility 
of V; are equal to 


u—(0?/2) and = o8? (1-0)? «oc (36) 
while for each individual asset, the growth rate and the volatility are u — (07/2) 
and c, respectively. 

Thus, in this example, the use of a constant proportions strategy prescribing 
investing in a mixture of two assets leads (due to diversification) to an increase of 
the growth rate and to a simultaneous decrease of the volatility. When looking at the 
expressions in (36), the temptation arises even to say that the volatility reduction 
is the cause of volatility-induced growth. Indeed, the growth rate u — (6?/2) is 
greater than the growth rate  — (02/2) because 8 < c. This suggests speculation 
along the following lines. Volatility is something like energy. When constructing a 
mixed portfolio, it converts into growth and therefore decreases. The greater the 
volatility reduction, the higher the growth acceleration. 


5.4 A counter-example 


Do the observations made in the previous section have any grounds in the general 
case, or do they have a justification only in the above example? To formalize this 
question and answer it, let us return to the discrete-time framework. Suppose there 
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are two assets with iid vectors of returns R, = (R1, R?). Let (£,9) := (Rt, R?) 
and assume, to avoid technicalities, that the random vector (€,7) takes on a finite 
number of values and is strictly positive. The value V, of the portfolio generated 
by a fixed-mix strategy with proportions zr and 1 — zr (0 < x < 1) is computed 
according to the formula 
t 
V: = Vi [[ eR, 0-228] t22 
n=2 

see (33). The growth rate of this process and its volatility are given, respectively, 
by the expectation Eln¢, and the standard deviation,/Var ln C, of the random 
variable In ¢,, where C := xé +(1—2)n. We know from the above analysis that the 
growth rate increases when mixing assets with the same growth rate. What can be 
said about volatility? Specifically, we consider the following question. 


Question 3. (a) Suppose Var lIn€ = Var Inn. Is it true that Varln[x£ + (1. — 2) 
Var Iné when x € (0,1)? (b) More generally, is it true that Var In[zé + (1 — z)n 
max(Var ln £, Var lnn) for x € (0,1)? 


Query (b) asks whether the logarithmic variance is a quasi-convex functional. 
Questions (a) and (b) can also be stated for volatility defined as the square root of 
logarithmic variance. They will have the same answers because the square root is a 
strictly monotone function. Positive answers to these questions would substantiate 
the above conjecture of volatility reduction — negative, refute it. 

In general (without additional assumptions on € and 7), the above questions 3(a) 
and 3(b) have negative answers. To show this, consider two iid random variables 
U and V with values 1 and a > 0 realized with equal probabilities. Consider the 
function 


f(y) = VarIn[yU + (1 — y)V], y € [0, 1] (37) 
By evaluating the first and the second derivatives of this function at y — 1/2, one 
can show the following. There exist some numbers 0 < a_ < 1 and a, > 1 such 
that the function f(y) attains its minimum at the point y — 1/2 when a belongs to 


the closed interval [a_,a,] and it has a local maximum (!) at y = 1/2 when a does 
not belong to this interval. The numbers a_ and a+ are given by 


a+ = 2ef — 1 + (264 — 1? — 1 


where a_ ~ 0.0046 and a; ~ 216.388. If a € [a_, a4], the function f(y) is convex, 
but if a ¢ [a.., a4], its graph has the shape illustrated in Figure 2. 

Fix any a for which the graph of f(y) looks like the one depicted in Figure 2. 
Consider any number yo « 1/2 which is greater than the smallest local minimum 
of f(y) and define € := yoQU + (1 — yo)V and 7 := yoV + (1 — yo)U. (U and V may 
be interpreted as "factors" on which the returns € and 7 on the two assets depend.) 
Then Varln[(£ + 7)/2] > Varlné = Var Inn, which yields a negative answer both 
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Figure 2 Graph of the function f(y) in equation (37) for a = 104. 


to (a) and (b). In this example, £ and 7 are dependent. It would be of interest to 
investigate questions (a) and (b) for general independent random variables € and 7. 
It can be shown that the answer to (b) is positive if one of the variables € and 7 is 
constant. But even in this case the function Var In|z£ + (1 — z)n] is not necessarily 
convex: it may have an inflection point in (0,1), which can be easily shown by 
examples involving two-valued random variables. 

Thus, it can happen that a fixed-mix portfolio may have a greater volatility than 
each of the assets from which it has been constructed. Consequently, the above 
conjecture and the ‘energy interpretation’ of volatility are generally not valid. It is 
interesting, however, to find additional conditions under which assertions regarding 
volatility reduction hold true. In this connection, we can assert the following fact, 
see Theorem 4 in Dempster, Evstigneev, and Schenk-Hoppé (2007). 


Theorem 4. Let U and V be independent random variables bounded above and 
below by strictly positive constants. If U is not constant, then one has that 
Var InfyU + (1— y)V] < VarlnU for all y € (0,1) sufficiently close to 1. 


Volatility can be regarded as a quantitative measure of instability of the portfolio 
value. The above result shows that small independent noise can reduce volatility. 
This result is akin to a number of known facts about noise-induced stability, e.g., 
Abbott (2001) and Mielke (2000). An analysis of links between the topic of the 
present work and results about stability under random noise might constitute an 
interesting theme for further research. 


5.5 Growth under transaction costs 


In this section, we consider an example (a binomial model) in which quantitative 
estimates for the size of the transaction costs needed for the validity of the result 
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on volatility-induced growth can be provided. Suppose that there are two assets 
k = 1,2: one riskless and one risky. The price of the former is constant and equal 
to 1. The price of the latter follows a geometric random walk. It can either jump 
up by u > 1 or down by u^! with equal probabilities. Thus, both security prices 
have growth rate zero. 

Suppose the investor pursues the constant proportions strategy prescribing to 
keep 50% of wealth in each of the securities. There are no transaction costs for 
buying and selling the riskless asset, but there is a transaction cost rate for buying 
and selling the risky asset of e c [0, 1). Assume the investor’s portfolio at time t — 1 
contains v units of cash; then the value of the risky position of the portfolio must 
be also equal to v. At time t, the riskless position of the portfolio will remain the 
same, and the value of the risky position will become either uv or u^ !v with equal 
probability. In the former case, the investor rebalances his/her portfolio by selling 
an amount of the risky asset worth A so that 


vt(l—2)A— vu— A (38) 


By selling an amount of the risky asset of value A in the current prices, the investor 
receives (1 — ¢)A, and this sum of cash is added to the riskless position of the 
portfolio. After rebalancing, the values of both portfolio positions must be equal, 
which is expressed in (38). From (38) we obtain A = v(u—1)(2—2) !. The positions 
of the new (rebalanced) portfolio, measured in terms of their current values, are 
equal to v + (1—e)A = v[14- (1—&)(2— 2e) !(u — 1)]. In the latter case (when the 
value of the risky position becomes u v), the investor buys some amount of the 
risky asset worth B, for which the amount of cash (1 + £) B is needed, so that 
v—(1*e)B—u lv B 

From this, we find —B = v(u! — 1)(2 + 2) !, and so v — (1 +£)B = v[1 + (1+ 
e)(2  e)-1(u7! - 1)J 

Thus, the portfolio value at each time f is equal to its value at time t — 1 
multiplied by the random variable £ such that P(£ = g') = P(£ = g") = 1/2, where 
g := 14+(1+e)(2+e)71(u71-1) and g” := 1--(1—6)(2—&)^1(u— 1). Consequently, 
the asymptotic growth rate of the portfolio value, Ælné = (1/2)(In g' + Ing”), is 
equal to (1/2) In (e, u), where 


“b= 1 

d | 4 (1-&) 
We have Elné > 0, i.e., the phenomenon of volatility induced growth takes place, 
if é(z,u) > 1. For e € [0,1), this inequality turns out to be equivalent to the 
following very simple relation: 0 € € < (u — 1)(u + 1) !. Thus, given u > 1, the 
asymptotic growth rate of the fixed-mix strategy under consideration is greater than 
zero if the transaction cost rate £ is less than e*(u) := (u — 1)(u + 1) !. We call 
e*(u) the threshold transaction cost rate. Volatility-induced growth takes place—in 
the present example, where the portfolio is rebalanced in every one period—when 
0 € e « e*(u). 


o(e,u) :— |l FE (1€ TE 
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The volatility ø of the risky asset under consideration (the standard deviation 
of its logarithmic return) is equal to In u. In the above considerations, we assumed 
that c—or equivalently, u—is fixed, and we examined ¢(e, u) as a function of e. Let 
us now examine ó(e, u) as a function of u when the transaction cost rate £ is fixed 
and strictly positive. For the derivative of $(e, u) with respect to u, we have 


1 r= 
+E | zx 


ig = 


If u = 1, then ¢/,(e,1) < 0. Thus, if the volatility of the risky asset is small, 
the performance of the constant proportions strategy at hand is worse than the 
performance of each individual asset. This fact refutes the possible conjecture that 
‘the higher the volatility, the higher the induced growth rate’. Further, the derivative 
¢! (e,u) is negative when u € [0,us(e)), where ux(e) := (1 —e)71/2(1 + &)72. For 
u = u, (£), the asymptotic growth rate of the constant proportions strategy at hand 
attains its minimum value. For those values of u which are greater than u.(é), 
the growth rate increases, and it tends to infinity as u — oo. Thus, although the 
assertion ‘the greater the volatility, the greater the induced growth rate’ is not valid 
always, it is valid (in the present example) under the additional assumption that 
the volatility is large enough. 


6 Conclusion 


'This chapter surveys the authors! recent theoretical work on the phenomenon of 
volatility-induced growth. We study the performance of fixed-mix strategies in 
markets where asset prices or returns are stationary ergodic. We have established 
the surprising result that these strategies generate portfolio growth rates in excess 
of the individual asset growth rates, provided some volatility is present. As a 
consequence, even if the growth rates of the individual securities all have mean 
zero, the value of a fixed-mix portfolio tends to infinity with probability one. 

By contrast with the twenty five years in which the effects of ‘volatility pumping’ 
have been investigated in the literature by example, our results are quite general. 
'They are obtained under assumptions that accommodate virtually all the empirical 
market return properties discussed in the literature. We have in this paper also 
dispelled the notion that the demonstrated acceleration of portfolio growth is simply 
a matter of ‘buying low and selling high’ or stems from some ‘volatility—energy’ link. 

'The example of Section 5.2 shows that our result depends critically on rebal- 
ancing to an arbitrary fired mix of portfolio proportions. Any such mix defines the 
relative magnitudes of individual asset returns realized from volatility effects. This 
observation and our analysis of links between growth, arbitrage, and noise-induced 
stability suggest that financial growth driven by volatility is a subtle and delicate 
phenomenon. 

A major obstacle in practical applications of our results is the fact that in- 
vestment is not over an infinite period. This point has been forcefully argued by 
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Samuelson (1979), but its validity is not as clear cut as one may be led to believe, see 
MacLean, Zhao, and Ziemba (2006), Ziemba (2008) and the discussion in Part IV in 
this volume. Other issues are the size of transaction costs in real markets. Though 
our findings do hold under sufficiently small proportional transaction costs, this 
issue can only be resolved empirically. Another intriguing question is how much 
stationarity is present in real markets. All of these questions are left for future 
research. 
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Introduction to the Good and Bad Properties of Kelly 


Multiperiod lifetime investment-savings optimization dates at least to Ramsey 
(1928). Phelps (1962) extended the model to include uncertainty while maximiz- 
ing expected utility of lifetime consumption by choosing between consumption and 
investment in a single risky asset using an additive utility function. He obtained 
explicit solutions for à constant member of the isolastic utility class. Samuelson 
(1969) and Merton (1969) in companion articles develop, following Ramsey (1928) 
and Phelps (1962), in both discrete-time and continuous time, lifetime portfolio 
selection models where the objective function is the discounted sum of concave 
functions of period by period consumption. Samuelson solves the case when there 
are interior maxima, and shows that for isoelastic period by period utility functions 
u(C) = C?-1, 6 < 1, the optimal portfolio decisions are independent of current 
wealth at each stage and independent of all consumption-savings decisions with a 
stationary optimal policy to invest a fixed proportion of current wealth in each pe- 
riod. Ziemba and Vickson (2010) review this literature and point to some queries 
regarding the validity of the interior maxima as discussed in problems in Ziemba 
and Vickson (1975, 2006). 

For log utility, 6 — 0, Samuelson (1969) showed that the optimal decision splits 
into two independent parts with the Ramsey savings problem independent of the 
lifetime portfolio selection problem. Log utility frequently splits the problem in this 
way, see Rudolf and Ziemba (2004) reprinted in part VI of this book for a similar 
splitting in a multiperiod pension model. 

Samuelson (1971) focuses on the third point of his 1979 paper that follows. He 
states the following result which has broad agreement. 

Theorem: If one acts to maximize the geometric mean at each step, and if the 
period is "sufficiently long", *almost certainly" higher terminal wealth and terminal 
utility [for a log utility investor]! will result than from any other [essentially different] 
decision rule. 

Samuelson points out the following: 


False Corollary: If maximizing the geometric mean almost certainly leads to a 
better outcome, then the expected utility of its outcomes exceeds that of any other 
rule, provided T is sufficiently large. 

The editors of this book agree with Samuelson that the Elog maximization is 
for the special utility function u(w) — logw and not for other utility functions. 


1[] indicate phrase added by the editors 
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Samuelson presents one example that illustrates this. Samuelson's criticisms are 
with the pure theory and not in conflict with any of the conclusions of this book, 
much of which is summarized in the papers in this part of the book. But he ends 
with the observation that his critical remarks 


*...do not deny that this criterion, arbitrary as it is, still avoids 

some of the even greater arbitrariness of conventional mean- 

variance analysis. Its essential defect is that it attempts to re- 

place the pair of “asymptotically sufficient parameters" E log X;, 

Variance(log X;) by the first of these alone, thereby gratuitously 
x 


ruling out arbitrary y in the family u(x) = z in favor of 
u(x) = logz". 


Luenberger (1993), reprinted in part V, provides a careful analysis of these two 


parameters. The paper on Kelly simulations in part IV by MacLean et al. also 
considers both parameters in an analysis of proportional investment strategies. 


Samuelson' paper (1979) is written in a style, with one syllable words, that is 


hard to read, but when you cut through the language he makes a few points: 


(1) 


'Those who follow the rule maximization of mean log of wealth will with higher 
and higher probability have more wealth in the long run than those that use 
an [essentially different] strategy. This is agreed and is one of the basic results 
due to Breiman and others. 

Some of those who have favorable asset returns period by period and maximize 
the expected log of wealth can lose a lot and in fact almost all of their wealth. 
'This is agreed and examples of this are in the simulation paper by MacLean, 
Thorp, Zhao and Ziemba (MTZZ) (2010) which appears in this part of the book. 
While the Kelly expected log criterion faced with favorable positive expectation 
bets will provide very high returns much of the time, in a small percent of the 
time there will be huge losses. In one example in Ziemba and Hausch (1986), 
which is redone in MTZZ with a 1496 advantage on each play you can lose 9896 
of your initial wealth. Of course, with fairly high probability, you can achieve 
a 10- or 100- fold increase in your initial wealth. This is, of course, the pure 
theory and in actual applications by hedge funds and other investors, financial 
engineering methods may be used to limit losses should they occur. 

Latane (1959, 1978) seems to have claimed (although even that is debatable) 
that the expected log maximizing strategy is in some sense better than a strat- 
egy based on some other utility function, this is not discussed or claimed by 
those involved with this book. The expected log maximizing strategy is about 
long run wealth maximizing most of the time. The only utility functions used 
are log in general for full Kelly and negative power in the special case with log 
normally distributed assets. The papers in this part of the book discuss these 
and other points further. 
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Markowitz (1976) argues that when one traces out the set of mean-variance 
(MV) efficient portfolios, one passes through a portfolio which gives approximately 
the maximum Flog X. Thorp (1969) gives an example of an asset distribution 
such that the Kelly log optimal portfolio is not on the efficient frontier, so the 
approximation is not exact. Markowitz argues that this Kelly portfolio should be 
considered the upper limit for conservative choice among E,V efficient portfolios, 
since portfolios with higher (arithmetic) mean give greater short-run variability with 
less return in the long run. À real investor might prefer a lower mean and variance 
giving up return in the long run for stability in the short run. Markowitz (1952) 
conjectured and Young and Trent (1969) proved that 


Elog(X;) ~ log E — : ix) 
for a wide class of ex post distributions of portfolio returns, where X — gross return, 
E = EX and V =Varx. 

Thorp (2008) provides an introduction to the Kelly criterion and discusses a 
number of interesting topics about it. His paper and the following ones on simula- 
tions of Kelly returns and good and bad properties show what Kelly strategies do 
and do not do. Thorp begins with how he coined the term “Fortune’s Formula” in 
1960 in his blackjack application, where he developed favorable card counting strate- 
gies that have had a huge impact across the world. He discusses various examples 
and paradoxes and the issue of bet concentration, that is, limited diversification. 
Experts with good asset situations can plunge heavily into a few assets as Warren 
Buffett has done. Thorp’s treatment and Ziemba's paper on great investors in part 
VI of this book provide clues that Buffett acts like a Kelly bettor. Buffett himself 
argues that amateurs are likely better off diversifying into index funds, after all, 
they beat about 7596 of the professional managers in net returns with essentially no 
work. Thorp cautions the reader against common errors in the application of Kelly 
betting such as the failure to consider all investments, risk tolerance misjudgments, 
forgetting how aggressive Kelly betting is in the short run, overbetting because of 
data errors, forgetting about the impact of bad scenario black swans and forgetting 
that most of the superior Kelly betting properties are for the very long run. He 
presents some results on fractional Kelly wagering that supplement the papers in 
part III of this book. He discusses the subtle concept of "essentially different" that 
confounds even some top experts. He discusses some subtleties such as strategies 
that can beat Kelly strategies. An important topic is the use of Kelly strategies 
by great investors trying to multiply their fortunes, akin to the Kelly property 
that it will get you to a wealth goal faster than any other essentially different 
strategy for asymptotically high goals. Thorp shows the relationship between frac- 
tional Kelly strategies and Markowitz mean-variance strategies with the full Kelly 
being the limiting portfolio on this surface supplementing the Markowitz paper 
discussed above. 
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Thorp also discusses and deals with Paul Samuelson’s criticisms of Kelly invest- 
ment strategies and his discussion supplements the three Samuelson papers and the 
last papers in this part. The editors of this book agree that even when you make 
a long sequence of very good bets, you can, in fact, lose and the loss can be a lot. 
But most of the time you win a lot. Thorp mentions two papers he co-authored 
in the 1970s which show that different utility functions indeed have different opti- 
mal solutions. That, as fully discussed in the Thorp and Whitley's (1972) paper 
reprinted here, deals with the false notion that Kelly proponents think that log 
optimal strategies are optimal for other concave utility functions, which notion is 
neither claimed nor true. 

Thorp’s paper concludes with a discussion of Proebsting’s paradox which shows 
you can make a series of favorable bets and be asymptotically ruined. But the 
paradox needs a series of correlated bets made at different points in time to show 
this result. So it does not contradict the property that betting full Kelly or any 
fixed fraction less than full Kelly leads to exponential growth assuming the bets are 
independent (as in Breiman's paper in part I or have weak dependence as in Cover's 
papers in part II or Thorp's paper in part VI). 

One approach to successfully model black swans is to use a scenario optimization 
stochastic programming model where you include the possibility of an extreme 
event, specifying its consequences but not actually indicating the nature of the event 
(see Geyer and Ziemba (2008) for the application to the Siemens Austria Pension 
Fund). Correlations change as the scenario sets move from normal conditions to 
volatile to crash which include the black swans (see also Ziemba (2003) for additional 
applications of this approach). This is, of course, fully applicable to Kelly betting 
and will lower the wager. 

Thorp and Whitley (1972) show, for interesting concave utilities, that different 
utility functions, namely those which differ by more than a positive linear transfor- 
mation, have distinctly different sets of optimal strategies. Moreover, for the case 
of cash and a two valued risky investment, the optimal strategies are unique. This 
means that if two utility functions have the same set of optimal strategies for each 
such investment, then these two utility functions are equivalent, that is, they are 
positive linear transformations of one another. Hence, given any concave utility dif- 
ferent from log there are investment settings in which the two utilities have different 
optimal strategies. Thorp and Whitley show that utility functions that are close to 
one another may have very different optimal solutions. Kallberg and Ziemba (1983) 
show that two utility functions with the same Rubinstein risk aversion,? under the 
assumption of normality, will have the same sets of optimal solutions. Thorp and 
Whitley show that for two continuous non-decreasing utility functions that are not 
equivalent, there is a one-period two security investment setting such that these 
two utility functions have distinct optimal strategies if either (a) the two utility 


Egu” (g'z) 


2The Rubinstein risk aversion — Egura) YO is a constant. 
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functions are either concave or convex, or (b) they have a second derivative that 
exists except possibly for a set of isolated points. 

Short term Kelly and even fractional Kelly strategies are very risky since the 
Arrow-Pratt risk aversion index is very low. MacLean, Thorp, Zhao and Ziemba 
(2010) present three simple investment situations to simulate the behavior of full 
Kelly and fractional Kelly over medium time horizons. These simulations extend 
and redo over longer horizons with more scenarios earlier studies by Bicksler and 
Thorp (1973) and Ziemba and Hausch (1986). The results show that: 


(1) The great superiority of full Kelly and close to full Kelly strategies over medium 
term horizons (40 to 700 periods) with very large gains a large fraction of 
the time. 

(2) The short term performance of Kelly and high fractional Kelly strategies is 
very risky. 

(3) There is a well defined and consistent tradeoff of growth versus security as a 
function of the bet size determined by the various strategies. 

(4) No matter how favorable the investment opportunities are or how long the finite 
horizon is, a sequence of bad scenarios can lead to poor final wealth outcomes 
with a loss of most of the investor's initial capital. 


The calculations show the final wealth distributions including possible huge gains 
and huge losses and show the fraction of the time these are likely to occur. Mean- 
variance tradeoffs are shown versus the Kelly fraction. The mean is highest for full 
Kelly and decreases for lower and higher Kelly fractions. The variance increases 
with the Kelly fraction confirming that full Kelly is very volatile. The minimum 
and maximum wealth levels are shown versus the Kelly fraction. The wealth ac- 
cumulated from the full Kelly strategy does not stochastically dominate fractional 
Kelly strategies. To lower risk, one must lower the Kelly fraction. The results 
showed that for reasonable horizons with three different experiments, that the full 
Kelly and fractional Kelly strategies are not merely long run approaches. Proper use 
over short and medium time horizons can yield good wealth goals while protecting 
against drawdowns most of the time. 

The Kelly criterion has many good long-term properties and some favorable 
short-term properties. MacLean, Thorp, and Ziemba (2010) discuss the good and 
bad properties of the Kelly criterion. The main advantage is the long run growth 
rate maximization. That is an asymptotic mathematical result. Also the medium 
term simulation results show that most of the time the Kelly bettor has a lot higher 
final wealth than that from other strategies. But for a small percentage of the 
time, despite many periods of play with very favorable investments, there can be 
huge losses. Besides this, the main disadvantage of the Kelly criterion is that it is 
very risky short term and the near zero Arrow-Pratt risk aversion index typically 
leads to wild swings in the wealth path. But the Kelly strategy has favorable short 
run competitive optimality compared to other strategies. Moreover, the extreme 
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sensitivity of Flog to errors in the mean signals that mean estimates must be 
carefully and conservatively estimated to avoid overbetting which can lead to large 
portfolio losses. Fractional Kelly strategies such as half Kelly, moderate this but 
they can still have large losses. There is a tradeoff of lower growth for more security 
as the Kelly fraction is reduced and more is invested in riskless cash. Also it may 
take a long time for the Kelly bettor to dominate an essentially different strategy 
with high probability No one disagrees with the Samuelson objection that E log 
maximizing policies are optimal for only log utility functions. Thorp and Whitley's 
(1972, 1974) result shows that this “objection” is simply a universal fact, true for 
all utilities. None the less, the long run properties indicate very high final wealth 
most of the time for Kelly and fractional Kelly bettors. 


Review of Economics and Statistics, 51, 239—246 (1969) 


31 


465 


LIFETIME PORTFOLIO SELECTION 
BY DYNAMIC STOCHASTIC PROGRAMMING 


Paul A. Samuelson * 


Introduction 


OST analyses of portfolio selection, 

whether they are of the Markowitz- 
Tobin mean-variance or of more general type, 
maximize over one period.’ I shall here formu- 
late and solve a many-period generalization, 
cotresponding to lifetime planning of consump- 
tion and investment decisions. For simplicity 
of exposition I shall confine my explicit dis- 
cussion to special and easy cases that suffice to 
illustrate the general principles involved. 

As an example of topics that can be investi- 
gated within the framework of the present 
model, consider the question of a "business- 
man risk” kind of investment. In the literature 
of finance, one often reads; “Security A should 
be avoided by widows as too risky, but is highly 
suitable as a businessman's risk." What is in- 
volved in this distinction? Many things. 

First, the “businessman” is more affluent 
than the widow; and being further removed 
from the threat of falling below some sub- 
sistence level, he has a high propensity to 
embrace variance for the sake of better yield. 

Second, he can look forward to a high salary 
in the future; and with so high a present dis- 
counted value of wealth, it is only prudent for 
him to put more into common stocks compared 
to his present tangible wealth, borrowing if 
necessary for the purpose, or accomplishing 
the same thing by selecting volatile stocks that 
widows shun. 

* Ald fram the National Sclence Foundation is gratefully 
acknowledged. Robert C. Merton has provided me with 
much stimulus; and in a companion paper in this issue of 
the Revæw he is tackling the much harder problem of 
optimal control in the presence of continuous-time sta- 
chastic variation. I owe thanks also to Stanley Fischer. 

‘See for example Harry Markowitz [5]; James Tobin 
{14], Paul A. Samuelson [10]; Paul A. Samuelson and 
Robert C. Merton [13]. See, however, James Tobin [15], 
for a pioneering treatment of the multi-period portfolio 
problem; and Jan Mossin [7] which overlaps with the 
present analysis In showing how to solve the basic dynamic 
stochastic program recursively by working backward from 
the end in the Bellman fashion, and which proves the 


theorem that portfolio proportions will be invariant anly 
if the margina] utility functian is iso-elastic, 


Third, being still in the prime of life, the 
businessman can “recoup” any present losses 
in the future. The widow or retired man near- 
ing life's end has no such “second or #® 
chance," 

Fourth (and apparently related to the last 
point), since the businessman will be investing 
for so many periods, “the law of averages will 
even out for him," and he can afford to act 
almost as if he were not subject to diminishing 
marginal utility. 

What are we to make of these arguments? 
It will be realized that the first could be purely 
a one-period argument. Arrow, Pratt, and 
others? have shown that any investor who 
faces a range of wealth in which the elasticity 
of his marginal utility schedule is great will 
have high risk tolerance; and most writers 
seem to believe that the elasticity is at its 
highest for rich—but not ultra-rich! — 
people. Since the present model has no new 
insight to offer in connection with statical risk 
tolerance, I shall ignore the first point here 
and confine almost all my attention to utility 
functions with the same relative risk aversion 
at all levels of wealth. Is it then still true that 
lifetime considerations justify the concept of 
a businessman’s risk in his prime of life? 

Point two above does justify leveraged in- 
vestment financed by borrowing against future 
earnings. But it does not really involve any 
increase in relative risk-taking once we have 
related what is at risk to the proper larger base. 
(Admittedly, if market imperfections make 
loans difficult or costly, recourse to volatile, 
“leveraged” securities may be a rational pro- 
cedure.) 

The fourth point can easily invalve the in- 
numerable fallacies connected with the “law of 
large numbers." I have commented elsewhere ? 
on the mistaken notion that multiplymg the 
same kind of risk leads to cancellation rather 


2See K. Arrow (11; J. Pratt [9]; P. A. Samuelson and 
R. C. Merton [£3]. 
1P, A, Samuelson [11]. 
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than augmentation of risk. I.e., insuring 
many ships adds to risk (but only as Va); 
hence, only by insuring more ships and by 
also subdividing those risks among more people 
is risk on each brought down (in ratio 1/\/n). 

However, before writing this paper, I had 
thought that points three and four could be 
reformulated so as to give a valid demonstra- 
tion of businessman's risk, my thought being 
that investing for each period is akin to agree- 
ing to take a 1/4'^ interest in insuring x inde- 
pendent ships. 

The present lifetime model reveals that in- 
vesting for many periods does not itself in- 
troduce extra tolerance for riskiness at early, 
or any, stages of life. 


Basic Assumptions 
The familiar Ramsey model may be used as 
à point of departure. Let an individual maxi- 
mize 
T 
f e-? U[C(t) ]dt (1) 
subject to initial wealth W, that can always be 


invested for an exogeneously-given certain | 


rate of yield r; or subject to the constraint 


C(t) = W(t) — Wet) (2) 
Tf there is no bequest at death, terminal wealth 


is zero, 
This leads to the standard calculus-of-varia- 


tions problem 
T Max f eo UW — W]at 
(Wn) *? 
This can be easily related * to a discrete- 
time formulation 


(3) 


T 
Max Do (1-9) 4 ULCI a) 
subject to 
cw- Zn G) 
lor 
or, ii 
T = _ Wiri 
Max Dao (1-pp) -* U [w. Tir ] (6) 


4See P. A. Samuelsan [12], p. 273 for an exposition of 
discrete-time analogues to calculus-of-variations models. 
Note: here I assume that consumption, C:, takes place at 
the heginning rather than at the end of the period. This 
change alters slightly the appearance of the equilibrium 
conditions, but not their substance. 
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for prescribed (We, Wr,.). Differentiating 
partially with respect to each W, in turn, we 
derive recursion conditions for a regular inte- 
rior maximum 


vl Wi-i-— ui | 
= U L7 = "n (?) 


If U is concave, salving these second-order 
difference equations with boundary conditions 
(Wo, Wos1) will suffice to give us an optimal 
lifetime consumption-investment program. 

Since there has thus far been one asset, and 
that a safe one, the time has come to introduce 
a stochastically-risky alternative asset and to 
face up to a portfolio problem. Let us postulate 
the existence, alongside of the safe asset that 
makes $1 invested in it at time £ return to you 
at the end of the period $1(1 + 7), a risk asset 
that makes $1 invested in, at time £, return to 
you after one period $1Z,, where Z, is a random 
variable subject to the probability distribution 

Prob (Zi S z} = P(z). z20 (8) 
Hence, Z,,, — 1 is the percentage “yield” of 
each outcome. The mast general probability 
distribution is admissible: ie. a probability 
density over continuous z's, or finite positive 
probabilities at discrete values of z. Also I 
shall usually assume independence between 
yields at different times so that P(zy zy ..., 
Zuj-. ., Žr) = P(z,)P(z) ets .P (2g). 

For simplicity, the reader might care to deal 
with the easy case 

Prob (Z 2 A} = 1/2 

= Prob{Z = 37t}, åà>1 
(9} 

In order that risk averters with concave utility 
should not shun this risk asset when maximiz- 
ing the expected value of their portfolio, à must 
be large enough so that the expected value of 
the risk asset exceeds that of the safe asset, i.e., 


1 on 
^tT > 1+y,or 

A»l1c«rd rM. 
Thus, for X = 1.4, the risk asset has a mean 
yield of 0.057, which is greater than a safe 
asset’s certain yield of y = .04. 

At each instant of time, what will be the 
optimal fraction, w, that you should put in 
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the risky asset, with 1 — w, going into the safe 
asset? Once these optimal portfolio fractions 
are known, the constraint of (5) must be 
written 


emm [ W: — AW —— ] . 
[Qa —9) (+r) + wide} 

(10) 
Now we use (10) instead of (4), and recogniz- 
ing the stochastic nature of our problem, 
specify that we maximize the expected value 
of total utility over time. 'This gives us the 
stochastic generalizations of (4) and (5) or 


(6) 


Max T 
(C, 1) E EXC + 5) UIC] (11) 
subject to 
a DENM 
m [r (L7) (i—m,) + m2, | 


Wa given, Wo+: prescribed. 


If there is no bequeathing of wealth at death, 
presumably Wy,, = 0. Alternatively, we could 
replace a prescribed Wr, by a final bequest 
function added ta (11), of the form B(Wr4:), 
and with Wr}, a free decision variable to be 
chosen so as to maximize (11) + B(Wz,i). 
For the most part, I shall consider Cr = Wp 
and Wa =0.- 

In (11), Æ stands for the “expected value 
of," so that, for example, 


EZ,- y ; 


In our simple case of (9), 
1 [MEE 
EZ, = FX T rx ‘ 


Equation (11) is our basic stochastic program- 
ming problem that needs to be solved simul- 
taneously for optimal saving-consumption and 
portfolio-selection decisions aver time. 

Before proceeding to solve this problem, ref- 
erence may be made to similar problems that 
seem to have been dealt with explicitly in the 
economics literature. First, there is the valu- 
able paper by Phelps on the Ramsey problem 
in which capital's yield is a prescribed random 
variable, This corresponds, in my notatian, to 
the {w,} strategy being frozen at some frac- 
tional level, there being no portfalio selection 
problem. (My analysis could be amplified to 
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consider Phelps’® wage income, and even in 
the stochastic form that he cites Martin Beck- 
mann as having analyzed.) More recently, 
Levhari and Srinivasan [4] have also treated 
the Phelps problem for T = œ by means of 
the Bellman functional equations of dynamic 
programming, and have indicated a proof that 
concavity af U is sufficient for a maximum. 
Then, there is Professor Mirrlees’ important 
work on the Ramsey problem with Harrod- 
neutral technological change as a random vari- 
able. Our problems become equivalent if I 
replace W, — W,,i[(1--f)(1—39,) + w] 
in (10) by Af (WA) — 44W, — (Wi, — W,) 
let technical change be governed by the prob- 
ability distribution 

Prob (4, € 4:112) = P(Z); 
reinterpret my W, to be Mirrlees’ per capita 
capital, K,/L,, where L, is growing at the nat- 
ural rate of growth n; and posit that 4.f(W,/ 
A,) is a homogeneous first degree, concave, neo- 
classical production function in terms of cap- 
ital and efficiency-units of labor. 

It should be remarked that I am confirming 
myself here to regular interior maxima, and 
not going into the Kuhn-Tucker inequalities 
that easily handle boundary maxima, 


Solution of the Problem 
The meaning of our basic problem 


T 
Jn(Wo) = Max E X (1—5)-*U[Ci 
(Cuts) t-0 4 

subject to C, = W, — Wiji[(1—w,) (19r) 
+ w,Z,]-^* is not easy to grasp. I act now at 
£ = 0 to select Co and wo, knowing We but not 
yet knowing how Z, will turn out. I must act 
now, knowing that one period later, knowledge 
of Z,’s outcome will be known and that W, will 
then be known. Depending upon knowledge of 
Wi, a new decision will be made for C, and 
w, Now I can only guess what that decision 
will be. 

As so often is the case in dynamic program- 
ming, it helps to begin at the end of the plan- 
ning period. This brings us to the well-known 


(11) 


* E, S. Phelps [8]. 

‘J. A. Mirrlees [6]. I have converted his treatment 
into a discrete-time version. Robert Metton's companion 
paper throws light on Mirrlees’ Brownian-motion model 
for 4. 
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one-period portfolio problem. In our terms, 
this becomes 
AQN4) = Max 
{Crin} 
+ E(14-p) U [ (Wr — Cs) 
((1—wr_i) (1+) 
Tp i Zr). (12) 
Here the expected value operator E operates 
only on the random variable of the next period 
since current consumption Cz , is known once 
we have made aur decision. Writing the second 
term as EF (Zr), this becomes 


ER (Zp) NEC ET 


U(Crz-i] 


- J Paa), by our independence 
9 
postulate. 


In the general case, at a later stage of decision 
making, say £ = T—1, knowledge will be avail- 
able of the outcomes of earlier random vari- 
ables, Z,.2,... ; since these might be relevant 
to the distribution of subsequent random vart- 
ables, conditional probabilities of the form 
P(Zsi|Zrz-a...) are thus involved. How- 
ever, in cases like the present one, where in- 
dependence of distributions is posited, condi- 
tional probabilities can be dispensed within 
favor of simple distributions, 

Note that in (12) we have substituted for 
Cr its value as given by the constraint in (11) 
or (10). 

To determine this optimum (Cy, We—1), 
we differentiate with respect ta each separately, 
to get 

0 = U [Cr] — (1+p) -1 EU’ [Cz] 

((1— wr) +7) + rZ) (27) 

0 = EU’ [Cr] (Wry — Cr-1) (Ze — 1—7) 


= fv [ (2, —Cr_x) 
0 


{0 -wr (+r) — wr-ilr}] 
(We—1—Cr_3) (27-1 -1—r) dP (Zr) 
a2”) 
Solving these simultaneously, we get our 
optimal decisions (C*p_,, wW*p—ı) as functions 
of initial wealth W; , alone. Note that if 
somehow C*,_,; were known, (12^) would by 
itself be the familiar one-period portfolio op- 
timality condition, and could trivially be re- 
written to handle any number of alternative 
assets. 
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Substituting (C*,_,, w*;_,} into the expres- 
sion to be maximized gives us J,(Wz i) ex- 
plicitly. From the equations in (12), we can, 
by standard calculus methods, relate the de- 
rivatives of U to those of /, namely, by the 
envelope relation 

Ji (Wo) = U' [Cr]. (13) 

Now that we know J [Wr], it is easy to 
determine optimal behavior one period earlier, 
namely by 

Ja(Wr-2) = Max 
(Cr-atr—2) 

+ E(14p) 7! A4((W 5 .3— Cra) 
((1— wra) (+r) + 390-22.) ]. 
(14) 

Differentiating (14) just as we did (11) 
gives the following equations like those of (12) 

Q0 = V’ [Cra] — (14-9) EJ [Wea] 

{1 wra) (12er) + tm aZ) (157) 
[t] = Ely’ [Wai] (Wyp — Cpe) (Zna — I—r) 


= Sx (Wes — Cra (toe a) (1+) 


+ WrL3Zr-2)] (Wea — Cr 2) (Zp_a — 1—r) 
dP (Zra). 


U{Cr-2] 


a5”) 
These equations, which could by (13) be re- 
lated to U'[Cz..,], can be solved simultaneous- 
ly to determine optimal (C*z 5,1*5.,) and 
JOE, 4). 
Continuing recursively in this way for T—3, 
T—4,...,2, 1, 0, we finally have our problem 
solved. The general recursive optimality equa- 
tions can be written as 
0 = U'(Cs] — (+e) Er [We] 
{(1— wo) (12-7) + woo} 
0 = EP; [Wi] (Wo — Co) (Zo — 1—r) 


0 = U'[Cr-,] — (+e)? EP[W:] 
(wei) (Lr) + Welt} 
0 = EJs [Wi Ci 1) (Zi — 1-7), 
(f= d... T- 1). 6") 
In (16), of course, the proper substitutions 
must be made and the E operators must be 
over the proper probability distributions: Solv- 
ing (167) at any stage will give the optimal 
decision rules for consumption-saving aud for 
portfolio selection, in the form 
C*, = f[(Wi; Zi, nn Za] 
= fr—(We] if the Z's are independently 
distributed 


(67) 
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we, = g( We; Zia, tt) Zo] 
= gr—[W.] if the Z's are independently 
distributed. 

Our prablem is now solved for every case 
but the important case of infinite-time horizon. 
For well-behaved cases, one can simply let 
T — œ in the above formulas. Or, as often 
happens, the infinite case may he the easiest of 
all to solve, since for it C*, = f(W,), w*, = 
£(W,), independently of time and both these 
unknown functions can be deduced as solutions 
to the following functional equations: 


O= U [/(W)] — (+p) 
Anton -1w caen 


-gW)(Z-1-ni[acrn 
— g(W)(Z - 1 — r)]aP(Z) 
o= f viw- r0) 
(14 r— gW)(z-i-7] 
[2—- 1 — ] aee (17°) 
Equation (17^), by itself with g(W) pretended 
to be known, would be equivalent to equation 
(13) of Levhari and Srinivasan (4, p. f]. In 
deriving (17’)-(17”), I have utilized the enve- 
lope relation of my (13), which is equivalent to 
Levhari and Srinivasan's equation (12) [4, 
p. 5]. 


aT) 


Bernoulli and Isoelastic Cases 


To apply our results, let us consider the in- 
teresting Bernoulli case where U — log C. This 
does not have the bounded utility that Arrow 
[1] and many writers have convinced them- 
selves is desirahle for an axiom system. Since 
I do not believe tbat Karl Menger paradoxes 
of the generalized St. Petersburg type hold any 
terrors for the economist, I have no particular 
interest in boundedness of utility and considet 
log C to be interesting and admissible. For this 
case, we have, from (12), 

Ji(W) = Max loge 

{C w} 
+ E(i+p)—* log ((W — C) 
((1—1) (+r) + wZ}],- 
= Maxlog C + (+p) -!dog (W —C] 
(C) 
+ Max f log [(1—w) (1-48) 
o 


{w 
+ wZ]aP(2)% (18) 
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Hence, equations (12) and (16’)-(16”) split 
into two independent parts and the Ramsey- 
Phelps saving problem becomes quite indepen- . 
dent of the lifetime portfolio selection problem. 
Now we have 
0 = (1/C) — (1--g) -! (W — C)-tor 
Cra = (Lt+p) (2+p) Wr- 


o= f-1-010-040 


+ wZ]-1 dP(Z) or 
wp_1 = w* independently of Wri. (19^) 
These independence results, of the Cp_, and 
Wr_1 decisions and of the dependence of wr- 
on W; ,, hold for all U functions with iso- 
elastic marginal utility. Le. (16^) and (16^) 
become decomposable conditions for all 
U(C) = 1/y C», y<l (20) 
as well as for U(C) = log C, corresponding by 
L'Hópital's rule ta y = 0. 
To see this, write (12) or (18) as 


(19°) 


A(W) = Max C, (+p) 71 (W—-C) 
(Cw) Y 
f ia- 9040 + wZ]7 dP(Z) 
= Max T + +p) 27 FO» 
(C) 7 
Max fli aen 
+ wZ] dp(Z). (21) 


Hence, (12^) or (15") or (16) becomes 


f (aas 
o 


+ wZ]v-! (Z—r—1) dP (Z) = 0, 
which defines optimal w* and gives 


Max fT) a+) + eZ] dP(Z) 

(w) e 

- f ta-waso + w*Z]v dP(Z) 

- [I + r*]%, for shart. 
Here, »* is the subjective or util-prob mean 
return of the portfolio, where diminishing mar- 
ginal utility has been taken into account.’ To 
get optimal consumption-saving, differentiate 
(21) to get the new form of (12’), (15^), or 
(16’) 


(22") 


"See Samuelson and Merton for the utll-prob concept 
[13]. 
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O = CT-! — (1--p)7! (143-*)v (9 — C) Y. In the limiting case, as y — O and we have 
(227 Bernoulli's logarithmic function, a, = (I--p), 
Solving, we have the consumption decision rule independent of r*, and all saving propensities : 
Cha =A W 23) depend on subjective time preference p only, 
TOR TUTO 4m ( being independent of technological investment 
where opportunities (except to the degree that W, 
aj = [(14-7*)Y7(14-g) ] v1, (24) will = definitely depend on those opportu- 
tend nities). 
Er cs d ps and (25) We can interpret 14-** as kind of a “risk- 
Bi AT ESO carrected" mean yield; and behavior of a long- 
where lived man depends critically on whether 
by = ay (1+) -Y > < 
+ (L4-g) 71 (L4-9*) (Lar). (26) (L+9*) = (+p), corresponding to aghk 


Thus, J,(-) is of the same elasticity form as 
U(-) was. Evaluating indeterminate forms for 
y = 0, we find J, to be of log form if U was. 
Now, by mathematical induction, it is easy 
to show that this isoelastic property must also 
hold for Ja(Wr—2), J.(W. 4),..., since, 
whenever it holds for J,(Wp_,) it is deducible 
that it holds for J,,,(Wr_,-,). Hence, at 
every stage, solving the general equations (16^) 
and (16), they decompose into two parts in 
the case af isoelastic utility. Hence, 
Theorem: 
For isoelastic marginal utility functions, U'(C) 
= C-t, y < 1, the optimal portfolio decision 
is independent of wealth at each stage and in- 
dependent of all consumption-saving decisions, 
leading to a constant w*, the solution to 


o= f'ia-wasn-c-wzp--1-)0aPU). 
0 


Then optimal consumption decisions at each 

stage are, for a no-bequest. model, of the form 
C* p = Wot l 

where one can deduce the recursion relations 
a 

1-F204 ? 


a = [(Ltp)/(a-r)»]ot-v 
Quer - f [G—w*) (4-7) 
o 
+ w* Z]vdP (Z) 


C = 


16 
a= 
i-a 
- ei «& 
= PIE OoesEah D wed 
1 aà1—1 
= Ca Nol ; axl 
1 
= " € = 1. 


(i) Far (1--r*)Y = (14-2), one plans always to 
consume at a uniform rate, dividing current We., evenly 
by remaining life, 1/(14-£). If young enough, ane saves 
on the average; in the familiar “hump saving” fashion, 
one dissaves later as the end comes sufficiently close 
into sight. 

(ii) For (1+r*)7 > (1+), a <1, and investment 
opportunities are, so to speak, so tempting compared 
ta psychological time preference that one consumes 
nothing at the heginning af a long-long life, i.e., rigor- 
ously 

Lim ¢, =0, 

17> c0 
and again hump saving must take place, For (1-4-r*)* 
> (L-+e), the perpetual lifetime problem, with T = œ, 
is divergent and ill-defined, i.e., J (W) œ as i oo. 
For y= 0 and a > 0, this case cannot arise. 

(iii) For (1+7*)Y < (1-9), m > 1, consumption 
at very early ages drops only to a limiting pasitive 
fraction (rather than zero), namely 

Lim c = 1- I/a <La >l. 

1> co 
Now whether there will be, an the average, initial hump 
saving depends upon the size of 7* — Ca, or whether 
(Lp) -v 


Cp) 


This ends the Theorem. Although many of 
the results depend upon the no-bequest as- 
sumption, Wr}, = 0, as Merton’s companion 
paper shows (p. 247, this Review) we can easily 
generalize to the cases where a bequest func- 
tion Bo (W;,1) is added to X7, (1+e)—U(C,). 
If B, is itself of isoelastic form, 

Bg = br(Wr41)/y; 


the algebra is little changed. Also, the same 
comparative statics put forward in Merton’s 
continuous-time case will be applicable here, 
eg, the Bernoulli y = 0 case is a watershed 
between cases where thrift is enhanced by risk- 
iness rather than reduced; etc. 


à, « 1 


—1 
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Since proof of the theorem is straightfor- risk-tolerance as toward the end of life! The 


ward, I skip all details except to indicate how 
the recursion relations for c, and 6, are derived, 
namely from the identities 
bipıWY/y = Ji (W) 
= Max {CY/y 
c 


tbOEOE)Y-T e) OV - C)Y/y) 
= (€t + b. (1o r*)Y 
(14g) 1(1—645) Y) Wv/y 
and the optimality condition 
02 Cv-1— b(14-r*)*(123-g)- (WB —C)v-* 
= (auaW)Y-* — BL +r*) YOU +e) 
(1— cipi) Y We}, 
which defines c, , in terms of &. 


What if we relax the assumption of isoelastic 
marginal utility functions? Then wyp; be- 
comes a function of W,_;_, (and, of course, 
of r, p, and a functional of the prabability dis- 
tribution P). Now the Phelps-Ramsey optimal 
stochastic saving decisions do interact with the 
optimal portfolio decisions, and these have to 
be arrived at by simultaneous solution of the 
nondecomposable equations (16’) and (16"). 

What if we have more than one alternative 
asset to safe cash? Then merely interpret Z, 
as a (column) vector of returns (Z?,23,,...) 
on the respective risky assets; also interpret 
w, as a (row) vector (w?,,w*,,...), interpret 
P(Z) as vector notation for 

Prob (27, & Z2, 23, x Z23,...) 

= P(2Z3,Z3,...) = P(Z), 
interpret all integrals of the form fG(Z)dP(Z) 
as multiple integrals fG(Z?,73,. . ) dP(Z?,Z*, 

..). Then (16") becomes a vector-set of 
equations, one for each component of the vectar 
Z,, and these can be solved simultaneously for 
the unknown w, vector. 

If there are many consumption items, we 
can handle the general problem by giving a 
similar vector interpretation to C,. 

Thus, the most general portfolio lifetime 
problem is handled by our equations or obvious 
extensions thereof. 


Conclusion 


We have now come full circle, Our madel 
denies the validity of the concept of business- 
man's risk; for isoelastíc marginal utilities, in 
your prime of life you have the same relative 


"chance to recoup" and tendency for the law 
of large numbers to operate in the case of re- 
peated investments is not relevant. (Note: 
if the elasticity of marginal utility, —U'(W)/ 
WU" (W), rises empirically with wealth, and if 
the capital market is imperfect as far as lending 
and borrowing against future earnings is con- 
cerned, then it seems to me to be likely 
that a doctor of age 35-50 might rationally 
have his highest consumption then, and certain- 
ly show greatest risk tolerance then — in other 
words be open to a “businessman’s risk.” But 
not in the frictionless isoelastic model!) 

As usual, one expects w* and risk tolerance 
to be higher with algebraically large y. One 
expects C, to be higher late in life when 7 and 
y* is high relative to p. As in a one-period 
model, one expects any increase in “riskiness” 
of Z,, for the same mean, to decrease w*, One 
expects a similar increase in riskiness ta lower 
or raise consumption depending upon whether 
marginal utility is greater or less than unity in 
its elasticity.? 

Our analysis enables us to dispel a fallacy 
that has been borrowed into portfolio theory 
from information theory of the Shannon type. 
Associated with independent discoveries by 
J. B. Williams [16], John Kelly [2], and H. A. 
Latané [3] is the notion that if one is invest- 
ing for many periods, the proper behavior 1s to 
maximize the geometric mean of return rather 
than the arithmetic mean. I believe this to be 
incorrect (except in the Bernoulli logarithmic 
case where it happens * ta be correct for reasons 


"See Merton's cited companion paper in this issue, for 
explicit discussian of the comparative statical shifts of (16)'s 
C*, and ue, functions as the parameters (a, v, 7, r*, and 
P(2) or P(Z,...) ot B(Wr) functions change. The same 
results hold in the discrete-and-continuous-time models. 

"See Latané [3, p. 151] for explicit recognition of this 
point. I find somewhat mystifying his footnate there which 
says, "As pointed out ta me by Prafessor L, J. Savage (in 
correspondence), not anly is the maximizatian af G [the 
geametric mean] the cule for maximum expected utility in 
connection with Bernoullis function but (in so far as cer- 
tain approximations are permissible) this same rule ís ap- 
proximately valid far all utility functions.” (Latané, p. 151, 
n.13.) The geometric mean criterion is definitely toa con- 
Servative ta maximize an isoelastic utility function car- 
respanding to pasitive 4 in my equation (20), and it is 
definitely tao daring to maximize expected utility when 
y «0. Professor Savage has informed me recently that his 
1969 position differs from the view attributed 1o him in 
1959. 
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quite distinct from the Williams-Kelly-Latané 
reasoning). 

These writers must have in mind reasoning 
that goes something like the following: If one 
maximizes for a distant goal, investing and 
reinvesting (all one's proceeds) many times on 
the way, then the probability becomes great 
that with a portfolio that maximizes the geo- 
metric mean at each stage you will end up 
with a larger terminal wealth than with any 
other decision strategy. 

This is indeed a valid consequence of the 
central limit theorem as applied to the addi- 
tive logarithms of portfolio outcomes. (Le., 
maximizing the geometric mean is the same 
thing as maximizing the arithmetic mean of 
the logarithm of outcome at each stage; if at 
each stage, we get a mean log of m** > m*, 
then after a large number of stages we will 
have m**T >> m*T, and the properly nor- 
malized probabilities will cluster around a 
higher value.) 

There is nothing wrong with the logical 
deduction from premise to tbeorem. But the 
implicit premise is faulty to begin with, as I 
have shown elsewhere in another connection 
[Samuelson, 10, p. 3]. It is a mistake ta think 
that, just because a w** decision ends up with 
almost-certain probahility to be better than a 
w* decision, this implies that w** must yield a 
better expected value of utility. Our analysis 
Íor marginal utility with elasticity differing 
from that of Bernoulli provides an effective 
counter example, if indeed a counter example 
is needed to refute a gratuitous assertion. 
Moreover, as I showed elsewhere, the ordering 
principle of selecting between two actions in 
terms of which has the greater probability of 
producing a higher result does not even possess 
the property of being transitive’? By that 
principle, we could have w*** better than w**, 
and w** better than w*, and also have w* 
better than w***, 


"See Satouelson [11]. 


P. A. Samuelson 
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Abstract 


'This edited and updated excerpt from the Ziemba and Vickson (1975, 2006) 
volume discusses various aspects of the history of optimal capital accumulation 
and the capital growth criterion, and supplements the introduction to Part IV of 
this volume. 


1 Optimal Capital Accumulation 


Phelps (1962) extended the Ramsey (1928) model of lifetime saving to include un- 
certainty. In Phelps! problem, the choice in each period was between consumption 
and investment in a single risky asset, with the objective being the maximization 
of expected utility of life time consumption. Phelps assumed that the utility func- 
tion was additive in each period's utility for consumption, and he obtained explicit 
solutions when each utility function was the same and a member of the isoelastic 
marginal utility family. The papers reprinted in Ziemba and Vickson (1975, 2006) 
(ZV) and in this volume generalize and extend the work of Phelps in several direc- 
tions: Portfolio choice is included in Samuelson (1969) and Hakansson (1970, 1971a) 
papers, and more general utility functions are treated in Neave (1971b), which is in 
ZV. These papers also constitute generalizations of the multiperiod consumption- 
investment papers of Part IV.! 

Neave (1971b) treats the multiperiod consumption-investment problem for a 
preference structure given by a sum of utility functions of consumption in different 
periods, including a utility for terminal bequests. Future utilities are discounted by 


*Reprinted, edited and updated from Ziemba and Vickson (1975, 2006). 
1See Zabel (1973) for an interesting extension of the Phelps-Samuelson-Hakansson model to include 
proportional transaction costs. 
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an "impatience" factor, compounded in time. The portfolio problem is neglected, 
with the only decisions at any time being the amount of wealth to be consumed, 
the non-negative remainder being invested in a single risky asset. In each period, 
terminal wealth consists of a certain, fixed income plus gross return on investment, 
and this terminal wealth becomes initial wealth for the following period, from which 
consumption and investment decisions are again to be made, and so forth. Neave 
is concerned with conditions under which risk-aversion properties are preserved in 
the induced utility functions for wealth. He demonstrates the important result 
that the property of non-increasing Arrow-Pratt absolute risk aversion is preserved 
under rather general conditions. For intertemporally independent risky returns, 
if the single-period utilities for consumption and the utility for terminal bequests 
exhibit non-increasing absolute risk aversion, then all the induced utility functions 
for wealth also have this property. Mathematically, this result arises from the 
fact that convex combinations of functions having decreasing absolute risk aversion 
also have this property (see Pratt, 1964, reprinted in ZV). Thus, the decreasing 
absolute risk-aversion property is preserved under the expected-value operation, 
and is further preserved under maximization. The paper specifically includes the 
possibility of boundary as well as interior solutions in the presentation. 

Neave (1971b) also considers the behavior of the Arrow-Pratt relative risk- 
aversion index of induced utility for wealth. He presents sufficient conditions under 
which the property of non-decreasing relative risk aversion is preserved. Since this 
property is not, in general, preserved under convex combinations, results can be ob- 
tained only under special conditions. Unfortunately, these conditions are not easily 
interpreted, and involve the specific values of the optimal decision variables as well 
as the properties of the utility functions. In general, however, non-decreasing rel- 
ative risk aversion obtains for sufficiently large values of wealth. In ZV's Exercise 
V-ME-6, the reader is asked to attempt to widen Neave's classes of utility functions 
that generate desirable risk-aversion properties for the induced utility functions. 
Neave also relates properties of the relative risk-aversion measures to wealth elas- 
ticity of demand for risky investment. Arrow (1971) performed a similar study in 
the context of choice between a risk-free and a risky asset, with no consumption. In 
this case, the choice is between consumption and risky investment, with no risk-free 
asset. Neave shows that if the utilities for consumption and terminal wealth in any 
period exhibit constant relative risk aversion equal to unity, then the wealth elastic- 
ity for risky investment in that period is also equal to unity. He further shows that 
the wealth elasticity of risky investment exceeds unity if and only if the relative 
risk-aversion index of expected utility for wealth is less than or equal to that of 
utility for consumption in the relevant period. 

Samuelson (1969) studies the optimal consumption-investment problem for an 
investor whose utility for consumption over time is a discounted sum of single-period 
utilities, with the latter being constant over time and exhibiting constant relative 
risk aversion (power-law functions or logarithmic functions). Samuelson assumes 
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that the investor possesses in period t an initial wealth Wt which can be consumed 
or invested between two assets. One of the assets is safe, with constant known 
rate of return r per period, while the other asset is risky, with known probability 
distribution of return Zt in period t. The Zt are assumed to be intertemporally 
independent and identically distributed. Samuelson thus generalizes Phelps’ model 
to include portfolio choice as well as consumption. 

Terminal wealth Wt + 1 in period t consists of gross return on investment, and 
becomes initial wealth governing the consumption-investment decisions in period 
t+ 1. The problem is to determine the fraction of wealth to be consumed and 
the optimal mix between riskless and risky investment in each period (given an 
initial wealth wo), so as to maximize expected utility of lifetime consumption. The 
usual backward induction procedure of dynamic programming yields à sequence 
of induced single-period utility functions Ut for terminal wealth in period t, with 
optimal consumption and investment in period t being determined by maximizing 
expected utility of consumption plus terminal wealth. Assuming interior maxima 
in each period, a recursive set of first-order conditions is derived for the optimal 
consumption-investment program. 

The explicit form of the optimal solution is derived for the special case of utility 
functions having constant relative risk aversion. Samuelson shows that the optimal 
portfolio decision is independent of time, of wealth, and of the consumption decision 
at each stage. This optimal portfolio is identical with the optimal portfolio result- 
ing from the investment of one dollar in the two assets so as to maximize expected 
utility ofgross return, using the instantaneous utility for consumption as the “utility 
for wealth” function in the optimization procedure. Furthermore, in each period, 
the optimal amount to consume is a known fraction of initial wealth in the period, 
with the optimal fraction being dependent only on the subjective discount factor 
for consumption over time, the return on the risk-free asset, the probability distri- 
bution ofrisky return, and the relative risk-aversion index. For logarithmic utilities, 
the optimal consumption fraction further simplifies to a function of the subjective 
discount factor alone, and is independent of the asset returns. 

Samuelson utilizes his solutions to analyze optimal consumption and investment 
behavior as a function of time of life. As observed above, the optimal mix of 
riskless versus risky investment is (for constant relative risk aversion) completely 
independent of time of life. Samuelson relates the optimal consumption over time to 
the asset returns and the subjective discount factor, and discusses conditions under 
which the consumer-investor will save during early life and dissave in old age. 

Throughout the paper, Samuelson assumes the validity of interior maxima in 
the optimization problems. The validity of such an assumption in the multiperiod 
portfolio problem was seriously questioned in Hakansson (1970, 1971a), and in ZV- 
Exercises including IV-CR-3, 5, and 6. When consumption decisions are included, as 
they are in the present case, the question of boundary solutions is presumably even 
more important than it was in the pure investment case. The reader may wish to 
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ponder whether these considerations can materially affect Samuelson's conclusions. 
ZV-Exercise V-CR-16 shows that a stationary policy is not optimal in a multiperiod 
mean-variance model even under strong independence assumptions. 

Hakansson (1970) studies a generalization of the problem treated by Samuelson 
(1969). Hakansson's assumptions regarding utility for consumption are identical to 
Samuelson's, except for an assumed infinite lifetime. The investment possibilities 
facing Hakansson's investor are, however, more general. The individual is assumed 
to possess a (possibly negative) initial capital position plus à non-capital steady 
income stream which is known with certainty. In each period, borrowing and lending 
at a known, time-independent rate of interest, and investment in a number of risky 
assets with known joint distribution of returns, are possible. The risky asset returns 
are assumed to be independent identically distributed (iid) over time. It is further 
assumed that a subset of the risky assets may be sold short. In each period, initial 
wealth to be consumed and invested consists of the certain income plus gross return 
on the previous period's investment. The objective is maximization of the expected 
utility of consumption over an infinite lifetime. 

Hakansson introduces two important and apparently realistic constraints on the 
risky returns and the investor's decision variables. The capital market is assumed 
to impose a so-called no-easy-money restriction on the risky asset returns. This 
restriction ensures that no mix of risky assets exists, which provides with proba- 
bility 1 a return exceeding the risk-free rate, and that no mix of long and short 
sales exists, which guarantees against loss in excess of the riskfree lending rate. 
The consumption-investment decision variables are assumed to satisfy the so-called 
solvency constraint. This requires that the investor always remain solvent with 
probability 1, i.e., that in any period t, the investor's initial capital position plus 
the capitalized value of the future income stream be non-negative with certainty. 

Hakansson uses the familiar Bellman backward induction procedure of dynamic 
programming to set up a functional equation for the optimal return function (i.e., 
the maximum expected utility of present and future consumption, given any value 
of initial wealth). Since the risky returns are iid and the lifetime infinite, the func- 
tional equation obtained is stationary over time: The optimal decision variables 
in period ¢ depend only on initial wealth (in period 1), on the single-period asset 
returns, and on the relative risk-aversion index of consumption. Hakansson's treat- 
ment of the infinite horizon dynamic programming problem is intuitively plausible, 
but not wholly satisfactory from a rigorous standpoint. (The results are correct, 
however.) First, he assumes that the optimal return function exists and satisfies 
the functional equation; in a totally rigorous development, these facts need proof. 
Next, he verifies the form of the optimal solution by direct substitution, that is, by 
showing that the assumed form of solution actually solves the functional equation. 
Here the question of uniqueness arises, and is not treated satisfactorily in the de- 
velopment. One way of treating the problem rigorously would be to deal first with 
the finite horizon problem (as Samuelson does) and subsequently go to the infinite 
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horizon limit. At each stage in the finite horizon backward induction procedure, 
one would have a functional equation similar to Hakansson’s (but rigorously jus- 
tified). As shown in the paper, the decision variable feasibility region defined by 
the solvency constraint is compact and convex, and (as in Hakansson’s proof of his 
lemma, and its corollaries) the functional equation has a finite, unique solution. By 
considering the limiting behavior of the optimal solution and optimal return func- 
tion, the given infinite horizon results could be justified. For the case of a power-law 
utility function unbounded from above, Hakansson needs a restriction on the utility 
function parameters, the optimal risky asset returns, and the subjective discount 
factor, in order to obtain a solution. In a limiting argument as outlined above, such 
a restriction would be required in order to obtain a finite return function in the 
limit of infinite horizon. 

For constant relative risk-aversion utilities, the form of the optimal solution is (i) 
a fixed fraction of initial wealth plus capitalized future income is consumed in each 
period; and (ii) a fixed proportion of initial wealth plus capitalized future income 
is invested in each of the risky assets, with the optimal asset proportions being 
independent of time, initial wealth, capitalized income stream, and impatience to 
consume. The remainder of initial wealth is invested in the risk-free asset. For 
logarithmic utilities, the optimal consumption fraction is completely independent 
of all asset returns, and is a function only of the impatience factor. Hakansson 
also considers the interesting case of constant absolute risk aversion (exponential 
utilities), and states without proof the form of the optimal policy under special 
restrictions. The conditions required for the solution and the form of the optimal 
policy are more difficult to interpret than the corresponding results for the constant 
relative risk-aversion case. The reasons for this different behavior in the two classes 
of utilities may be understood as follows. For a power-law (or logarithmic) utility, 
expected utility of terminal wealth decomposes into a product (or sum) of utility of 
initial wealth and expected utility of rate of return on one dollar of investment (since 
terminal wealth equals initial wealth times rate of return). Such a factorization does 
not obtain for exponential utilities. See ZV Exercise [V-CR-3 for a clarification of 
this point in a simple example. 

A number of conclusions regarding consumption and investment behavior are 
derived as corollaries to the optimal solution. As stated previously, the optimal 
level of consumption is a fixed fraction of initial wealth plus capitalized future in- 
come. Furthermore, the optimal amount of consumption increases with increasing 
impatience to consume, as expected. The optimal solution is such that the indi- 
vidual borrows when poor and generally lends when rich. Additional behavioral 
implications of the solutions are discussed in the paper. 

In the Samuelson-Hakansson additive utility models of this section, the individ- 
ual’s consumption versus saving and investment behavior may vary as a function 
of time of life, but attitudes toward risk do not. In these models, the same risky 
portfolios are chosen in youth and in old age. In ZV-Exercise V-ME-21, a multiplica- 
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tive utility model, based on Pye (1972), is developed. In this model, risk aversion 
in portfolio selection increases or decreases with age according as risk tolerance is 
greater or less than that of the logarithmic utility function. Logarithmic utility is à 
member of both the additive and multiplicative families. In the multiplicative fam- 
ily, risk aversion in portfolio selection depends on impatience to consume as well as 
on age. The multiplicative family thus allows greater richness in portfolio selection 
behavior over time. On the other hand, consumption behavior over time is simpler 
in the multiplicative model: Given the impatience factor and the length of remain- 
ing life, the propensity to consume is identical for all members of the multiplicative 
family, and is independent of present and future investment opportunities. Further- 
more, in the infinite horizon limit, portfolio selection behavior changes over time, 
but consumption behavior does not: A constant fraction of wealth is consumed in 
every period. 

To summarize, these papers show that an optimal lifetime consumption- 
investment program is simple to calculate for additive utilities belonging to the 
constant relative risk-aversion family. The optimal portfolio remains unchanged 
throughout life for asset returns which are iid over time. Consumption in any 
period is always proportional to wealth (including capitalized future income), the 
proportionality factor being dependent in general on impatience to consume, on the 
capital market, and possibly also on time of life. The optimal portfolio and the opti- 
mal consumption fraction are easily determined by solving a static, one-period, pure 
portfolio problem using utility for consumption as the appropriate utility for wealth 
function. Effects due to age-dependent risk aversion can be incorporated through 
a multiplicative form of utility function. In this case, the optimal consumption 
program may be calculated (in the absence of exogenous income, at least) without 
solving any "optimization" problem and without reference to the capital market. 
Optimal portfolio choice depends on time, however, and must be continually recal- 
culated in successive periods. In both the additive and multiplicative cases, a useful 
form of myopia obtains: In each period, past or future realized random outcomes 
need not be known in choosing the optimal portfolio, and past outcomes affect 
present consumption decisions only by fixing the available wealth. 

Some of the conclusions would remain unchanged under a slight weakening of 
hypotheses. In particular, the assumption of identically distributed asset returns 
over time may be dropped (while retaining independence). For additive utilities 
having constant relative risk aversion, the optimal amount of consumption will still 
be proportional to wealth. The relative risk aversion of induced utility for wealth 
will remain constant over time. The optimal asset proportions at any time £ will be 
independent of wealth, consumption, and past or future asset returns, and will de- 
pend only on the nature of the capital market at time t. Calculation of the optimal 
consumption fraction at time t will, however, generally require the solution of all 
"future" consumption and portfolio problems, and will thus be much more complex 
than in the iid case. Hakansson (1971b) studied a significant generalization of his 
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model where the individual's impatience is variable, his lifetime is stochastic, and 
the capital assets (both risk free and risky) change in time. The individual con- 
sumes, borrows (or lends), invests in risky assets, and purchases life insurance. For 
additive utilities having constant relative risk aversion, solutions are obtained for 
the optimal lifetime consumption, investment, and insurance-buying program. For 
logarithmic utilities, Miller (1974ab) analyzed the optimal consumption strategies 
over an infinite lifetime with a stochastic (possible nonstationary) income stream 
and a single, safe asset. Rentz (1971, 1972, 1973) has analyzed optimal consump- 
tion and portfolio policies with life insurance, for stochastic lifetimes and changing 
family size. Additional extensions are in Hakansson (1969), Long (1972), and Neave 
(19712). 


2 The Capital Growth Criterion 


The papers by Breiman and Thorp are concerned with Kelly’s capital growth cri- 
terion for long-term portfolio growth. The criterion states that in each period one 
should allocate funds to investments so that the expected logarithm of wealth is 
maximized. Hence, the investor behaves in a myopic fashion using the stationary 
logarithmic utility function and the current distribution of wealth. 

Kelly (1956) supposed that the maximization of the exponential rate of growth 
of wealth was a very desirable investment criterion. Mathematically the criterion 
is the limit as time t goes to infinity of the logarithm of period t’s wealth relative 
to initial wealth divided by t. If one considers Bernoulli investments in which a 
fixed fraction of each period's present wealth is invested in a specific favorable 
double-or-nothing gamble, then the criterion is easily shown to be equivalent to the 
maximization of the expected logarithm of wealth. ZV Exercise V-CR-18 illustrates 
the calculations involved and related elementary properties of Kelly's criterion. 

The desirability of Kelly's criterion was further enhanced when Breiman 
[Breiman (1961) and his 1960 article reprinted here] showed that the expected log 
strategy produced a sequence of decisions that had two additional desirable proper- 
ties. First, if in each period two investors have the same investment opportunities 
and one uses the Kelly criterion while the other uses an essentially different strategy 
(i.e., the limiting difference in expected logs is infinite), then in the limit the former 
investor will have infinite times as much wealth as the latter investor with proba- 
bility 1. Second, the expected time to reach a pre-assigned goal is asymptotically 
(as the goal increases) least with the expected log strategy. Latane (1959) provided 
an earlier intuitive justification of the first property. 

Breiman (1961) established these results for discrete intertemporally indepen- 
dent identically distributed random variables. In this case, as in the Bernoulli case, 
a stationary policy is optimal. In ZV Exercise V-ME-22, the reader is asked to 
develop these and related results for the Bernoulli case. The main technical tool 
involved in the proof is the Borel strong law of large numbers, which states that 
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with probability 1 the limiting ratio of the number of successes to trials equals the 
Bernoulli probability of a success. Breiman’s paper reprinted here proves the first 
result for positive bounded random variables. Results from the theory of martin- 
gales form the basis of proof of his conclusions. ZV Exercise V-CR-19 provides the 
reader with some elementary background on martingales. The reader is asked in 
ZV Exercise V-CR-20 to consider some questions concerning Breiman’s analysis. 

The proof of the results in Breiman’s paper involves difficult and advanced math- 
ematical tools. ZV Exercise V- ME-23 shows how similar results may be proved in 
a simple fashion using the Chebychev inequality. The crucial assumption is that 
the variance of the relative one-period gain be finite in every investment period. 
Breiman's assumption that the random returns are finite and bounded away from 
zero is sufficient but not necessary for the satisfaction of this assumption. ZV Ex- 
ercise V- CR-22 presents an extension of Breiman’s model to allow for more general 
constraint sets and value functions. The development extends his Theorem 1 and 
indicates that no strategy has higher expected return than the expected log strat- 
egy in any period. The model applies to common investment circumstances that 
include borrowing, transactions and brokerage costs, taxes, possibilities. of short 
sales, and so on. In ZV Exercise V- ME-24, the reader is invited to consider the re- 
lationship between properties 1 and 2, to attempt to verify the validity of property 
2 for Breiman's model reprinted here, and to consider the discrete asset allocation 
case. 

Thorp’s paper provides a lucid expository treatment of the Kelly criterion and 
Breiman's results. He also discusses some relationships between the max expected 
log approach and Markowitz's mean-variance approach. In addition, he points out 
some of the misconceptions concerning the Kelly criterion, the most notable being 
the fact that decisions that maximize the expected log of wealth do not necessarily 
maximize expected utility of terminal wealth for arbitrarily large time horizons. The 
basic fallacy is that points that maximize expected log of wealth do not generally 
maximize the expected utility of wealth if an investor has nonlogarithmic utility 
function; see ZV Exercise V- ME-25 for one such example and Thorp and Whitley 
(1973) for a general analysis. See Markowitz (1976) for a refutation, in a limited 
sense, of the fallacy if the investor's utility function is bounded. For some enlighten- 
ing discussion of this and other fallacies in dynamic stochastic investment analysis 
see Merton and Samuelson (1974) and ZV Exercise V- ME-26. Miller (1974b) shows 
how one can avoid the fallacy altogether by utilizing what is called the utility of 
an infinite capital sequence criterion. Under this criterion, the investor's utility is 
assumed to depend only on the wealth levels in one or more periods infinitely dis- 
tant from the present; that is, capital is accumulated for its own sake, namely its 
prestige. In this formulation, the time and expectation limit operators are reversed 
(from the conventional formulation) and hence the improper limit exchange that 
yields the fallacy does not need to be made. One disadvantage of this formulation 
is that the admissible utility functions generally are variants of the unconventional 
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form: limit infimum of the utility of period t’s wealth. The reader is invited in ZV 
Exercise V-CR-21 to consider some questions concerning Thorp’s paper. In ZV Ex- 
ercise V- ME-27, the reader is asked to determine whether good decisions obtained 
from other utility functions have a property “similar” to the expected log strategy 
when they produce infinitely more expected utility. 

For additional discussion and results concerning the Kelly criterion the reader 
may consult Aucamp (1971), Breiman (1961), Dubins and Savage (1965), Goldman 
(1974), Hakansson (1971a,b,d), Hakansson and Miller (1972), Jen (1971, 1972), 
Latane (1959, 1972), Markowitz (1976), Roll (1972), Samuelson (1971), Merton 
and Samuelson (1974), Thorp (1969), Young and Trent (1969), aud Ziemba (1972b). 
For an elementary presentation of the theory of martingales the reader may consult 
Doob (1971). More advanced material may be found in the work of Breiman (1968), 
Burrill (1972), and Chow et al. (1971). 

The highly technical Merton paper discusses the optimal consumption technical 
investment problem in continuous time. Because of its heavy reliance on stochastic 
differential equations and stochastic optimal control theory, the paper may be quite 
intimidating upon first reading. To make the paper intelligible to readers unfamiliar 
with these mathematical concepts, we deviate from our previous policy of keeping 
formalism out of the introduction: Appendix A contains a brief, intuitive introduc- 
tion to stochastic differential equations and stochastic control theory. Although not 
rigorous, the arguments are, hopefully, sufficiently plausible as to enable the reader 
to understand Merton's paper with relative ease. 

Merton assumes the investor's utility for lifetime consumption to be an integral of 
utilities for instantaneous consumption over time. This is a natural, continuous-time 
version of the additive utility assumption in discrete time. The general formalism 
of the optimal control problem is outlined first for utilities that vary arbitrarily 
in time and is specialized in later sections to the pure "impatience" case, as in the 
Samuelson and Hakansson discrete time models. Most of Merton's explicit solutions 
pertain to utility functions of the hyperbolic absolute risk aversion (HARA) class, 
that is, to the class where the reciprocal of the Arrow-Pratt absolute risk-aversion 
index is linear in wealth. This class contains as special cases utilities having constant 
absolute or relative risk aversion. The individual must decide at each point in time 
how to allocate his existing wealth to between consumption and investment in a 
number of financial assets. 'l'he risky asset returns are governed by a known Markov 
process. The objective is maximization of expected utility of lifetime consumption, 
including terminal bequests. 

In the spirit of dynamic programming, there exists at each time t a derived util- 
ity function for wealth w, namely, the maximum expected utility of future lifetime 
consumption given that wealth is w at time t. The instantaneous consumption- 
investment problem requires the maximization of utility of consumption and ter- 
minal wealth at time t, or rather, at an “infinitesimal” time later than t. For 
risky asset returns governed by a stationary log-normal process (see Appendix A), 
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terminal wealth at the end of the “infinitesimal” time span is almost deterministic. 
The expected utility maximization thus involves only means and variances of risky 
returns, as in the Samuelson paper in Part IIT, Chapter 1. The portfolio selection 
aspect of the continuous-time consumption-investment problem is thus solved ex- 
actly using mean-variance analysis. As in the Lintner and Ziemba papers in Part 
IIL, this implies that the optimal portfolio possesses the mutual fund separation 
property: All investors will choose a linear combination of two composite assets, 
which are, moreover, independent of individual preferences. If there exists a risk- 
free asset, it may be chosen as one of the mutual funds, and all portfolios reduce 
to a mix of this risk-free asset and a single composite risky asset which is the same 
for all investors. The returns on the mutual fund are also governed by a stationary 
log-normal process. Note that such a result is only true in continuous time. 

In discrete time, a non-negative linear combination of log-normally distributed 
random returns is not log-normally distributed. Through the use of à continuous- 
time formulation, the consumption-investment problem is thus reduced to a two- 
asset model with consumption. Using this reduction, Merton obtains explicit solu- 
tions for the optimal consumption-investment program for utility functions of the 
HARA class. As in the papers of Part V, the optimal amount of consumption 
and the optimal amount of investment are linear functions of wealth. Furthermore, 
when a steady noncapital income stream is included, this remains true with “wealth” 
reinterpreted as present wealth plus future income (capitalized at the risk-free rate 
of return). 

Merton discusses the effects on optimal consumption and investment due to 
the possibility of large but rare random events. Specifically, among such rare but 
significant events which he discusses are: (1) a bond which is otherwise risk free, 
may suddenly become worthless, or (2) the investor dies. The modeling of such 
processes in continuous time is achieved through the use of Poisson differential 
equations (see Appendix A). Merton derives the optimal consumption-investment 
policy for choice between a log-normally distributed common stock and a “suddenly 
worthless" bond, for utilities having constant relative risk aversion. As before, the 
optimal amount of consumption is linear in wealth. The optimal investment in 
the common stock is an increasing function ofthe probability of bond default, as 
expected. Merton also discusses the effect of uncertain lifetime, using a stationary 
Poisson process to model the arrival of the consumer's death. He shows that the 
optimal consumption-investment problem in this case (with no bequests) is identical 
to that of an infinite-lifetime model, with the consumer's utility discounted by a 
subjective rate of time preference equal to the reciprocal of life expectancy. 

There are a number of other topics covered in the paper, which we mention 
only briefly. These pertain to the important and usually neglected question of 
nonstationary random asset returns. Merton presents solutions for the optimal 
consumption-investment problem in three such nonstationary models. In the first 
model, he assumes there exists an asymptotic, deterministic price curve pet), toward 
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which the stochastic asset prices tend (in expected value) in the distant future. In 
the second model, he assumes that the risky return is given by a log-normal distri- 
bution whose mean is itself a random process of a special form. Finally, in the third 
model, he assumes that the risky return is given by a stationary log-normal] pro- 
cess whose parameters are unknown, but must be estimated from past behavior of 
the random variable. Explicit solutions for these three models are obtained in a 
two-asset setting, for utility functions having constant absolute risk aversion. 
ZV Exercise V-ME-28 considers a deterministic continuous-time consumption- 
investment model. If utility is intertemporally additive, then it is possible to 
develop explicit optimality criteria that yield an algorithm for constructing an 
optimal policy. 
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The “Fallacy” of Maximizing the Geometric Mean in Long Sequences of 


Investing or Gambling 


(maximum geometric mean strategy/uniform strategies/asymptotically suficient parameters) 


PAUL A. SAMUELSON 


Department of Economica, Massachusetts Institute of Technology, Cambridge, Mass. 02139 


ABSTRACT Because the outcomes of repeated invest- 
ments or gambles involve products of variables, authori- 
ties have repeatedly been tempted to the belief that, in a 
long sequence, maximization of the expected value of 
terminal utility can be achieved or well-approximated by 
a strategy of maximizing at each stage the geometric 
mean of outcome (or its equivalent, the expected value of 
the logarithm of principal plus return). The law of large 
numbers or of the central limit theorem as applied to the 
logs can validate the conclusion that a maximum-geo- 
metric-mean strategy does indeed make it “virtually 
certain" that, in a “‘long’’ sequence, one will end with a 
higher terminal wealth and utility. However, this does not 
imply the falee corollary that the geometric-mean strategy 
is optimal for any finite number of periode, however long, 
or that it becomes asymptotically a good approximation. 
As a trivial counter-example, it is shown that for utility 
proportional to x*/y, whenever y = 0, the geometric 
strategy is suboptimal for all T and never a good approxi- 
mation. For utility bounded above, as when y < 0, the 
same conclusion holds. If utility is bounded above and 
finite at zero wealth, no uniform strategy can be optimal, 
even though it can be that the best uniform strategy will 
be that of the maximum geometric mean. However, 
asymptotically the same level of utility can be reached by 
an infinity of nearby uniform strategies. The true opti- 
mum in the bounded case involves nonuniform strategies, 
usually being more risky than the geometric-mean maxi- 
mizer's strategy at low wealths and less risky at high 
wealths. The novel criterion of maximizing the expected 
average compound return, which asymptotically leads to 
maximizing of geometric mean, is shown to be arbitrary. 


BACKGROUND 


Suppose one begins with initial wealth, Xo, and after a series 
of decisions one is left with terminal wealth, X r, subject to a 
conditional probability distribution 


prob {Xr € Xl = A} = Pr(X,A) a) 


Then an “expected utility” maximizer, by definition, will 
choose his decisions to 


u(X)Pr(dX,A), (2) 


max E{u(Xr)} = max f 
where E stands for the "expected value” and where u(z) is a 
specified utility function that is unique except for arbitrary 
scale and origin parameters, b and a, ina + bu(z), b > 0. 

Ina portfolio, or gambling situation, at each period one can 
make investments or bets proportional to wealth at the begin- 
ning of that period, X,, so that the outcome of wealth for the 
next period is the random variable 


2493 


^ 
Xm = X,[w¥it ... + wna), Dw, = à (3) 
Y 
where the w's are the proportions decided upon for investment 
in the different securities (or gambling games). The returns per 
dollar invested in each alternative, respectively, are subject 
to known probability distributions. 


prob (Yi X yu. Ys S ya] = Für... ud 


where for simplicity the vector outcomes at any ¢ are assumed 
eacb to be independent of outcomes at any other time periods, 
and to remain the same distribution over all time periods. 
Usually Y, is restricted to being nonnegative to avoid bank- 
ruptcy. A complete decision involves selecting over the inter- 
val of time ¢ = 1,2,...,T, all the vectors [w,(t)], which will 
be nonnegative if short-selling and bankruptcy are ruled 
out. 

In particular, if the same strategy is followed at all times, 
so that w,(f) = wy, the variables become 


(4) 


X% = Xon, X: = Xia, = Xux 
X, 


and all z; are independently distributed according to the same 
distribution, which we may write as 


prob Íz, € z} = nis). 


(5) 
= X at, = Xota... x, 


(6) 


Of course, H(z) will depend on the w strategy chosen, and is 
short for H(z; wi. ..,w4) in this particular case, and on the 
assumed F(yi,. . .,y4) function. 

1t can be easily shown that 


prob (X, S XIX = Z} = P,(X,Z) 
PAX ,Z) = P(X/Z,) (1) 
= P,(X/2) for short 
and 
P,(z) = Ba) 
Px) = f^ Pile/saPito) © 


Pr(z) = f 7 


Pri(z/s)aPi(s) 


w 
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EXACT SOLUTIONS 
For general u(.) functions, a different decision-vector [wl(¢) ] is 
called for at each intermediate time period, ? = 1,2,...,T — 1. 
This is tedious, but both inevitable and feasible. It is well 
known that for one, and only one, family of utility functions, 
namely 


u(z; y) t/y o y #0 


(8) 
= log z y70 


it is optimal to use the same repeated strategy. For this case 
the common optimal strategy is that of a T = 1 period prob- 
lem, namely 


XQ max Ex'/y = Xo’ max f, (z*/y)dPA (T; wi,. . Wa) 
we wi 
(10) 


The log x case is included in this formulation, since as y — 0, 
the indeterminate form is easily evaluated. 


PROPOSED CRITERION 
Often in probability problems, as the number of variables 
becomes large, T — œ. In this case, certain asymptotic 
simplifications become feasible. Repeatedly, authorities (1-3) 
have proposed a drastic simplification of the decision problem 
whenever T is large. 


Rule. Act in each period to maximize the geometric mean 
or the expected value of log zs. 

The plausibility of such a procedure comes from recognition 
of the following valid asymptotic result. 


Theorem. lí one acts to maximize the geometric mean at 
every step, if the period is "sufficiently long," “almost cer- 
tainly" higher termina! wealth and terminal utility will result 
than from any other decision rule. 

To prove this obvious truth, one need only apply the central- 
limit theorem, or even the weaker law of large numbers, to 
the sum of independent variables 


T 
log Xr = log Xo + 2, log z: (11) 
1 


We may note the following fact about Pr(z; max g.m.) = 
Qr*(z) and Pz(z; other rule) = Qz(z). 


Qr*(z) < Qr(z) for T > M(z) (12) 


The crucial point is that M is a function of r that is un- 
bounded in z. 

From this indisputable fact, it is tempting to believe in the 
truth of the following false corollary: 


False Corollary. If maximizing the geometric mean almost 
certainly leads to a better outcome, then the expected utility 
of ita outcomes exceeds that of any other rule, provided T is 
sufficiently large. 

The temptation to error is compounded by the considera- 
tion that both distributions approach asymptotically log 
normal distributions. 
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Since &* > u by hypothesis, it follows at once that, for T 
large enough Lr*(z) can be made smaller than Lr(z) for any 
z. From this it is thought, apparently, that one can validly 


deduce that IN u(z)dQr* > f u(z)dQr(z). 
0 


0 
FALSITY OF COROLLARY 


A single example can show that the needed corollary is not 
generally valid. Suppose y = 1, and one acts, in the fashion 
recommended by Pascal, to maximize expected money wealth 
itself. Let the gambler-investor face a choice between invest- 
ing completely in safe cash, Yı, or completely in a "security" 
that yields for each dollar invested, $2.70 with probability 
1/2 or only $0.30 with probability 1/2. To maximize the 
geometric mean, one must stick only to cash, since {(2.7)(.3) ] * 
= .9 < 1. But, Pascal will always put all his wealth into the 
risky gamble. 

Isn't he a fool? If he wins and loses an equal number of 
bets, and in the long run that will be his median position, 
he ends up with 


(2.7)7(.8)? = (97 + 0 as T— œ (14) 


"Almost certainly," he will be “virtually ruined" in a "long 
enough" sequence of play. 

No, Pascal is not a fool according to his criterion. In those 
rare long sequences (and remember all sequences are finite, 
albeit very large) when he does experience relatively many 
wins, he makes more than enough to compensate him, accord- 
ing to the max EX p criterion, for the more frequent times when 
he is ruined. 

Actually, 


T 
E{Xr} = a{x.[T= 
Y 
- X lI Ex], for independent variates (15) 
i 


- Xj Exi]? 
=Xol.57 > Xy foral T 


But, you may say, Pasca) is foolish to court ruin just for a 
large money gain. A dollar he wins is surely worth less in 
utility than the dollar be loses. Very well, as with eighteenth- 
century writers on the St. Petersburg Paradox, let us assume 
8 concave utility-function, u(z), with u"(z) « 0. Let us now 
test the false corollary for ulz) = z*/y, 1 > y »* 0. Let tbe 
decision according to the geometric mean rule lead to 


E[log zi*] > £ log zı (16) 
But let the alternative decision produce 
E|lz/y] > Efry} (17) 


Then, as before, except for the changed value of the y ex- 
ponent 


E|Xr/y] > E{Xr*/y}, (18) 


. 1 log x i 
lim Qz*(z) = Jazo TUA f {on -3 le — eT) ds = Lz*(z) 


T—-- -w 


im Qr(z) = VIr T a 


(13) 


n log z 1 
f few 73 ls — aT Wer) ds = L(x) 
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since 


T 
E{X7/y} = Xv JI Elze) 
Y 


-OXs[E(n/yH* (19) 
»X"E(nt/s]? = EXzU/v 


This strong inequality holds for all T, however large. Thus, 
the false corollary is seen to be invalid in general, even for 
concave utilities. 


BOUNDED UTILITIES 


Most geometric-mean maximizers are convinced by this 
reasoning (5). But not all. Thus, Markowitz, in the new 1971 
preface to the reissue of his classic work on portfolio analysis 
(4), says that “boundedness” of the utility function will save 
the geometric-mean rule. For y < 0, u = z?*/* will be bounded 
from above, and a run of favorable gains will not bring the 
decision maker a utility gain of more than a finite amount. 
But, as the case y = —1 shows, that cannot save the false 
corollary. For in all cases when y < Ñ; the false rule leads to 
over-riskiness, just as for y > 0 it leads to under-riskines. 
Those few times, and they will happen for all T, however large, 
with a positive (albeit diminishing probability), whenever the 
false rule brings you closer to zero terminal wealth (or ruin), 
the unboundedness of u as z — 0 and u — — © puts a pro- 
hibitive penalty against the false rule. ` 

What about the case where utility is bounded above and is 
finite at Xr = 0 or ruin? It is easy to show that no uniform 
decision rule can be optimal for such a case, where 

-9«u0 S u(X) SM< œ (20) 

But suppose we choose among all suboptimal uniform 
strategies that have the property of “limited liability,” so 
that each z, is confined to the range of nonnegative numbers. 
Then the following theorem, which seems to contain the germ 
of truth the geometric mean maximizers are groping for, is 
valid. 

Theorem. For u(r) bounded above and below in the range 
of nonnegative numbers, the uniform rule of maximizing the 
geometric mean E{log X7*}, will asymptotically outperform 
any other uniform strategy’s result, E{u(X'7)}, in the follow- 
ing sense 

Bjur] > E[uXp], T» TO) G1) 


The limited worth of this theorem is weakened a bit further 
by the consideration that an infinite number of blends of a 
suboptimal strategy with the geometric mean rule, of the form 
vz? + (i — v)z, v sufficiently small and positive, will do 
negligibly worse as T — o. 

Thus for all Xo, 

lim E[u(Xz*)] = u(0) or M = supu(X), (22) 
T—9 x 


depending upon whether 
E(log zi*) € 00r» 0 
For a range of v's, the same utility level will be approached as 
T o. 
OPTIMAL NONUNIFORM STRATEGY 


To illustrate that no uniform strategy can be optimal when 
utility is bounded, consider the well-known case of u = —e—*, 
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Suppose X, can be invested at each stage into Z, dollars of a 
risky Y; such that E[Yi] > 1, or into X, — Z, dollars of safe 
cash. Under "limited liability," if the lowest Y; could get 
were r, Z, could not exceed X ,/(1 — r). 

First, disregard limited liability and permit negative Xr. 
Then a well-known result of Pfansangl (6) shows that at every 
stage, it is optimal to set Z, = Z*, where 


max f. — exp[ — b(X —Z) — »Y;Z]dFi(Ys) 


= - [exp - UX — Z*)] f T exp — QYQZ*MFAY) (3) 


Then optimally 
Xr = Xo — TZ* + (Y + Yi? +... + YZ” (24) 


As T — œ, X, approaches a normal distribution, not a log- 
normal distribution as in the case of uniform strategies. How- 
ever, the utibty level itself will approach a log-normal dis- 
tribution. 


T 
U, = exp — b[Xs — TZ*] II (exp — bY,Z*) 
! Q5 


lim E{U;} =0= M 
T— w 


For this option of cash or a risky asset with positive return, it 
will necessarily be the case that 


max E{loglo + (1 — w)Yi] > 1 


lim E{U;*} 20 M 

To 
Nonetheless, at every wealth level, above or below a critical 
number, the optimal policy will generally differ from that of 
maximum geometric mean. 

When we reintroduce limited liability in the problem and 
never permit an investor to take a position that could leave 
him bankrupt with negative wealth, the optimal strategy at 
each X , will be to put min (X ,,Z*) into the risky asset and an 
exact solution becomes more tedious. But clearly the optimal 
Strategy is nonuniform and will outperform any and all uni- 


‘form strategies, including that of the geometric-mean maxi- 


muzer. 
MAXIMIZING EXPECTED AVERAGE 
COMPOUND GROWTH 

Hakansson (7) has presented an analysis with a bearing on 
geometric-mean maximization by the long-run investor. De- 
fining the rate of return in any period as z, and the average 
compound rate of return as (1:22... 27/7), one can propose as 
a criterion of portfolio selection maximising the expected 
value of this magnitude. After the conventional scaling-factor 
T is introduced, this gives 


max TX Elaziz: ... 2r) T = Xo| max E[z,/7/(/7)]]7 (26) 


This problem we have already met in (19) for y = 1/T. For 
finite T, this new criterion leads to slightly more risk-tak- 
ing than does geometric-mean or expected-logarithm raaximiz- 
ing: for T = 1, it leads to Pascal’s maximizing of expected 
money gain; for T — 2, it leads to the eighteenth century 
square-root utility function proposed by Cramer to resolve the 
St. Petersburg Paradox. 
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However, as T — © and y = 1/T — 0, 


lim z?/4 = log z (27) 
0 
and we asymptotically approach the geometric-mean maxi- 
mizing. 

Thus, one wedded psychologically to a utility function 
—z^! will find the new criterion leads to rash investing. 
Example: modify the numerical example of (14) above so 
that 2.7 and .3 are replaced by 2.4 and .6. Because the geo- 
metric mean of these numbers equals 1.2 > 1, none in cash 
is better for such an investor than ie all in cash. But, since 
the harmonic means of these numbers equals .96 « 1, our 
hypothesized investor would prefer to satisfy his own pyscho- 
logical tastes and choose to invest all in cash rather than 
none in cash—no matter how great T is and in the full recog- 
nition that he is violating the new criterion. The few times 
that following that criterion leads him to comparative losses 
are important enough in his eyes to scare him off from use of 
that criterion. 

Indeed, if commissions were literally zero, then no matter 
how short were T in years, the number of transaction periods 
would become indefinitely large: Hence, with y = 0, the novel 
criterion would lead to geometric-mean maximization, not just 
asymptotically for long-lived investors, but for any T. To be 
sure, as one shortens the time period between transactions, 
my assertion of independence of probabilities between periods 
might become unrealistic. This opens a Pandora’s Box of 
difficulties. Fortuitously, the utility function log z is the one 
case that is least complicated to handle when probabilities are 
intertemporally dependent. This makes log z an attractive 
candidate for Santa Claus examples in textbooks, but will not 
endear it to anyone whose psychological tastes deviate signifi- 
cantly from log z. (For what it is worth, I may mention that 
I do not fall into that category, but that does not affect the 
logic of the problem.) ` 

These remarks critical of the criterion of maximum expected 
average compound growth do not deny that this criterion, 
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arbitrary as it is, still avoids some of the even greater arbitrari- 
ness of conventional mean-variance analysis. Its essential 
defect is that it attempts to replace the pair of “asymptotically 
sufficient parameters” (E{log z,], Variance{log z:}] by the 
first of these alone, thereby gratuitously ruling out arbitrary y 
in the family u(z) = z"/« in favor of u(z) = log z. This 
diagnosis can be substantiated by the valuable discussion in 
the cited Hakansson p&per of the efficiency properties of the 
pair [E { average-compound-rcturn], Variance { average—com- 
pound-return } ], which are asymptotically surrogates for the 
above sufficient parameters. 
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WHY WE SHOULD NOT MAKE MEAN LOG OF WEALTH BIG 
THOUGH YEARS TO ACT ARE LONG 


Paul A. SAMUELSON* 
Massachusetts Institute of Technology, Cambridge, MA 02139, USA 


He who acts in N plays to make his mean log of wealth as big 
as it can be made will, with odds that go to one as N soars, 
beat me who acts to meet my own tastes for risk. 


Who doubts that? What we do doubt! is that it should make us change our 
views on gains and losses — should taint our tastes for risk. 

To be clear is to be found out. Know that life is not a game with net stake 
of one when you beat your twin, and with net stake of nought when you do 
not. A win of ten is not the same as a win of two. Nor is a loss of two the 
same as a loss of three. How much you win by counts. How much you lose by 
counts. 

As soon as we see this clear truth, we are back to our own tastes for risk. 
Mean log of wealth then bores those of us with tastes for risk not real near 
to one odd (thin!) point on the line of all the tastes for risk — and this holds 
for each N, with N as big as you like. 

Why then do some still think they should want to make mean log of | 
wealth big? They nod. They feel ‘That way I must end up with more. More 
sure beats less'. But they err. What they do not see is this: 


When you lose — and you sure can lose ~ with N large, you can 
lose real big. Q.E.D. 


Long since, in Samuelson (1963, p. 4), I had to prove what is not hard to 
grasp: 


If it does not pay to do an act once, it will not pay to do it 
twice, thrice,..., or at all. 


*I owe thanks for aid to NSF Grant 750-4053-A01-SOC. 
ICf. the views of Ophir (1978, 1979) and Latané (1978). 
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Can we bring the dead rule back to life by what it tells us about the mean 
growth rate? No. Here's why not. 


For large N, when you act at each turn to make the mean of 
log of wealth big, you will make your mean growth rate big in 
this sense: 


As N grows large, the odds go to one that my mean growth 
rate (per turn) will end up real close to a rate less than that 
which you (with big odds) end up close to. 


Who doubts that truth? But it does not rule out this clear truth. 


For N as large as one likes, your growth rate can well (and at 
times must) turn out to be less than mine — and turn out so 
much less that my tastes for risk will force me to shun your 
mode of play. To make N large will not (say it again, not) 
make me change my mind so as to tempt me to your mode 
of play. Q.E.D. 


No doubt some will say: ‘I’m not sure of my taste for risk. I lack a rule to 
act on. So I grasp at one that at least ends doubt: better to act to make the 
odds big that I win than to be left in doubt?' Not so. There is more than one 
rule to end doubt. Why pick on one odd one? Why not try to come a bit 
more close to that which is not clear but which you ought to try to make 
more clear? 


No need to say more.? I’ve made my point.? And, save for the last word, 
have done so in prose of but one syllable. 


?We should spare the dead. When a chap has said he now doubts that *... this same rule [of 
max of mean of log of wealth] is approximately valid for all utility functions [...insofar as 
certain approximations are permissible...]..., we should take him at his word and free his 
shade of all guilt. For a live friend to still say: 'given the qualifications it seems to me that this 
[above quoted] statement of Savage is very difficult to refute’, as the French say, gives one to 
cry. Those key words are false when we make them clear. When we don't make them clear, there 
is nought to talk about (to say Yes or No to). As the French say too, it is a case of put up or... 
For more on this, see Latané (1959, p. 151; 1978, p. 397) and Samuelson (1959, p. 245). 


3Let me tie down one loose end. Look at this Odd Rule: 
= From Acts A(N) and B(N) pick Act A(N) if, for their two end wealths, W, (N) 
and W,(N), with odds of more than one half (or more than 1 —ey, O<ey <1), 
W,(N)> W,(N). 
This Odd Rule is odd since it can put you in this Fix: . 
You may well pick A(N) from A(N) and B(N), and pick B(N) from B(N) and 
C(N), and yet still pick C(N) from A(N) and C(N). 
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This Fix can come for all N, as large as we choose to make N. It can do so though the truth of 
Thorp (1971, p. 603) holds, 


plim W,(N) - W, > W,-plim W,(N) ^ f,» We = plim W(W), 
No 


N= œ Noo 


cules out 


plim W.(N) > plim W,(N). 


No N 


That is so since W, > W, and Wg > We means W, > Wg. l 
But I'd not said: ‘When you act to make mean of log of wealth large, you could get in the 
Fix'. To see why you can't, we need only note that 


mean of log of W,(N)=L,(N)>L,(N)=mean of log of W,(N), 
and 

mean of log of W&(N) - L&(N)- Lc(N) 2 mean of log of W.(N), 
rules out 

Lc(N)» L,(N), 


and it does so for all N, as when N is as small as one. 
Some goals are strange, but need not be as bad as the odd rules some seek to base them on. 
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INVESTMENT FOR THE LONG RUN: NEW EVIDENCE FOR AN 
OLD RULE 


Harry M. MARKOWITZ* 


I. BACKGROUND 


“INVESTMENT FOR THE LONG RUN," as defined by Kelly [7], Latané [8] [9], Marko- 
witz [10], and Breiman [1] [2], is concerned with a hypothetical investor who neither 
consumes nor deposits new cash into his portfolio, but reinvests his portfolio each 
period to achieve maximum growth of wealth over the indefinitely long run. (The 
hypothetical investor is assumed to be not subject to taxes, commissions, illiquidi- 
ties and indivisibilities.) In the long run, thus defined, a penny invested at 6.01% is 
better—eventually becomes and stays greater—than a million dollars invested at 
6%. 

When returns are random, the consensus of the aforementioned authors is that 
the investor for the long run should invest each period so as to maximize the 
expected value of the logarithm of (1+single period return). The early arguments 
for this “maximum-expected-log” (MEL) rule are most easily illustrated if we 
assume independent draws from the same probability distribution each period. 
Starting with a wealth of Wo, after T periods the player’s wealth is 


Wr=Wy lle) (1) 


t=] 


. where r, is the return on the portfolio in period t. Thus 
T 

log(W,/Wo)= >, log(1+r,) (2) 
T 


If log(1 +r) has a finite mean and variance, the weak law of large numbers assues 
us that for any e» 0 


Prob | los, W,/W,)— Elog(1+r) >«}-00 (3) 


* IBM Thomas J. Watson Research Center, Yorktown Heights, New York. 
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and the strong law! assures us that 


"S! 
lim 7 leg W_/W,) = Elog(1+r) (4) 


with probability 1.0. Thus if Elog(1--7) for portfolio A exceeds that for portfolio 
B, then the weak law assures us that, for sufficiently large T, portfolio A has a 
probability as close to unity as you please of doing better than B when time- T; 
and the strong law assures us that 


Wi/W?—coo with probability one. (5) 


Some authors have argued that the strategy which is optimal for the player for 
the long run is also a good rule for some or all real investors. My own interest in 
the subject stems from a different source. In tracing out the set of mean, variance 
(E, V) efficient portfolios one passes through a portfolio which gives approximately 
maximum £Elog(l-47)? I argued that this "Kelly-Latané" point should be consid- 
ered the upper limit for conservative choice among Æ, V efficient portfolios, since 
portfolios with higher (arithmetic) mean give greater short-run variability with less 
return in the long run. A real investor might, however, perfer a smaller mean and 
variance, giving up return in the long run for stability in the short run. 

Samuelson [14] and [15] objected to MEL as the solution to the problem posed in 
[1], [2], [7], [8], [9], [10]. Samuelson's objection may be illustrated as follows: 
suppose again that the same probability distríbutions of returns are available in 
each of T periods, t— 1,2,..., T. (Samuelson has also treated the case in which t is 
continuous; but his objections are asserted as well for the original discussion of 
discrete time. The latter, discrete time, analysis is the subject of the present paper.) 
Assume that the utility associated with a play of a game is 


U-Wt*/a a#0 (6) 


where W, is final wealth. Samuelson shows that, in order to maximize expected 
utility for the game as a whole, the same portfolio should be chosen each period. 
This always chosen portfolio is the one which maximizes single period 


EU=E(1+r)"/a. | (7) 


Furthermore, if EUS. is the expected return provided by this strategy for a T 


1. In most cases the early literature on investment for the long run used the weak law of large 
numbers. The results in Breiman [1], however, specialize to a strong law of large numbers in the 
particular case of unchanging probability distributions. See also the Doob [4] reference cited by 


Breiman. 
2. Markowitz (10] Chapters 6 and 13 conjectures, and Young and Trent [16] confirm that 


Elog(| +r)avlog(1 + E) - -(V/(04- EY) 


for a wide class of actual ex post distributions of annual portfolio returns. 
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period game, and EU% is that provided by MEL, usually we will have 
EUT/EUL—o) as Too (8) 


Thus, despite (3), (4) and (5), MEL does not appear to be asymptotically optimal 
for this apparently reasonable class of games. 

Von Newman and Morgenstern [17] have directly and indirectly persuaded 
many, including Samuelson and myself, that, subject to certain caveats, the 
expected utility maxim is the correct criterion for rational choice among risky 
alternatives. Thus if it were true that the laws of large numbers implied the general 
superiority of MEL, but utility analysis contradicted this conclusion, I am among 
those who would accept the conclusions of utility analysis as the final authority. 
But not every model involving "expected utility" is a valid formalization of the 
subject purported to be analyzed. In particular I will argue that, on closer 
examination, utility analysis supports rather than contradicts MEL as a quite 
general solution to the problem of investment for the long run. 


II. THE SEQUENCE OF GAMES 


It is important to note that (8) is a result concerning a sequence of games. For fixed 
T, say T=100, EU= EW 19o/a is the expected utility (associated with a particular 
strategy) of a game involving precisely 100 periods. For T=101, EW,,/a is the 
expected utility of a game lasting precisely 101 periods; and so on for T 
= 102,103,.... 

That (8) is a statement about a sequence of games may be seen either from the 
statement of the problem or from the method of solution. In Samuelson's formula- 
tion W;. is fina! wealth—wealth at the end of the game. If we let T vary (as in 
*T— 0o") we are talking about games of varying length. 

Viewed differently, imagine computing the solution by dynamic programming 
starting from the last period and working in the direction of earlier periods. (Here 
we may ignore the fact that essentially the same solution reemerges in each step of 
the present dynamic program. Our problem here is not how to compute a solution 
economically, but what problem is being solved). If we allow our dynamic pro- 
gramming computer to run backwards in time for 100 periods, we arrive at the 
optimum first move, and the expected utility for the game as a whole given any 
initial Wo, for a game that is to last 100 moves. If we allow the computer to 
continue for an additional 100 periods we arrive at the optimum first move, and the 
expected utility for the game as a whole given any initial Wo, for a game that is to 
last for 200 moves; and so on for 7—201,202,.... 

In particular, equation (8) is not a proposition about a single game that lasts 
forever. This particular point will be seen most clearly later in the paper when we 
formalize the utility analysis of unending games. 

To explore the asymptotic optimality of MEL, we will need some notation 
concerning sequences of games in general. Let T, « T; « T4::- be a sequence of 
strictly increasing positive integers. In this paper? we will denote by G}, G» G3... a 


3. A somewhat different, but equivalent, notation was used in [11]. 
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sequence of games, where the ith game lasts T, moves. (In case the reader feels 
uncomfortable with the notion of a sequence of games, as did at least one of our 
colleges who read [11], perhaps the following remarks may help. The notion of a 
sequence of games is similar to the notion of a sequence.of numbers, or a sequence 
of functions, or a sequence of probability distributions. In each case there is a first 
object (i.e., a first number or function or distribution or game) which we may 
denote as G,; a second object (number, function, distribution, game) which we may 
denote by G,; etc.). 

In general we will not necessarily assume that the same opportunities are 
available in each of the T; periods of the game G, We will always assume that—as 
part of the rules that govern G—the game G; is to last exactly T; periods, and that 
the investor is to reinvest his entire wealth (without commissions, etc.) in each of 
the T, periods. Beyond this, specific assumptions are made in specific analyses. 

In addition to a sequence of games, we shall speak of a sequence of strategies 
55,55, 53,... Where s; is a strategy (i.e., a complete rule of action) which is valid for 
(may be followed in) the game G,. By convention, we treat the utility function as 
part of the specification of the rules of the game. The rules of G; and the strategy s; 
together imply an expected utility to playing that game in that manner. 


III. ALTERNATE SEQUENCE-OF-GAMES FORMALIZATIONS 


Let g equal the rate of return achieved during a play of the game G; i.e., writing T 
for T; 


Wy-Wy(ltg) (9) 
or 


g=(Wr/ Wo) 7-1. (10) 


In the Samuelson sequence of games, here denoted by G, G» G;,..., the utility 
function of each game G; was assumed to be 


U-f(Wy)-Wt/a. (11) 

We can imagine another sequence of games—call them H, H5, H;,... —which 

have the same number of moves and the same opportunities per move as 

Gp Gz, Gy,..., respectively, but have a different utility function. Specifically im- 
agine that the utility associated with a play of each game H; is 

U- V(g). (11a) 


for some increasing function of g. For a fixed game of length T= T, we can always 
find a function V(g) which gives the same rankings of strategies as does some 
specific f(W,). For example, for fixed T (11) associates the same U to each 
possible play as does 


U- V(g)- Ws: (1 gy /a. (11b) 
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Thus for a given T it is of no consequence whether we assume that utility is a 
function of final wealth Wy or of rate of return g. 

On the other hand, the assumption that some utility function V(g) remains 
constant in a sequence of games, as in H,, H} H}... has quite different con- 
sequences than the assumption that some utility function (W7) remains constant 
as in G, G,,.... Markowitz [11] shows that if V(g) is continuous then 


EVL/EV$>1 as T—oo (12a) 


where EVE is the expected utility provided by the MEL strategy for the game H, 
and EV?. is the expected utility provided by the optimum strategy (if such an 
optimum exists‘); and if V(g) is discontinuous then 


EVE/EV®.>1—e-6 (12b) 


where 6 is the largest jump in V at a point of discontinuity, and «20 as T— oo. 
(12a) and (12b) do not require the assumption that the same probability distribu- 
tions are available each period. It is sufficient to assume that the return r is 
bounded by two extremes r and r: 


-l«r&r&?«oo (13) 


e.g., the investor is assumed to not lose more than, say, 99.99% nor make more than 
a million percent on any one move in any play of any game of the sequence. Note 
also that V( g) is not required to be concave, nor strictly increasing nor differenti- 
able; but of course it is allowed to be such. 

Thus under quite general assumptions, if V( g) is continuous MEL is asymptoti- 
cally optimal in the sense of 12a. If V( g) has small discontinuities, then MEL may 
possibly fail to be asymptotically optimal by small amounts as in 12b. These results 
are in contrast to (8), derived on the assumption of constant U= f( W7). 

In [11] I argued that the assumption of constant V( g) in a sequence of games is a 
more reasonable formalization of "investment for the long run" than is the 
assumption of constant U( W7). Given the basic assumptions of utility analysis, the 
choice between constant V( g) and constant U( W7) is equivalent to deciding which 
of two types of questions would be more reasonable to ask (or determine from 
revealed preferences) of a rational player who invests for the long run in the'sense 
under discussion. 

Example of question of type I: what probability would make you indifferent 
between (a) a strategy which yields 6% with certainty in the long run: and (b) a 
strategy with a probability a of yielding 9% in the long run versus a probability of 
1 —« of yielding 3% in the long run. 

Example of question of type II: if your initial wealth is 810,000.00, what 


4. The assumptions of [11] do not necessarily imply that an optimum strategy exists. In any case (12a) 
and (12b) apply to any “other” strategy such that 


EV9.» EVE — forall T. 
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probability 8 would make you indifferent between (a) a strategy which yields 
$20,000 with certainty in the long run, versus (b) a strategy which yields 825,000 
with probability 8 and 815,000 with probability (1— 8) in the long run. 

Question I has meaning if constant V(g) is assumed; question II if constant 
U(W,) is assumed. It seemed to me (and still does) that preferences among 
probability distributions involving, e.g., 3%, 6% or 9% return in the indefinitely long 
run are more reasonable to assume than preferences among probability distribu- 
tions involving a final wealth of, e.g., $10,000, $15,000 or $20,000 in the long run. I 
will not try to further argue the case for constant V(g) as opposed to constant 
U(W7) at this point, other than to encourage the reader to ask himself questions of 
type I and type II to judge. 

In [11] I also argued that even if we were to assume constant U(W,) rather than 
constant V(g), we would have to assume that U was bounded (from above and 
below) in order to avoid paradoxes like those of Bernoulli [3] and Menger [13]. I 
then show that MEL is asymptotically optional for bounded U( W). Merton and 
Samuelson [12] and Goldman [6] object to my definition of asymptotic optimality, 
although it is essentially the same as the criteria by which we judge, e.g. a statistic 
to be asymptotically efficient. Merton and Samuelson proposed, and Goldman 
adopted, an alternative criterion in terms of the “bribe” required to make a given 
strategy as good as the optimum strategy. But this bribe criteria seems to me 
unacceptable, since it violates a basic tenant of game theory—that the normalized 
form of a game (as described in [17]) is all that is needed for the comparison of 
strategies. It is not possible to infer the Samuelson-Merton-Goldman bribe from 
the normalized form of a game. Strategies Ia and Ib in game I may have the same 
expected utilities, respectively, as strategies IIa and IIb in game II; but a different 
bribe may be required to make Ia indifferent to Ib than is required to make Ila 
indifferent to IIb. Strategies IIIa and IIIb in a third game (not necessarily an 
investment game) may have the same pair of expected utilities as Ia and Ib in game 
I, or IIa and IIb in game II, but the notion of a bribe may have no meaning 
whatsoever in game III? Thus unless we are prepared to reject the equivalence 
between the normalized and extensive form of a game in evaluating strategies, we 
must reject the Merton-Samuelson-Goldman bribe as part of a precise, formal 
definition of asymptotic optimality. 


5. For example, suppose that strategy (a) has EU, *:0 and strategy (b) has EU,= 4. What bribe will 
make (a) as good as (b)? Consider the answer, e.g., for one period games I and II in which (a) accepts 
=} with certainty and (b) elects a 50-50 chance of W=0 versus W  ]. In (I) suppose 


0 for W«i 
U=; 10*(wW-4) for O5«W«0.6 
1 W> 0.6 


while in II suppose 


0 for W«09 
U- 4 10*(W—0.9) 0.9<W<1.0 
l W>1.0 


In game I, (a) requires a bribe of 0.05; in game II (a) requires 0.45. 
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IV. UNENDING GAMES 


Even if we agree that a player playing a fixed finite game should maximize 
expected utility, we cannot determine whether MEL is asymptotically optimal for a 
given sequence of games {G,} unless we can agree on criteria for asymptotic 
optimality. What is needed is either “metacriteria” regarding how to choose criteria 
of asymptotic optimality, or else an alternate method of analyzing the desirability 
of strategies for the long run. This section presents such an alternate method, 
namely the utility analysis of unending games. 

Consider a game G,, which is like one of the games G, described above with this 
one exception: the game G,, never terminates. Instead of having a first move, a 
second move, and so on through a 7;th move, we have an unending sequence of 
moves. As with a game G, a strategy for a G,, is a rule specifying the choice of 
portfolio at each time / as a function of the information available at that time. The 
only difference is that now the rule is defined for each positive integer /- 1,2,3,... 
rather than only for 1 &/ & T. 

Given a particular game G,, and a strategy (5), a play of the game involves an 
infinite sequence of "spins of the wheel" and results in an infinite sequence of 


wealths at each time: 
(Wo, Wy W,...,W,...) (14) 


where W, is initial wealth, and 
W,= W? (1 return at time t) (15) 
as in G, 

The reader should find it no more unthinkable to imagine an infinite sequence of 
Spins than to imagine drawing a uniformly distributed random variable. For 
example, if the same wheel is to be spun each time in an unending game, and if this 
wheel has ten equally probable stopping points, which we may label 0 through 9, 
then the infinite decimal expansion of a uniform [0,1] random variable may be 
taken as the infinite sequence of random stopping points of the wheel.® If the wheel 
has sixteen stopping points, then the hexadecimal expansion of the random number 
may be used. In either case the infinite sequence of wealths (Wo, Wi, Wa...) is 
implied by the rules of the game, the player's strategy, and the uniform random 
number drawn. 

In general, a given Go and a given strategy imply a probability distribution of 
wealth-sequences (Wo, Wp W,,...). 

Since G, has no “last period", we cannot speak of “final wealth". We can, 
however, assume that the player has preferences among alternate Wealth-sequences: 
e.g., he may prefer the sequence of passbook entries provided by a savings account 
which compounds his money at 6%, starting with W,, to one that compounds it at 
3%. Given any two sequences: 


W1= (Wo, W3, W3...) 


6. The fact that some numbers have two decimal expansions, like 0.4999... versus 0.5000..., may be 
resolved in any manner without effect on the analysis; since such numbers occur with zero probability. 
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and 
W*-(W, W*, W^...) 


we may assume that the player either prefers W^ to W^, or W^ to W? or is 
indifferent. Further, we may assume that given a choice between any two probabil- 
ity distributions among sequences of wealth 


Pr (Wo WA, Wa...) 
versus 
Pra( Wy, Wp Wa...) 


he either prefers probability distribution A to B, or B to A, or is indifferent 
between the two probability distributions. 

We shall not only assume that the player has such preferences, but also that he 
maximizes expected utility. In other words, we assume that he attaches a (finite) 
number 


U(Wo, WA, W,,...) 


to each sequence of wealths, and chooses among alternate strategies so as to 
maximize EU. 

The only additional assumption we make about the utility function U(...), is 
this: 

If the sequence W°=(Wp Wi, W3,...) eventually pulls even with, and then stays 
even with or ahead of the sequence 


W^-(Wy,Wt,W35,..) 
then W^? is at least as good as W^; i.e., if there exists a T such that 
W:2W^  fort»T (16) 


then U(W?) > U(W^). This assumption expresses the basic notion that, in the sense 
that we have used the terms throughout this controversy, if player A eventually gets 
and stays ahead of player B (or at least stays even with him) then player A has 
done at least as well as player B “in the long run”. 

At first it may seem appropriate to make a stronger assumption that if W7 
eventually pulls ahead of W^, and stays ahead, then the sequence W^ is preferable 
to the sequence W^. In other words, if there is a T such that 


W:»2W*  fort2T (16a) 
then 


U(W*)» U(W*). 
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As shown in the footnote’, this stronger assumption is too strong in that no 
utility function U( W,, Wi, Wa...) can have this property. Utility functions can 
however have the weaker requirement in (16). 

The analysis of unending games is particularly easy if we assume that the same 
opportunities are available at each move, and that we only consider strategies 
which select the same probability distribution of returns each period. We shall 
make these assumptions at this point. Later we will summarize more general results 
derived in the appendix to this paper. 

Without further loss of generality we will confine our discussion to just two 
strategies, namely, MEL and any other strategy, and will consider when the 
expected utility supplied by MEL is at least as great as that supplied by the other 
strategy. Letting WŁ and W? be the wealth at ¢ for a particular play of the game 
using MEL or the other strategy, respectively, 


U( Wo Wt, Wis.) > U( Wo w?, W9...) (17) 
is implied if there is a T such that 


WŁ> W?  fort>T. (18) 


7. If U orders all sequences We (Wo, Wi Wa...) in a manner consistent with (16a), then in 
particular it orders sequences of the form 


Wo- given 
W,-— any positive number (N.1) 
W,-(l*a)M)9W,, fort>2;a>—-1. 


Since this family of sequences depends only on W, and a, we may here write 
VW p a) = U( Wo, Wy, W;,...). (N.2) 
Then (16a) requires 
V(W4,04)>V(W?,07)  ifeithera4»a? or 
a^-a? and | M>. (N.3) 


For any a let 
Uis (a) = GLB V(W,, a) (N.4) 


U*i(a) = LUB V(W,, a). 
Then (N.3) implies 
Uy (a)«U*(a) for every a (N.5a) 
as well as 
Uj,(a4)»U5(a7) ifat>a?, (N.5b) 


But (N.5b) implies that we can have U,,,(a)< Ux) for at most a countable number of values of a, 
since at most a countable number of values of a can have U™(a) — U (a) > 1/N for N=1,2,3,.... But 
this contradicts (N.5a). : 
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Equation (2) implies that we have W+ > W? if and only if 


> lese)» 5 log(1 +79). (19) 
Thus (18) will hold in any play of the game in which there exists a T such that 
> log(1 +74) > 1 X log(I+r9) forall £> T. (20) 
Or, if we let 
y;7log(1- r;) (21) 


(20) may be written as 
| « 1 « 
TE> forts. (22) 


Under the present simplified assumptions 
Ey" > Ey? (23) 


by definition of MEL. But for random variables y}, Y»... with identical distribu- 
tions and with (finite) expected value p, we have 


lim 1l X (y;7p)20 (24) 


ind l; 


except for a set of probability measure zero. In other words 
. 1l 
lim | X yu Q5) 


except for a set of sequences which have (in total) zero probability of occurrence 
(c.f. the strong law of large numbers in [4] or [5]). But (23) and (25) imply (as a 
simple corollary of the definition of the limit of sequence) that there exists T' such 


that 
1« l< 
TE> fore 7 (26) 


except on a set of probability zero; hence (17) holds except on a set of measure 
zero, Since 


EU= f U(Wo Wy...) dP (Wo Wy...) (27) 


is not affected by arbitrarily changing the value of U on a set of measure zero, we 
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have 
EU( Wo, Wt, Wii) 2 EU( Wo, we, Woes) (28) 


Thus, given our simplifying assumption of an unchanging probability distribu- 
tion of returns for a given strategy, the superiority of MEL follows quite generally. 

The case in which opportunities change from period to period and, whether or 
not opportunities change, strategies may select different distributions at different 
times, is treated in the appendix. It is shown there that if a certain continuity 
condition holds, then MEL is optimal quite generally. If this continuity condition 
does not hold, however, then there can exist games for which MEL is not optimal. 

In this respect the results for the unending game are similar to those for the 
sequence of games with constant V( g). In the latter case we found that MEL was 
asymptotically optimal for the sequence of games if V(g) was continuous, but 
could fail to be so if V( g) was discontinuous. In the case of the unending game, the 
theorem is not concerned with asymptotic optimality in a sequence of games, but 
optimality for a single game. Given a particular continuity condition, MEL is the 
optimum strategy. 


V. CONCLUSIONS 


The analysis of investment for the long run in terms of the weak law of large 
numbers, Breiman's strong law analysis, and the utility analysis of unending games 
presented here each imply the superiority of MEL under broad assumptions for the 
hypothetical investor of [1], [2], [7], [8], [9], [10]. The acceptance or rejection of a 
similar conclusion for the sequence-of-games formalization depends on the defini- 
tion of asymptotic optimality. For example, if constant V(g) rather than constant 
U(W,) is assumed, as this writer believed plausible on a priori grounds, then the 
conclusion of the asymptotic analysis is approximately the same (even in terms of 
where MEL fails) as those of the unending game. 

I conclude, therefore, that a portfolio analyst should not be faulted for warning 
an investor against choosing E, V efficient portfolios with higher E and V but 
smaller Elog(1 + R), perhaps not even presenting that part of the E, V curve which 
lies above the point with approximate maximum Elog(1-- R), on the grounds that 
such higher E, V combinations have greater variability in the short run and less 
"return in the long run". 


APPENDIX 


Using the notation of footnote 7, we will show that if U,,,(a)=U(a) for all a 
then MEL is an optimum strategy quite generally; whereas, if U™(a)>U,,,(a) for 
some ay, then a game can be constructed in which MEL is not optimum. 
U,, (a) = U™(a) for all a is the “continuity condition" referred to in the text. 

For v=L or 0, indicating the MEL strategy or some other strategy, respectively, 
we define 


yr Lour Q9) 
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where 
LP=E{ ye | Lp, Ly...) Lo yu u.s up au) (30) 
is the expected value of y? given the events prior to time ¢. From this follows 
E(wuf|Lj,...,up,) =0. (31) 


The u? (for a given v) are thus what Doob [4] refers to as "orthogonal" random 
variables, and Feller [5] calls “completely fair" random variables. Therefore, 
writing var for variance, 


o var(u;) 
co (32) 
n=] n 
implies 
> u? converges to 0 almost always. (33) 
t-21 


(In particular, (32) holds if the var(u7) are bounded.) In addition to now assuming 
condition (32) we will also assume that the game is such that 


lim 1 > LE exists almost always. (34) 
"> i=] 


This is the case, for example, when the same distributions are available each time, 
whether or not “the other” strategy uses a constant distribution. Since L^» L? 


always, we have 


tim 2 > LL- lim sup. Y LE» lim supt S L? — always. (35) 
n n 


i= | i=] i=] 


Thus when (32) holds we have 


imt >) yf > limsup— > y)? almost always. (36) 


im] i=] 
In general, 


a=limsup+ > y? implies U(a)> U(Wo W3,W3...); (37) 


(since there always exists another series yr, y7,... such that 


and 


n 1 n 
2 > y? >- forall n; 
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hence 
U*" (a) > U( Wo, Wt, Wt...) > U(U WY, W3,...)). 


If we now add to the assumptions expressed in equations (32) and (34), the 
assumption that U(a)=U,,,(a) for all a, we get directly from (36) and (37) that 


EU! > EU’. 


Conversely, the following is an example in which U"(a,)> U, (ag) and which 
MEL is not optimum: let W,=1 and suppose that for some fixed positive a we 
have U(1,0.5,0.5(1+a),0.5(1+«)’,...) equals 


U(1,(1 +a), (1 +a}, (1 +a),...)< U(1, L5, 1.5(1 +a), 1.5(1+4)’,...). 


With such a U-function it would be better to take a 50-50 chance of W, 0.5 or 1.5 
followed by W,=(l+a)W,_,,t>2, rather than have W,=(l+a)-W,_, with 
certainty for t 2 1,.... 

While the above shows that MEL can fail to be optimal when U™(a) > UV (o) 
for some a, recall that we can have U"(a)» U,, (a) for at most a countable 
number of values of a. Thus MEL is optimal in a game in which 


a-liml Y L} 


i=] 


has a continuous distribution, or in which a has a discrete or mixed distribution 
but in which none of the points of discontinuity of the cumulative probability 
distribution of a have UP(a)» U, (a). 
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Understanding the Kelly Criterion* 
Edward O. Thorp 


In January 1961, I spoke at the annual meeting of the American Mathemati- 
cal Society on “Fortune’s Formula: The Game of Blackjack". This announced the 
discovery of favorable card counting systems for blackjack. My 1962 book Beat 
the Dealer explained the detailed theory and practice. The ‘optimal’ way to bet in 
favorable situations was an important feature. In Beat the Dealer, I called this, nat- 
urally enough, “The Kelly gambling system”, since I learned about it from the 1956 
paper by John L. Kelly (Claude Shannon, who refereed the Kelly paper, brought 
it to my attention in November of 1960). I have continued to use it successfully 
in gambling and in investing. Since 1966, I’ve called it “the Kelly Criterion”. The 
rising tide of theory about and practical use of the Kelly Criterion by several lead- 
ing money managers received further impetus from William Poundstone's readable 
book about the Kelly Criterion, Fortune's Formula. (As this title came from that 
of my 1961 talk, I was asked to approve the use of the title). At a value investor's 
conference held in Los Angeles in May, 2007, my son reported that ‘everyone’ said 
they were using the Kelly Criterion. 

The Kelly Criterion is simple: bet or invest so as to maximize (after each bet) 
the expected growth rate of capital, which is equivalent to maximizing the expected 
value of the logarithm of wealth; but the details can be mathematically subtle. 
Since they're not covered in Poundstone (2005), you may wish to refer to my ar- 
ticle, Thorp (2006), and other papers in this volume. Also some services such as 
Morningstar and Motley Fool have recommended it. These sources use the rule: 
“optimal Kelly bet equals edge/odds" that applies only to the very special case of a 
two-valued payoff. 

Hedge fund manager, Mohnish Pabrai (2007), gives examples of the use of the 
Kelly Criterion for investment situations (Pabrai won the bidding for the 2008 lunch 
with Warren Buffett, paying over $600,000). Consider his investment in Stewart 
Enterprises (Pabrai, 2007: 108-115), his analysis gave what he believed to be a list 
of worst case scenarios and payoffs over the next 24 months which I summarize 
in Table 1. 

The expected growth rate of capital g(f) if we bet a fraction f of our net 


* Reprinted revised from two columns from the series A Mathematician on Wall Street in Wilmott 
Magazine, May and September 2008. Edited by Bill Ziemba. 
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Table 1 Stewart enterprises, 
payoff within 24 months. 


Probability Return 

pi — 8.80 Ri > 100% 

p2 = 0.19 R2 > 0% 

p3 = 0.01 R3 = —100% 

Sum = 1.00. 
worth is 
3 

g(f) = >> piln( + Rif) (1) 

t=1 


where In means the logarithm to the base e. When we use Table 1 to insert the p; 
values, replacing the R; by their lower bounds gives the conservative estimate 


g(f) = 0.801n(1 + f) + 0.01 In(1 — f). (2) 


Setting g'(f) = 0 and solving gives the optimal Kelly fraction f* = 0.975 noted by 
Pabrai. Not having heard of the Kelly Criterion in 2000, Pabrai only bet 10% of 
his fund on Stewart. Would he have bet more, or less, if he had then known about 
Kelly’s Criterion? Would I have? Not necessarily. Here are some of the many 
reasons why: 

(1) Opportunity costs. A simplistic example illustrates the idea. Suppose 
Pabrai’s portfolio already had one investment which was statistically independent 
of Stewart and with the same payoff probabilities. Then, by symmetry, an optimal 
strategy is to invest in both equally. Call the optimal Kelly fraction for each f*, 
then 2f* < 1 since 2f* = 1 has a positive probability of total loss, which Kelly 
always avoids, so f* < 0.50. The same reasoning for n such investments gives 
f* <1/n. Hence, we need to know the other investments currently in the portfolio, 
any candidates for new investments, and their (joint) properties, in order to find 
the Kelly optimal fraction for each new investment, along with possible revisions 
for existing investments. Formally, we solve the nonlinear programming problem: 
maximize the expected logarithm of final wealth subject to the various constraints 
on the asset weights (see the papers in Section 6 of this volume for examples). 

Pabrai’s discussion (e.g. pp. 78-81) of Buffett’s concentrated bets gives consid- 
erable evidence that Buffet thinks like a Kelly investor, citing Buffett bets of 25% 
to 40% of his net worth on single situations. Since f* < 1 is necessary to avoid 
total loss, Buffett must be betting more than 0.25 to 0.40 of f? in these cases. The 
opportunity cost principle suggests it must be higher, perhaps much higher. Here’s 
what Buffett himself says, as reported in http://undergroundvalue.blogspot.com/ 
2008/02 /notes-from-buffett-meeting-2152008.23.html, notes from a Q & A session 
with business students: 


Understanding the Kelly Criterion 511 


Emory: 


With the popularity of “Fortune’s Formula" and the Kelly Cri- 
terion, there seems to be a lot of debate in the value community 
regarding diversification vs. concentration. I know where you side 
in that discussion, but was curious if you could tell us more about 
your process for position sizing or averaging doum. 


Buffett: 


I have 2 views on diversification. If you are a professional and 
have confidence, then I would advocate lots of concentra- 
tion. For everyone else, if it’s not your game, participate in total 
diversification. So this means that professionals use Kelly and am- 
ateurs better off with index funds following the capital asset pricing 
model. 

If it's your game, diversification doesn't make sense. It's crazy 
to put money in your 20th choice rather than your 1st choice. If 
you have LeBron James on your team, don't take him out of the 
game just to make room for some else. 

Charlie and I operated mostly with 5 positions. /f I were 
running 50, 100, 200 million, I would have 80% in 5 positions, with 
25% for the largest. In 1964, I found a position I was willing to 
go heavier into, up to 40%. I told investors they could pull their 
money out. None did. The position was American Express after 
the Salad Oil Scandal. In 1951 I put the bulk of my net worth into 
GEICO. With the spread between the on-the-run versus off-the-run 
30 year Treasury bonds, I would have been willing to put 75% of 
my portfolio into it. There were various times I would have gone 
up to 75%, even in the past few years. If it’s your game and you 
really know your business, you can load up. 


This supports the assertion in Rachel and Bill Ziemba’s 2007 book, that Buffett 
thinks like a Kelly investor when choosing the size of an investment. They discuss 
Kelly and investment scenarios at length. 

Computing f* without considering the available alternative investments is one 
of the most common oversights I’ve seen in the use of the Kelly Criterion. It is a 
dangerous error because it generally overestimates f*. 

(2) Risk tolerance. As discussed at length in Thorp (2006), “full Kelly" is too 
risky for the tastes of many, perhaps most, investors and using instead an f = cf*, 
with fraction c where 0 « c < 1 or “fractional Kelly" is much more to their liking. 
Full Kelly is characterized by drawdowns which are too large for the comfort of 
many investors. 


ISeveral papers in Section 3 in this volume, as do the following two papers in this section, discuss 
fractional Kelly strategies. 
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(3) The *true" scenario is worse than the supposedly conservative lower 
bound estimate. Then we are inadvertently betting more than f* and, as dis- 
cussed in Thorp (2006), we get more risk and less return, a strongly suboptimal 
result. Betting f — cd*, 0 « c « 1 gives some protection against this (see the 
graphs in Section 3, (MacLean, Ziemba and Blazenko (1992)). 

(4) Black swans. As fellow Wilmott columnist Nassim Nicholas (Taleb 2007) 
has pointed out so eloquently in his bestseller The Black Swan, humans tend not to 
appreciate the effect of relatively infrequent unexpected high impact events. Failing 
to allow for these “black swans”, scenarios often don't adequately consider the 
probabilities of large losses. These large loss probabilities may substantially reduce 
f*. One approach to successfully model such black swans is to use a scenario 
optimization stochastic programming model? For Kelly bets that simply means 
that you include such extreme scenarios and their consequences in the nonlinear 
programming optimization to compute the optimal asset weights. The f* will be 
reduced by these negative events. 

(5) The *long run". 'The Kelly Criterion's superior properties are asymptotic, 
appearing with increasing probability as time increases. For instance: 

As time t tends to infinity the Kelly bettor’s fortune will, with probability tend- 
ing to 1, permanently surpass that of any bettor following an “essentially different” 
strategy. 

The notion of “essentially different” has confounded some well known quants so 
I'll take time here to explore some of its subtleties. Consider for simplicity repeated 
tosses of a favorable coin, the outcome of the nth trial is X, where P(X, = 1) = 
p > 1/2 and P(X, = —1)is1—p = q > 0. The {Xn} are independent identically 
distributed random variables. The Kelly fraction is f* = p — q = E(Xn) > 0. The 
Kelly strategy is to bet a fraction fn = f* at each trial n = 1,2,.... Now consider 
a strategy which bets gn, n = 1,2,... at each trial with gn Z f* for some n < N 
and gn = f* thereafter. The {gn} strategy differs from Kelly on at least one of the 
first N trials but copies it thereafter, but it does not differ infinitely often. There 
is a positive probability that {gn} is ahead of Kelly at time N, hence ahead for 
all n > N. For example consider the sequence of the first N outcomes such that 
Xn = 1 if gn > f* and Xn = —1 if gn € f*. Then for this specific sequence, which 
has probability > q^, {gn} gains more than Kelly for each n < N where gn £ f*, 
hence exceeds Kelly for all n > N. 

What if instead in this coin tossing example we require that g, 4 f* for infinitely 
many n? This question arose indirectly about 15 years ago in the newsletter Black- 
jack Forum when a well known anti Kellyite, John Leib, challenged a well known 
blackjack expert with (approximately) this proposition bet: Leib would produce 
a strategy which differed from Kelly at every trial but would (with probability as 
2There you assume the possibility of an event, specifying its consequences but not what it is. See 
Geyer and Ziemba (2008) for the application to the Siemens Austria Pension Fund. Correlations 


change as the scenario sets move from normal conditions to volatile to crash which include the 
black swans. See also Ziemba (2003) for additional applications of this approach. 
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close to 1 as you wish), after a finite number of trials, get ahead of Kelly and stay 
ahead forever. When I read the challenge I immediately saw how Leib could win 
the bet. 

Leib’s Paradox: Assuming capital is infinitely divisible,footnoteThe infinite di- 
visibility of capital is a minor assumption and can be dealt with as needed in 
examples where there is a minimum monetary unit by choosing a sufficiently large 
starting capital. then given £ > 0 there is an N > 0 and a sequence {fn} with 
fn x f" for all n, such that P(V,* < Vn for all n > N) > 1 — varepsilon where 
Va = [a0 fX) and Vz = [T4 (1+ f^ X). Furthermore, there is a b > 1 such 
that P(V,/V,x > b,n > N) > 1—varepsilon and P(V, —V;; — oo) > 1—varepsilon. 
That is, for some N there is à non Kelly sequence that beats Kelly "infinitely badly” 
with probability 1 — e for all n > N. 


Proof. The proof has two parts. First we want to establish the assertion for 
n = N. Second we show that once we have an {f,,n € N} that is ahead of Kelly 
at n = N, we can construct {fn Æ f*,n > N} to stay ahead. 

To see the second part, suppose Vm > Vx. Then Vw > a+ bV for some 
a >0,b> 1. For instance, Vy >> C > 0 since there are only a finite number 
of sequences of outcomes in the first N trials, hence, only a finite number with 
Vy > VN So: 


Vy > c+ Vg 2 c/2- [(c/2) + V] = c/2 + [dMax Vg; + V] > c/2 4- (d-- IVE 


where d Max Vy, = c/2 defines d > 0 and Max Vy is over all sequences of the first 
N trials such that Vy > Vy. Setting c/2 = a > 0 and d + 1 = b > 1 suffices. Once 
we have Vy > a+ bVy we can, for bookkeeping purposes, partition our capital into 
two parts: a and bVy. For n > N we bet fn = f* from bVx and an additional 
amount a/2” from the a part, for a total which is generally unequal to f* of our 
capital. If by chance for some n the total equals f* of our total capital we simply 
revise a/2" to a/3" for that n. The portion bVý will become bVs for n > N and 
the portion a will never be exhausted so we have Vn > bV7 for all n > N. Hence, 
since P(VĂ — oo) = 1, we have P(V,/Vs > b) = 1 from which it follows that 
P(V, — Vx > o0) = 1. 

To prove the first part, we show how to get ahead of Kelly with probability 1—& 
within a finite number of trials. The idea is to begin by betting less than Kelly 
by a very small amount. If the first outcome is a loss, then we have more than 
Kelly and use the strategy from the proof of the second part to stay ahead. If the 
first outcome is a win, we're behind Kelly and now underbet on the second trial by 
enough so that a loss on the second trial will put us ahead of Kelly. We continue 
this strategy until either there is a loss and we are ahead of Kelly or until even 
betting 0 is not enough to surpass Kelly after a loss. Given any N, if our initial 
underbet is small enough, we can continue this strategy for up to N trials. The 
probability of the strategy failing is p", 1/2 < p < 1 Hence, given £ > 0, we can 
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choose N such that p" < £ and the strategy therefore succeeds on or before trial 
N with probability 1 — p" > 1— e. 

More precisely: suppose the first n trials are wins and we have bet a fraction 
f*— ai with a; > 0,4 — L,...,n, on the ith trial. Then: 


Va (14+ f*—ai)--- 4+ f* — an) 


VS +f f*) 


- c ie) Dex) »(1—-a)--(1—a4,) > 1 (a, +--+ an) 


where the last inequality is proven easily by induction. Letting aj ++- + an =a, 
so V/V; > 1— a, what betting fraction f* — b will put us ahead of Kelly if the 
next trial is a loss? A sufficient condition is 


Vou Vall — f* +8) _ b a 
Ve, VU- Fo >(1 JEE Spa a)(1+b) >1 or b> 7, 


provided b € f* and 0 « a « 1. If a < 1/2 then b = 2a suffices. Proceeding 
recursively, we have these conditions on the a;: choose a; > 0. Then a4,1 = 
2(a14-- a4), n = 1,2,... provided all the an < 1/2. Letting f(x) = aiztagz?+:-- 
we get the equation 


f(x) - aux = zf (+r +r? +) 
= 2rf(x)/(1— z) 
whose solution is f(z) = ai(z + 25771," 72^] from which a, = 20,3"? if 


n 2 2. Then given £ > 0 and an N such that p < e it suffices to choose a, so that 
an = 2043" ? < min(f*, 1/2). 


Although Leib did not have the mathematical background to give such a proof 
he understood the idea and indicated this sort of procedure. 

So far we've seen that all sequences which differ from Kelly for only a finite 
number of trials, and some sequences which differ infinitely often (even always), are 
not essentially different. How can we tell, then, if a betting sequence is essentially 
different than Kelly? Going to a more general setting than coin tossing, assume now 
for simplicity that the payoff random variables X; are independent and bounded 
below but not necessarily identically distributed. 

At this point we come to an important distinction. In financial applications, one 
commonly assumes that the f; are constants that are dependent only on the current 
period payoff random variable (or variables). Such *myopic strategies" might arise 
for instance, by selecting a utility function and maximizing expected utility to 
determine the amount to bet. However, for gambling systems, the amount depend 
on previous outcomes, i.e., fn = fa(X3, Xo,..., X4 1), just as it does in the Leib 
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example. As Professor Stewart Ethier pointed out, our discussion of “essentially 
different" is for the constant f; case. For a more general case, including the Leib 
example and many of the classical gambling systems, I recommend Ethier's 2010 
book on the mathematics of gambling. 

We assume R(X;) > 0 for all X; from which it follows that f7 > 0 for all i. 
As before, V, = [];_,(1 + fiX:) and Vr = [4,0 + f£ Xi) from which In V, = 
yo, (14 fiX;) and In Vf = So, In(14- f? Xi). Note from the definition f* that 
Eln(14 f7X;) > Eln(1t fiXi), where E denotes the expected value, with equality 
if and only if f? — f;. Hence: 


El(V/V,) = BY nt * Xi) —In(1 + fiX} -a 


where a; > 0 and a; = 0 if and only if f7 = f;. This series of non-negative 
terms either increases to infinity or to a positive limit M. We say (fi) is essentially 
different from (7) if and only if $77. ., a; tends to infinity as n increases. Otherwise, 
{fi} is not essentially different from {f*}. The basic idea here can be applied to 
more general settings. 

(6) Given a large fixed goal, e.g., to multiply your capital by 100, or 1000, 
the expected time for the Kelly investor to get there tends to be least. 

Is à wealth multiple of 100 or 1000 realistic? Indeed. In the 511/2 years from 
1956 to mid 2007, Warren Buffett has increased his wealth to about $5 x 1019. If 
he had $2.5 x 10* in 1956, that’s a multiple of 2 x 109. We know he had about 
$2.5 x 10” in 1969 so his multiple over these 38 years is about 2 x 10°. Even my 
own efforts, as a late starter on a much smaller scale, have multiplied capital by 
more than 2 x 10* over the 41 years from 1967 to early 2007. I know many investors 
and hedge fund managers who have achieved such multiples. One of the best is Jim 
Simons, who recently retired from running the Renaissance Medallion Fund. His 
record to 2005 is analyzed in Section 6 of this book. 

The caveat here is that an investor or bettor many not choose to make, or be 
able to make, enough Kelly bets for the probability to be *high enough" for these 
asymptotic properties to prevail, i.e., he doesn't have enough opportunities to make 
it into this "long run". Below I explore investors for which Kelly or fractional Kelly 
may be a more or less appropriate approach. An important consideration will be 
the investor’s expected future wealth multiple. 


Using Kelly Optimization at PIMCO 


During a recent interview in the Wall Street Journal (March 22-23, 2008), Bill Gross 
and I discussed turbulence in the markets, hedge funds, and risk management. Bill 
considered the question of risk management after he read Beat the Dealer in 1966. 
That summer he was off to Las Vegas to beat blackjack. Just as I did some years 
earlier, he sized his bets in proportion to his advantage, following the Kelly Criterion 
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as described in Beat the Dealer, and ran his $200 bankroll up to $10,000 over the 
summer. Bill has gone from managing risk for his tiny bankroll to managing risk for 
Pacific Investment Management Company's (PIMCO) investment pool of almost $1 
trillion.? He still applies lessons he learned from the Kelly Criterion. As Bill said: 
“Here at PIMCO it doesn’t matter how much you have, whether it's $200 or $1 
trilion. ... Professional blackjack is being played in this trading room from the 
standpoint of risk management and that's a big part of our success". 

The Kelly Criterion applies to multiperiod investing and we can get some in- 
sights by comparing it with Markowitzs standard portfolio theory for single period 
investing. 


Compound Growth and Mean-Variance Optimality 


Nobel Prize winner Harry Markowitz introduced the idea of mean-variance optimal 
portfolios. This class is defined by the property that, among the set of admissible 
portfolios, no other portfolio has both higher mean return and lower variance. The 
set of such portfolios as you vary return or variance is known as the efficient frontier. 
The concept is a cornerstone of modern portfolio theory, and the mean and variance 
refer to one period arithmetic returns. In contrast, the Kelly Criterion is used to 
maximize the long term compound rate of growth, a multiperiod problem. It seems 
natural, then to ask the question: is there an analog to the Markowitz efficient 
frontier for multiperiod growth rates, i.e., are there portfolios such that no other 
portfolio has both a higher expected growth rate and a lower variance in the growth 
rate? We'll call the set of such portfolios the compound growth mean-variance 
efficient frontier. 

Let/s explore this in the simple setting of repeated independent identically dis- 
tributed returns per unit invested, where the payoff random variables are (X; : i = 
1,...,n} with E(X;) > 0 so the “game” is favorable, and where the non-negative 
fractions bet at each trial, specified in advance, are (f; :  — 1,...,n]. To keep the 
math simpler, we also assume that the X; have a finite number of distinct values. Af- 
ter n trials the compound, growth rate per period is G({f;}) = 155; , log(1+ fiXi) 


and the expected growth rate g((g;]) = E[G((fi])] = Ly" Flog(1 + fiXi) = 
1 oP, Elog(1+ f; X) € Elog(1+ f X). The last step follows from the (strict) con- 
cavity of the log function, where as X has the common distribution of the X;, we 
define f = i DDAN fi and we have equality if and only if f; = f for all i. Therefore, 
if some f; differ from f, we have g((fi]) < g({f}). This tells us that betting the 
same fixed fraction always produces a higher expected growth rate than betting a 
varying fraction with the same average value. Note that whatever f turns out to 


be, it can always be written as f = cf*, a fraction c of the Kelly fraction. 


3PIMCO is widely regarded as the top bond trading operation in the world. 
^A comprehensive survey of mean-variance theory is in Markowitz and Van Dijk (2006). 


Understanding the Kelly Criterion 517 


Now consider the variance of G(( fi]). If is a random variable with: 


P(X =a)=p, P(r—--1)—-4q and a>0 


2 
then Var(ln(1 + fX)] = pq [in =) ; 
(Compare Thorp, 2006, Section 3.1). 
Note: the change of variable f = bh, b > 0, shows the results apply to any two 
valued random variable. We chose b = 1 for convenience.) 
A calculation shows that the second derivative with respect to f is strictly 
positive for 0 < f < 1 so Var|ln(1 + fX)] is strictly convex in f. It follows that: 


Varl@({f})] = = Y Varllog(+fiXi)] = — Y 7 Varllog(1+ 4] = Verllog(1-- 720] 
i=1 i=1 


with equality if and only if f; = f for all i. Since every admissible strategy is there- 
fore “dominated” by a fractional Kelly strategy, it follows that the mean-variance 
efficient frontier for compound growth is a subset of the fractional Kelly strategies. 
If we now examine the set of fractional Kelly strategies {f} = {cf*}, we see that 
for 0 € c € 1, both the mean and the variance increase as c increases but for c > 1, 
the mean decreases and the variance increases as c increases. Consequently {f*} 
dominates the strategies for which c > 1 and they are not part of the efficient fron- 
tier. No fractional Kelly strategy is dominated for 0 € c < 1. We have established 
in this limited setting: 


Theorem. For repeated independent trials of a two valued random variable, the 
mean-variance efficient frontier for compound growth over a finite number of trials 
consists precisely of the fractional Kelly strategies (cf* : 0 € c x 1]. 


So, given any admissible strategy, there is a fractional Kelly strategy with 0 < 
c <1, which has a growth rate that is no lower and a variance of the growth rate 
that is no higher. The fractional Kelly strategies in this instance are preferable in 
this sense to all the other admissible strategies, regardless of any utility function 
upon which they may be based. This deals with yet another objection to the 
fractional Kelly strategies, namely that there is a wide spread in the distribution of 
wealth levels as the number of periods increases. In fact, this eventually enormous 
dispersion is simply the magnifying effect of compound growth on small differences 
in growth rate and we have shown in the theorem that in the two outcome setting 
this dispersion is minimized by the fractional Kelly strategies. Note that in this 
simple setting, a one-period utility function will choose a constant f; = cf which 
will either be a fractional Kelly with c < 1 in the efficient frontier or will be too 
risky, with c > 1, and not be in the efficient frontier. 

As a second example, suppose we have a lognormal diffusion process with instan- 
taneous drift rate m and variance s? where as before the admissible strategies are 
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to specify a set of fixed fractions (fi) for each of n unit time periods, à = 1,...,n. 
Then, for a given f and unit time period Var G(f) = s? f? as noted in (Thorp, 2006, 
eq. (7.3)). Over n periods Var G({fi}) = $2955. f? = 52» 57. f? with equality if 
and only if f; = f for all i. This follows from the strict convexity of the function 
h(x) = z?. So the theorem also is true in this setting. I don’t currently know how 
generally the convexity of Var[In(1 + f.X)] is true but whenever it is, and we also 
have Var[In(1 + f X)] increasing in f, then the compound growth mean variance 
efficient frontier is once again the set of fractional Kelly strategies with 0 € c < 1. 
In email correspondence, Stewart Ethier subsequently showed that Var[ln(1+ f.X)] 
need not be convex. Example (Ethier): 


Let X assume values —1, 0 and 100 with probabilities 0.5, 0.49 
and 0.01, respectively. Then, on approximately the interval [0.019, 
0.180] the second derivative of the variance is negative, hence the 
variance is strictly concave on that interval. The first derivative of 
Var[In(1 + f X)] equals 2 Cov(In(1 + f X), X/(1-- f.X)), which is al- 
ways nonnegative because the two functions of X in the covariance 
are increasing in X. Thus Var[In(14- f X)] is always increasing in f. 
The second derivative of Var[In(1+ f. X)] equals 2 Var(X/(1+ f .X))- 
2 Cov(In(14- f. X)—1, X?/(14- f. X)?). However, the covariance term 
sometimes exceeds the variance term. 


Samuelson's Criticisms 


The best known “opponent” of the Kelly Criterion is Nobel Prize winning economist, 
Paul Samuelson, who has written numerous polemics, both published and private, 
over the last 40 years. William Poundstone's book Fortune's Formula gives an 
extensive account with references. The gist of it seems to be: 

(1) Some authors once made the error of claiming, or seeming to claim, that 
acting to maximize the expected growth rate (i.e., logarithmic utility) would ap- 
proximately maximize the expected value of any other continuous concave utility 
(the “false corollary” ). 

Response: Samuelson’s point was correct but, to others as well as me, obvious 
the first time I saw the false claim. However, the fact that some writers made 
mistakes has no bearing on an objective evaluation of the merits of the criterion. 
So this is of no further relevance. 

(2) In private correspondence to numerous people Samuelson has offered exam- 
ples and calculations in which he demonstrates, with a two valued X (“stock”) and 
three utilities, H(W) = —1/W, K(W) = log W, and T(W) = W"/?, that if any 
one who values his wealth with one of these utilities uses one of the other utilities 
to choose how much to invest then he will suffer a loss as measured with his own 
utility in each period and the sum of these losses will tend to infinity as the number 
of periods increases. 
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Response: Samuelson's computations are simply instances of the following gen- 
eral fact proven 30 years earlier by Thorp and Whitley (1972, 1974).° 


Theorem 1. Let U and V be utilities defined and differentiable om (0,00) with 
U'(x) and V'(x) positive and strictly decreasing as x increases. Then if U and 
V are inequivalent, there is a one period investment setting such that U and V 
have distinct sets of optimal strategies. Furthermore, the investment setting may be 
chosen to consist only of cash and a two-valued random investment, in which case 
the optimal strategies are unique. 


Corollary 2. If the utilities U and V have the same (sets of) optimal strategies 
for each finite sequence of investment settings, then U and V are equivalent. 


Two utilities U; and U2 are equivalent if and only if there are constants a and 
b such that U5(x) = aU; (x) + b(a > 0), otherwise U — 1 and U» are inequivalent. 

Thus, no utility in the class described in the theorem either dominates or is 
dominated by any other member of the class. 

Samuelson offers us utilities without any indication as to how we ought to choose 
among them, except perhaps for this hint. He says that he and an apparent majority 
of the investment community believe that maximizing U(x) = —1/z explains the 
data better than maximizing U(x) = logz. How is it related to fractional Kelly? 
Does this matter? Here are two examples showing that this utility can choose cf* 
for any 0 < c < 1, c Z 1/2, depending on the setting: 

For a favorable coin toss and U(x) = —1/z, we have f* = u/(/p + yq)? which 
increases from 4/2 or half Kelly to u or full Kelly as p increases from 1/2 to 1, 
giving us the set 1/2 < cl. On the other hand, if P(X = A) = P(X = —1) = y 
describe the returns and A > 1 so p > 0, u = (A—1)/2 and the Kelly f* = u/A. For 
U(x) = —1/z we find U maximized for f = {-2A+(4A2+ A(A—1)?)/?}/A(A-1), 
which is asymptotic to A7!/? as A increases, compared to the Kelly f*, which is 
asymptotic to 1/2 as A increases, giving us the set 0 < c < 1/2. 

In the continuous case, the relation between c, g(f) and o(G(f)) is simple and 
the tradeoff between growth and spread in growth rate as we adjust between 0 
and 1 is easy to compute and it’s easy to visualize the correspondence between 
fractional Kelly and the compound growth mean-variance efficient frontier. This 
is not the case for these two examples so the fact that U(x) = —1/x can choose 
any c, 0 < c < 1, c z 1/2 doesn't necessarily make it undesirable.® I suggest that 


5The first Thorp and Whitley paper is reprinted in this book in Section 4 where three of Samuel- 
son’s papers are reprinted and discussed in the introduction to that part of this book. 
$MacLean, Ziemba and Li (2005) reprinted in Section 4 of this book, show that for lognormally 
distributed assets, a fractional Kelly strategy is uniquely related to the coefficient a < 0 in the 
negative power utility function aw® via the formula c = 1/(1 — o) so 1/2 Kelly is —1/w. However, 
when assets are lognormal this is only an approximation and, as shown here, it can be a poor 
approximation. 
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a useful way to look at the problem for any specific example involving n period 
compound growth is to map the admissible portfolios into the (o(G{fi}), g((fi])) 
plane, analogous to the Markowitz one period mapping into the (standard deviation, 
return) plane. Then examine the efficient frontier and decide what tradeoff of growth 
versus variability of growth you like. Professor Tom Cover points out that there 
is no need to invoke utilities. Adopting this point of view, we're simply interested 
in portfolios on the compound growth efficient frontier whether or not any of them 
happen to be generated by utilities. The Samuelson's preoccupation with utilities 
becomes irrelevant. The Kelly or maximum growth portfolio, which as it happens 
can be computed using the utility U(x) = log x, has the distinction of being at the 
extreme high end of the efficient frontier. 

For another perspective on Samuelson's objections, consider the three concepts: 
normative, descriptive and prescriptive. A normative utility or other recipe tells 
us what portfolio we “ought” to choose, such as “bet according to log utility to 
maximize your own good". Samuelson has indicated that he wants to stop people 
from being deceived by such a pitch. I completely agree with him on this point. 
My view is instead prescriptive: how to achieve an objective. If you know future 
payoff for certain and want to maximize your long term growth rate then Kelly does 
it. If, as is usually the case, you only have estimates of future payoffs and want to 
come close to maximizing your long term growth rate, then to avoid damage from 
inadvertently betting more than Kelly you need to back off from your estimate of 
full Kelly and consider a fractional Kelly strategy. In any case, you may not like the 
large drawdowns that occur with Kelly fractions over 1/2 and may be well advised 
to choose lower values. The long term growth investor can construct the compound 
growth efficient frontier and choose his most desirable geometric growth Markowitz 
type combinations. 

Samuelson also says that U(x) = —1/x seems roughly consistent with the data. 
That is descriptive, i.e., an assertion about what people actually do. We don't argue 
with that claim — it's something to be determined by experimental economists and 
its correctness or lack thereof has no bearing on the prescriptive recipe for growth 
maximizing. 

I met the economist Oscar Morgenstern (1902-1977), coauthor with John von 
Neumann of the great book, The Theory of Games and Economic Behavior, at his 
company, Mathematica; in Princeton, New Jersey, in November of 1967 and, when 
I outlined these views on normative, prescriptive and descriptive, he liked them 
so much that he asked if he could incorporate them into an article he was writing 
at the time. He also gave me an autographed copy of his book, On the Accuracy 
of Economic Observations, which has an honored place in my library today and 
which remains timely. (For instance, think about how the government has made 
successive revisions in the method of calculating inflation so as to produce lower 
numbers, thereby gaining political and budgetary benefits). 
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Proebsting's Paradox 


Next, we look at a curious paradox. Recall that one property of the Kelly Cri- 
terion is that if capital is infinitely divisible, arbitrarily small bets are allowed, 
and the bettor can choose to bet only on favorable situations, then the Kelly bet- 
tor can never be ruined absolutely (capital equals zero) or asymptotically (capital 
tends to zero with positive probability}. Here’s an example that seems to flatly 
contradict this property. The Kelly bettor can make a series of favorable bets yet 
be (asymptotically) ruined! Here’s the email discussion through which I learned 
of this. 


From: Todd Proebsting 
Subject: FW: incremental Kelly Criterion 


Dear Dr. Thorp, 


I have tried to digest much of your writings on applying the Kelly Crite- 
rion to gambling but I have found a simple question that is unaddressed. 
I hope you find it interesting: 

Suppose that you believe an event will occur with 50% probability and 
somebody offers you 2:1 odds. Kelly would tell you to bet 25% of your 
capital. Similarly, if you were offered 5:1 odds, Kelly would tell you to 
bet 40%. Now, suppose that these events occur in sequence. You are 
offered 2:1 odds, and you place a 25% bet. hen another party offers 
you 5:1 odds. I assume you should place an additional bet, but for what 
amount? 


If you have any guidance or references on this question, I would appre- 
ciate it. 


Thank you. 


From: Ed Thorp 
To: Todd Proebsting 
Subject: Fw: incremental Kelly Criterion Interesting. 


After the first bet the situation is: 

A win gives a wealth relative of 1 + 0.25 « 2 

A loss gives a wealth relative of 1 — 0.25 

Now bet an additional fraction f at 5:1 odds and we have: 

A win gives a wealth relative of 1 + 0.25 «2+ 5f 

A loss gives a wealth relative of 1 — 0.25 — f 

‘The exponential rate of growth g(f) = 0.5*In(1.5+5f) +0.5*1n(0.75— f) 
Solving g'(f) = 0 yields f = 0.225 which was a bit of a surprise until I 
thought about it for a while and looked at other related situations. 


From: Todd Proebsting 
To: Ed Thorp 
Subject: RE: incremental Kelly Criterion 


Thank you very much for the reply. 
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I, too, came to this result, but I thought it must be wrong since this 
tells me to bet a total of 0.475 (0.25 + 0.225) at odds that are on average 
worse than 5:1, and yet at 5:1, Kelly would say to bet only 0.400. 


Do you have an intuitive explanation for this paradox? 


From: Ed Thorp 
To: Todd Proebsting 
Subject: Re: incremental Kelly Criterion 


I don't know if this helps, but consider the example: 


A fair coin will be tossed (Pr Heads = Pr Tails = 0.5). You place a bet 
which gives a wealth relative of 1 + u if you win and 1 — d if you lose (u 
and d are both nonnegative). (No assumption about whether you should 
have made the bet.) Then you are offered odds of 5:1 on any additional 
bet you care to make. Now the wealth relatives are, each with Pr 0.5, 
l+u+5f and 1—d-— f. The Kelly fraction is f = (4 — u — 5d)/10. 
It seems strange that increasing either u or d reduces f. To see why it 
happens, look at the In(1 + x) function. This odd behavior follows from 
its concave shape. 


From: Todd Proebsting 
To: Ed Thorp 
Subject: RE: incremental Kelly Criterion 


Yes, this helps. Thank you. 


It is interesting to note that Kelly is often thought to avoid ruin. For 
instance, no matter how high the offered odds, Kelly would never have 
you bet more than 0.5 of bankroll on a fair coin with one single bet. 
Things change, however, when given these string bets. If I keep offering 
you better and better odds and you keep applying Kelly, then I can get 
you to bet an amount arbitrarily close to your bankroll. 


Thus, string bets can seduce people to risking ruin using Kelly. (Granted 
at the risk of potentially giant losses by the seductress.) 


From: Ed Thorp 
To: Todd Proebsting 
Subject: Re: incremental Kelly Criterion 


Thanks. I hadn't noticed this feature of Kelly (not having looked at 
string bets). To check your point with an example I chose consecutive 
odds to one of An : 1 where A, = 2", n = 1,2,... and showed by 
induction that the amount bet at each n was fn = 305D /4n (where ^ 
is exponentiation and is done before division or multiplication) and that 
sum{fn : n = 1,2,...} = 1. 

A feature (virtue?) of fractional Kelly strategies, with the multiplier less 
than 1, eg. f = c'f(kelly), 0 < c < 1, is that it (presumably) avoids 
this. 
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In contrast to Proebsting's example, the property that betting Kelly or any fixed 
fraction thereof less than one leads to exponential growth is typically derived by 
assuming a series of independent bets or, more generally, with limitations on the de- 
gree of dependence between successive bets. For example, in blackjack there is weak 
dependence between the outcomes of successive deals from the same unreshuffled 
pack of cards but zero dependence between different packs of cards, or equivalently 
between different shufflings of the same pack. Thus the paradox is a surprise but 
doesn't contradict the Kelly optimal growth property. 
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CONCAVE UTILITIES ARE DISTINGUISHED BY THEIR 
OPTIMAL STRATEGIES 


E. THORP — R. WHITLEY 


|l. INTRODUCTION 


Mossin [5], Thorp [7], and Samuelson [6] showed for spe- 
cific pairs of utility functions that different utilities can lead to different 
optimal strategies. In particular the optimal investment strategy for the 


utility logx is not necessarily the optimal strategy for the utility E x? 
(y # 0). 


These examples suggest the following generalization, of obvious im- 
portance to general utility theory. 


Consider a T stage investment process. Át each stage allocate re- 
sources among the available investments. Each chosen sequence 4 ofal- 
locations ("strategy") yields a corresponding terminal probability distribu- 
tion FA of assets at the completion of stage 7. For each utility func- 
tion U(-), consider those strategies A*(U) which maximize the expect- 
ed value [UGoaF;. (x) of terminal utility. Assume sufficient hypotheses 


- 813 - 
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on U and the set of FA so that the integral is defined and that further- 
more the maximizing strategy A*(U) exists. Then is it true in general 
that A*(U,) is not A*(U,) for "distinct" utilities U, and U,? 


As we now show, the answer is yes: the Mossin — Thorp — Samuelson 
results for specific utility pairs generalizes to the principal class of interest 
in modern utility theory. 


2. THE MAIN THEOREM 


We prove this for the class of "interesting" concave utilities. We be- 
gin with more special hypotheses. 


Theorem 1. Let U and V be utilities defined and differentiable 
on (0,9) with U'(x) and V'(x) positive and strictly decreasing as x 
increases. Then if U and V are inequivalent, there is a one period in- 
vestment setting such that U and V have distinct sets of optimal strat- 
egies. Furthermore, the investment setting may be chosen to consist only 
of cash and a two-valued random investment, in which case the optimal 
strategies are unique. 


Corollary 2. Zf the utilities U and V have the same (sets of) opti- 
mal strategies for each finite sequence of investment settings, then U and 
V are equivalent. 


Two utilities U, and U, are equivalent if and only if there are 
constants a and b such that U(x) = au, (x)+ b (a> 0), otherwise 


U, and U, are inequivalent. 


Let X, (1<i<k) be the (random) outcome per unit invested in 
the ith "security". We call (X,,... ,X,) the investment setting. We as- 
sume X; is independent of the amount invested. Let the initial capital be 
Zo and let the final capital be Zi. A strategy is an allocation W — 


=(w,,...,W,) where w, is the fraction of Zə allocated to security i. 


pv 
We assume w,> 0 for all i, that 2, w, = 1, and that wealth is infinite- 
I 


ly divisible. Thus the w, may assume any real values consistent with the 
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constraints and with the requirement that 2 wX; is in the domain of 
the utility function U. i 


Given a particular U satisfying the hypotheses of the theorem, sup- 
pose EU(Z,(W)) is maximized by some strategy W*. Then W* isan 
optimal (or best) strategy for U relative to the given investment setting. 


Proof of Theorem. Suppose that U and V have the same optimal 
strategies for every one period investment setting consisting of cash and a 
two-valued random investment. It will be shown that U and V are equiv- 
alent, which will establish the logical contrapositive to the theorem and 
hence the theorem itself. 


In the proof of theorems we shall assume for technical simplicity 
that the initial capital Zo = ]. When theorems have been established for 
this case, consideration of the transformation U,(s) = U(Z)5) = U(t) 
gives the theorems for arbitrary Zo > 0. We shall therefore state the gen- 
eral results without further comment after proving the Z, = 1 case. 


Let the only investment (besides cash) be X where P(X = 1 — b)= 
=q=1-—p and P(X=1+a)=p, where a>O and O«p,b« I. 
The choice 0< b< 1, rather than simply b= 1, has been made because 
for b=1 and w= 1, the expression U(0) would arise and O is not 
necessarily in the domain of U (e.g, U(x) = logx). The available strat- 
egies are to allocate the fraction w of recources to X and 1—w to 
cash, with Ox wx 1. f 


At the end of the period, we have 
(2.1) EU(Z, (w))= pU(1 + aw) + qU(1 — bw) = Rw). 


To find the maximum, consider f'(w)= apU'(1 + aw) — bqU'(1 — bw). 
Since U'(f) strictly decreases as £ increases, we have f'(w) decreasing 
strictly as w increases. Thus there is a unique maximum. If f ‘(w*) = 0 
for some w* with Ox w*« 1, then the maximum is at this unique w*. 
If fw)» 0 forall w with O<w<1, then the unique maximum is 
at w= l. If instead. f'(w) « O for O«w« 1, then the unique maxi- 
mum is at w= 0. 
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, - U u + aw) | og 
If f'(w)=0 we have U'(1— bw) ap Suppose a» 0 and 
, a 
U'(1* 3) 


l . 

z5) = 0. Letting amr 
U'(5) 

we can solve r= d for p, with O«p« 1. Thus for each a» 0 


i«b«1 are given and we wish f'( 


there is a choice of p, hence an X, such that w* — 3, is optimal for 


U. 


Now suppose that U and V have the same optimal strategies for 


all such investment settings. Then w* = = for V also and we have 


U(1-3) vi(i+4 
UU +a) ls a) for all a» 0. Letting V' (4) = aU' (5) 
U'(5) v(a) 


we find V'(t) - aU'(t) (t> 1) whence V(t) = aU(t) +B (t> 1). 


When t« 1, we proceed similarly. Choose X so that P(X 2 2)= p 
and P(X—-1—5)-2q, where 0<b< 1. Then 


EU(Z, (w)) = pU(1 + w) + qU(1 — bw) = fw) 
f'(w) = pU'(1 + w) — bqU'(1 — bw) 
and the maximum is unique and located as before. 


U(1—-aw) p 
U'((14 w) aq 
0<b< 1, we can choose p with O«p«]1 such that d= T. Then 


If f'(w)2 0 we have A= and given w= b, 


as before we find V'(1 — ab) = yU'(1 — ab) and since a and b can be 
any numbers such that Oca, b< 1, then V'(t) = yU'(t) (O«€ t« 1) 
V'(14 b) 
where y= UGF by 
Thus V(t)-2 aU(t) - 8 (O<t< 1). Also V(1) = «U(1) * e. Hence- 
V(t) — «U(t) 2 8 if t» 1, 6 if t<1 and e if t=1. But V(t) — 
—aU(t) is continuous so B=65=e so V(t) —- aU(t) * B. Thus U and 


But y was shown to be a. 
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V are equivalent under the assumption that they have the same optimal 
strategies for all one period investment settings containing only (cash and) 
a two-valued random investment. The logical contrapositive assertion is the 
Theorem. This completes the proof. The Corollary follows a fortiori. 


Note that a single investment setting of the type in the proof will not 
in general distinguish inequivalent utility functions. For instance, if E(X) « 
«O0 then w= O is the unique optimal strategy for all the utilities of 
Theorem 1 (more generally, for all strictly concave utilities, as defined be- 
low) so such X distinguish between none of these utilities. It may be of 
interest to characterize each investment setting by the pairs of utility func- 
tions it distinguishes between or "separates", and to similarly characterize 
collections of investment settings. 


For a security X, let m(X) and M(X) be the greatest and least 
numbers, respectively, such that P(m(X) < X < M(X)) - 1. Then for a 
collection C of investment settings whose securities are Ux «€ A}, 
where A is some index set, let m, = inf {m(X,): acA} and M, = 
= sup{M(X,): QEA}. Evidently, if U(t) - V(t) for m, < t<M,, the 
collection C will not separate U and V. Thus a collection with m, = 
= 0 and M, =% will be needed in general to prove the conclusion of 
Theorem 1. 


Next we generalize Theorem 1 to concave non-decreasing utilities de- 
fined on (0, œ). We do not make the common assumption that first or 
even second derivatives exist. A function f is concave on an interval J 
if for each pair of points x, #x, in I and each number s with 
O«s«l, then Asx, * (1 —5)X4)2 sflx,) + (1 — sf). If fcx, + 
t(l- $)x4) > sfx) +(l— Kx) always, then f is strictly concave. 
(We use "concave" to mean "concave from below”.) 


The more general definition includes such computationally and empir- 
ically natural functions as the "polygonal" utilities. In these, the utility is 
a sequence of linear segments. The vertices are such that the function líes 
on or below each segment extended, and the ordinates of the vertices in- 
crease as the abscissas increase. 
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First, recall some facts from the elementary theory of concave func- 
tions. (Most texts give results for convex functions. But f is concave ex- 
actly when —f is convex so the theories of concave and convex functions 
are equivalent.) A concave function is either continuous in the interior of 
its domain or non-measurable. An increasing function is always measurable 
so our utilities are continuous. A continuous concave function f defined 
on an open interval has a left derivative f' and a right derivative f; 
defined everywhere. (If the left endpoint a is included in the interval of 
definition, then f’ (a) is not defined and f; (a) may or may not be de- 
fined. Similarly, if the right endpoint b is included in the interval of de- 
finition, then f, (b) is not defined and f'(b) may or may not be de- 
fined.) Furthermore, f' (0 2f.) for all ¢ except the endpoints in 
the domain of f and whenever t, <t}, then f' (t,)> f(t.) and 
f, (t) > f, (t,). There are at most countably many points where f' (@)> 
> £1.09; otherwise f' (= fLO= f'(t) and f is differentiable. Proofs 
of these assertions and further theorems on concave functions are given for 
instance in Hardy, Littlewood, Polya [3]. 


Theorem 3. Let U and V be concave utilities defined on (0, «), 
one of which is strictly increasing on (0, 1 + e) for some e» 0. If U 
and V are inequivalent then there is a one period investment setting such 
that the sets of optimal strategies for U and for V are distinct. The in- 
vestment setting may be chosen to consist only of cash and a two-valued 
random investment. If U and V are each strictly concave on the same 
one Of the sets (0, Zo] or [Zo œ), then the optimal strategies are unique 
and U and V therefore have distinct optimal strategies. 


Proof. We proceed as in the proof of Theorem | until we obtain 
equiation (2.1). 


Note that f is concave and that if U is strictly concave on either 
(0, 1] or [i, œ) then f is strictly concave. Now f(w) is a continuous 
function defined on the closed bounded set (w: O< w< 1} hence f 
has an absolute maximum. Let w* be a point where f attains its max- 
imum. It follows from the continuity of f that the set of all such w* 
is closed. 
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From the concavity of f, the set of points w* where f attains its 
maximum is also convex, hence it is a closed interval in [0O, 1]. If f is 
strictly concave the maximum is unique. 


For any w* with O« w*« 1, f is a maximum if and only if 
f'(w*»z 02 f. (w*). A maximum occurs at w* = 0 if and only if 
f, (0) & 0. A maximum occurs at w* — ] if and only if f' (1) 2 0. If 
the maxima occur on an interval [a,b] with O<a<b< 1, then 
f (a> 0 and f. (a)= 0, f' (D) 2 O and fb) < 0, and f'(w*) ex- 
ists and is zero for a < w* < b. 


Equation (2.1) yields 


f! (w) = apU' (1 + aw) - bqU' (1 — bw) > 
(2.2) ' 
> apU, (1 + aw) — bqU. (1 — bw) =f (w). 


Since U’ (t) and U. (f) are non-increasing as ¢ increases, it follows 
from equation (2.2) that f' (w) and f, (w) are non-increasing as w in- 
creases. 


Let c be such that O«c« b and U'(1—c) and V'(1—c) are 
defined. This is possible because U’ and V’ are both defined except at 
countably many points hence there are uncountably many pointsin (0, 1) 
where both U' and V' exist. With a and b already given, choose 


w= A Consider now the case where U’ (1 t 5) >0. Then we may 


choose p with O«p« 1 in equation (2.2) so that f^ (5) = 0. This 
means fA) <0 and since w= D is not an endpoint of [0, 1] this 


, thus is optimal for U in the 


o^ 


means f attains its maximum at 
given investment setting. 


oo 


eo 


Since U and V have the same optimal strategies, w= 7 is opti- 


e 


mal for V hence V attains its maximum there so for w= $ 


g (w) = apV' (1 + aw) - bqV' (1 bw) 20 and apV, (1 + aw) — 
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— bq FR — bw)= e (w)<0. Note that g' (5) 20 and the fact 


V'(1—c)» 0 implies that V' (1 t F) > 0. We may show similarly 


that if V' (14 A) »0 then U' (14 F) >0. Since a is chosen in- 
dependently of b and c this means that for each t> 1, U' ()»0 
if and only if V' (2) » 0. But this is readily shown to be equivalent to 
the statement that (t: U(t) = sup U(t)} = (t: V(t) = sup V(t)), ie. that 
if either U or V become horizontal for t> e» 1 then they both be- 
come horizontal for t> e» 1. For t»e, we have of course U'(t) = 
= V'(t) 20. For t<e, the argument continues as follows. 


From f' (w)=0, apU" 1 + 3 = bqU'(1— c), noting that 


A a du 
U' (1 — c) = U'(1 — c). Thus “o aD From g’ (w)>0, it 
i ac 
PES] bq V'(1 — c) 


follows similarly that “Piao 2 um Letting a= Ip yields 


V (1 + S] >a" (1 + a Since the choices of b and c were in- 


dependent of that a, the result holds for all a» 0, therefore V' (t) > 
2 aU’ (t) forall £» 1. 


A similar argument shows that Veit) « aU. (t) forall > 1. Thus, 
except for at most countably many points, V'(t) - aU'(t) for t» 1. 
Now U and V are readily shown to be absolutely continuous on any 
closed subinterval of (1,99), as a consequence of the fact they are con- 
tinuous, concave, and non-decreasing, thus V — aU is absolutely contin- 
uous. The absolute continuity of U — aV and the fact that (V — aU) = 
— 0 almost everywhere implies that V —oU-— 6, a constant 
(Goffman [2], p. 242, Prop. 12). 


A similar argument shows that V(r) = aU(t) * y for t< 1. The 
role of 2 in the proof of Theorem 1 is played by any number c such 
that 1 «c« e and U'(c) and V'(c) are both defined. One then shows 
as in the proof of Theorem 1 that V(A= aU(t) - 8 for O<t<~. We 
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have established the contrapositive assertion as in the proof of Theorem 1. 
This completes the proof. 


The hypothesis that either U or V (hence both, from the proof) 
is strictly increasing for a positive distance to the right of 1 is required. 
If instead U and V are merely concave and non-decreasing, the conclu- 
sion of Theorem 3 need not hold. For instance, let U(t)= V(t) - 0 if 
t2d, where 0<d< 1. Let U(t) and V(t) each be extended to 
(0, d) so that they are continuous, concave, and strictly increasing on 
(0, d). Then all such utilities have the same optimal strategies, yet many 
pairs are inequivalent. 


To obtain an inequivalent pair, let U(t) - t —d if 0<t<d and 
let V(r) = —(t — d)?. If for some constants a and f, VWA = altt) + f 
then V'(r) - «U'(r. But V'(r) = -2(t-d) ¥a= aU (ft). 


To see that all such utilities U have the same optimal strategies, note 
that W=(w,,...,W,) is optimal for the investment setting (X,,. 

, X,) if and only if P(2 wx, »d)-1, in which case EU(Z,(W)) = 
= 0. If instead P(Z wa, « d)» O0 then for some e> 0, P(2 wx, < 
<d-e)=6>0. Then EU(Z,(W)) < 5U(d — e) «0 so W is noi optimal. 


3. OTHER SEPARATING FAMILIES 


We next establish the conclusion of Theorem 1 using investment set- 
tings with n points in their range. We determine the effect of varying the 
payoffs (x,,... TOM and their probabilities (p,,..., Pa) separately: 
One surprising conclusion (part (b)) can be stated in terms of an example. 


Suppose X consists of betting on a wheel of fortune divided into red, 
3 


white and blue sectors, with payoffs of $, 3, and 4 respectively. Then 
if U and V are inequivalent on [5. i] the areas of the sectors may 
be chosen so U and V have distinct optimal strategies. But if the wheel 


: 1 3 
is divided into just red and blue sectors, with payoffs of j and PL then 
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there are two inequivalent utilities on [4. 3] which have the same opti- 


mal strategies for every choice of areas for the two sectors. 


Theorem 4. Suppose U and V are increasing strictly concave util- 
ities on (0,9). Let X bea random variable with outcomes 0< x,< 
SXI S... SX, With xy <1 and x, > 1. Suppose P(X 2 x) 7 p,> 0, 


(a) Let n and the Dj be given. If U and V have the same opti- 
mal strategies for each X (i.e. X1,--+-,X, vary), then U and V are 
equivalent. 


(b) Let n and the x, be given. Suppose U' and V" exist and 
are continuous at 1. If U and V have the same optimal strategies for 
each X (ie p,,...,p, vary) and at least three x, are unequal to |, 
then U and V are equivalent on [Z9X,, ZyX41- If exactly two of the 
x, $ are unequal to one, there are utilities U and V which are not equiv- 
alent on |Zgx,, Zgx,], but which have the same optimal strategy for 


0^1" 
each X. 


Proof. Assume Zo =|. Let R= X-—1I1 and r=x,— 1, Then in- 
vesting w in X gives an expected return (with respect to U) of 


n 
E(UwR + 1))= 2 p,U(wr, * 1). Each function U(wr, + 1) is differen- 
i=] 


tiable except at a countable set C, of points, so except for w in the 
countable set C, U...U C, the expectation E(U(wR + 1)) is differen- 
dE(U(wR + 1) 
dw 
function V(wr, + l) is differentiable except at a countable set. Thus, ex- 


cept at a countable set D of points in [0,9] both E(U(wR+ 1)) and 
E(V(wR + 1)) are differentiable functions of w. They are also strictly 
concave functions of w. 


n 
tiable at w with = >, p;r;U (wr, + 1). Similarly each 
i=1 


For part (a) let po, be given and choose wg in (0, 1) — D. 
Consider the vectors & = (U'(Wor, + 1),..., U'(wor, t1) and B= 
= (Viwor, *10,..., V'(war, + I). Suppose that the non-zero vector 
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y= (e, €5,...,C,) is perpendicular to o, ie. the inner product 


Cj 


(a, y) = 0. Choose r= with e = max 2 l<i<n, 


a 
pe Zu] i 
jaro? 


Since each component of @ is positive, some c; 2 0 and some c < 0, 
hence some x,» ] and some x < l. Also r, + 1= x,> 0. Then 


dE(U(wR + 1)) 


n 
dw lu - wg = 4 pr, U (wgr, + 1) = 0 and E(U(wR + 1)) 


has a maximum at wg. By hypothesis E(V(wR + 1)) has a maximum 
dE(V(wR + 1) 


at wọ and, since it is differentiable there, dw lu - wg = 


n 
= 2 bj, V'(wor, +1)=0 ie, (B,y)- O0. Hence the set of vectors per- 


pendicular to « is also perpendicular to B which implies that 6 = ao. 

Since the components of @ and f are non-negative, a> 0. Equating 

components 

(3.1) U'(wgr, * 1)- aV'(wor, + 1) 

where a is a non-negative function of Tyo+++,%, and wy. Since U 

and V are strictly concave there is a point fọ not in D, wo € ty <i, 

with V'(t9) » 0 and U"(ty) > 0. Choose r, sothat wor, + l= 7E 

choose r, 2 0 with t= wor, +1 notin D, and choose r, <...< r 

so they are not in D. Then U'(tg) = Ars. 66s Paa Wo)V Co) and 

U(ty) 

U'(wyr, tl) alr, PE wa) (war, +1). Thus a= TG? 0 is 

constant. So V'(t) - aU'(t) for any t£» 1 notin D. Since V and U 

are absolutely continuous on any closed subinterval, V(t) = aU(f) + b 

for all ¢> 1. A similar argument shows that V(t) = cU(t) +d for t< 1 
V'(t.) 

with c= UG =a. The equivalence of U and V now follows (as in 

0 
the proof of Theorem 1) from their continuity. 


For part (b) suppose that the x; are given, with 0< Xj € XX... 
< Xy «1x Xo+] L. Ll Xa We proceed as before, but now consider, 
for 0< wg <1 and U, V differentiable at Wor; +1, I<j<n, the 
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vectors a= (7, U'(war, + 1),...,7, U'(wor, + 1)) and B= 
- (nr V'(wgr, +1),...57, V'(wor,, + 1). Since @ has both positive and 
negative components there is a vector (d,,d,,...,4,) perpendicular to 


a with each d,» 1. Choose p= —,——, thus p,>0 and Zn- l, 


n 


2 d, 
i=1 


and define X by P(X = x,)= p, Thus 


~ n d 
0= (a, (d,,d,,..., d) - pA -— rU'(wor, + 1) = 
d, 
i=] 
- JECUWR + 1) 
T dw w=Wwo 
.. dE(U(wR + 1)) (8, (d, ..., d) 
By hypothesis — lw ew = ——————————— = 0 so 
dw ° Ža 
i 
(B, (d,,...,d,)) - O. Suppose that y = (e,,6€5,...,€,) is perpendicu- 
~ e; + dod, 
lar to a. Let dj > max|e,| and choose p, = —, . Note that 
2 ej * dod, 
i=l 


n 
p,» 0 and 2 p,= 1. Thus 


dE(UCwR + D) 24 
dw ly - wo = 4 piln; U (wor, + 1)] 
n 
and letting D = P? (e, + dyd,) gives 
n 
Dez er, U' (wor, + 1) + ld, 2 dy U' Qn, +1)= 


] ~ l ~ 
=5.( y) * p lA (d, ..., dp DEO. 
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dE(ViwR + D) 

dw Ww=Wo 
5 d,)) = (6, y) = 0. Thus 


Hence, = 0. This yields Bl y) + d (B, (d,,... 


(3/2) U'(wgr, + 1) = aP s P4... p, W)V(wgr;, + D. (0&i&n). 


For w in (0,1) with U,V differentiable at wr, tl í(l&isn) we 
have (3.2) with wọ =w. Consider the quotient 


U'(wr, 1). 
(3.3) h(w) = ap... Pao”) = Y Gr 3 1) (l<i<n). 
I 


First look at the case where at least three x,’s are unequal to one. Sup- 
pose that x, «1«x, ,«x,; the proof where two or more points fall 
to the left of 1 is similar. 


Let q,(w) = wr, + 1. The countable collection of functions 
(U, V, Uo ppt, Vogg’, Uo get? yo Rts Voi ov, 2 Pe 
Uo P oq, Lou qu Lou Vo oq, Lov, o VLL? 
op 1 _..} is simultaneously differentiable except at a countable set of 
points Dy in (0, 1). 


Choose ¢ in (1,x,) — Dy and write t= wr, *l,sow- 
=p l(t), and set f, = w,r, , + 1 e, ,[0, 1 (0). We can also write 
t= Wor, +l; w, qul). By (3.3) we have h(w,) = h(w,), since 


U and V are differentiable at wj and w,. Note that w, <w,, in 


r 
fact, w, = Aw, with A= --l Setting ty Wor, t l= 4 4093), 


n 
yr! _ . 
l = W,r, +1 and w3 = 9, (t,). Then h(w,) = h(w4) since U and 


V are differentiable at w, = y7 low, , ov, (A and at w, = e, ! 


o ° Pn- 1°, O Pnl? "r l(f). Continuing inductively f = wr, ; + 1= 


Wi 7, + l and w, = Àw,. Iterating this equation w,, , = Nw, >0 
as j> œ, thus A(w, P" -= h(w, )> h(l) since U' and V' are con- 


"UD 


tinuous at 1. Hence the equation Wp = h(1) holds except for count- 


ably many ¢ in (1,x,) and thus, since U and V are absolutely contin- 
uous on any closed subinterval, U(t) = KADA * c forall ¢ in [1,~,). 
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Let : belong to (Qi; 1) with U and V differentiable at wr, + 1, 
1<j/<n. Then ¢t=wr, +1 and from equation (3.3) pe = 

U'(wr, + 1) 
~ V'(wr, + 1) 
closed subintervals of (x,, 1), U(t)= h(1) V(t) + d. The continuity of 


U and V at 1 implies that c — d and thus U and V are equivalent 


= h(1). Since U and V are absolutely continuous on 


on [x,,x,]. 


To complete the proof we must consider the case where there are on- 
ly two xps distinct from one, say, 0 <x <1<x,. Let gy be any 
non-constant positive function on [1,x5] with a continuous derivative 
which is zero at 1. Define g on [X,; 1] by giwr, + D- g(wr, * 1) 
for O« w«l. Choose a sothat max  Ig'(D)|—a* min |g(D|« 

x1«t&x2 xQ&t&x2 


I 


t eat t 
<0 and define U(t)= fe “dt = > and Vit) = fem g( dt. 
0 0 


Because U"(t)= —ae^ ^ « 0 and V"(t) =e *(g'() — ag(t))< 0, U 
and V are strictly concave. Also U'(t)=e~% and V'(t) — e^" g(t) are 
positive so U and V are strictly increasing. Clearly U and V are not 
equivalent on [x,,X,]. 


For these two functions U and V and O«w« I, 


dECEQWR + D) =r pi V wr, + 1) t+ rp,V(wr, + 1) = 
— rp,gwr, + 1)U'(wr, + 1) + rp gwr, + 1)U'(wr, + 1) = 


dE(UC(wR + 1)) 


= g(wr, + 1) dw 
Hence dE(UwR + D) _ O if and only if dE(VQWR + 1» _ 0, and so 
dw dw 


w., (0< w, <1), is an optimal strategy for U (with respect to X) if 
0’ 0 , 
pu . ._,.. dE(U(wR + 1) 
and only if it is an optimal strategy for V. If the derivative — —dw 
is never 0, the equation above shows that it has the same sign as 


dECVQWR + D) a + D). so O (or 1) isan optimal strategy for U if and only 
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if it is an optimal strategy for V. 


We have seen that U and V are two utilities on [x,, X5] which 
are not equivalent, but which have the same optimal strategies for all ran- 


dom variables with outcomes Xi and Xo: 


Remark. Our proofs may be modified readily to prove the theorems 
when U and V are defined on the closed interval [0, œ} and also when 
the interval is (c,) or [c,), with c< Zo: Presumably c> 0. (Al- 
ternately, the [c,œ) result implies the (c,œ) result: if U(x)= V(x) on 
every interval [ce + e,9) (0«e« Zo —c) then U(x) - Vix) on (c,~).) 


4. QUESTIONS FOR FURTHER INVESTIGATION 


Friedman — Savage [l] and Markowitz [4] have shown 
that utilities which are not everywhere concave are of interest. This leads 
us to a question which we have not been able to answer yet: 


Is the class of utilities which are continuous and strictly increasing 
(and differentiable everywhere, bounded, and even strictly positive deriva- 
tive, if you like) distinguished by their optimal strategies? 


In the real world factors such as human error, the discreteness of as- 
sets and monetary units, etc. make it in general not possible to choose the 
optimal allocation W* = (wf,..., wg). The continuity of the utility in 
conjunction with boundedness of the attainable utilities implies that "suf- 
ficiently small” deviations from W* will ensure that the realized utility 
is "close" to the optimum. 


One feels as well that in the real world, the exact values of the utility 
function should not be critical. In other words, if two utility functions are 
somehow "close," the consequences of choosing one rather than the other 
should be "close." 


What should it mean for two utility functions to be "close? " First, 
observe that we must define closeness not for functions, but for equivalence 
classes of functions. Let U be a utility. The equivalence class of U, writ- 
ten [U], is the set (V: V2 aU * B, a7 0). For the class 8 of bounded 
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utilities, i.e., M(U) = sup U(t) < e, m(U) s inf Ut) » —«, we suggest 
U—M 
7 7 M-m 
Note that M(U)=1 and m(U)= 0. Then the "closeness" of U and 
V, ie, of [U] and [V], isdefined to be sup [Ms — V(t)) and written 
either d(U, V) or d((U, [V] or d U, P. 


that each (U] equivalence class be represented by Ü- + l. 


We now show that U and V can be "close" yet the optimal strat- 
egies for U and V need not be. For n 22, let Ü and P be de- 
fined as follows: 
= 


n , 


p. 2nt . l , 
U(n-zlrp-l if Ox ti and | if ¢>14+ 


_ 2n-] . l 
V(t) = Tai tT! if O<t<i+e, 
ttn=3 ig j4d<t<2, and 1 if (22. 
n-li n 
Then «(U,, P))- l Now choose an investment setting consisting 


n 
only of cash and the security X, where P(X = 1- e)=q, P(X21-*a)- 
=p, i< a< 1l, and O<e,p,¢<1. Assume Z,- 1. A calculation 
(2n — l)(n— 1) 

n+l 


shows that if ap > qe , then the unique optimal strategy 


for U, is w*= = and for V, the unique optimal strategy is w* = 1. 


Thus for any 6» 0 we can construct sequences ÜU, and V such 
that «U,, 7 )>0 as n>% and (w*(P) —w*(U )|2 1-8, where 
w*(0) means an optimal strategy for Ü. 


Even though a small "error" in the utility function can lead to a large 
change in optimal strategy, it can only lead to a small change in conse- 
quences, in the following sense. (We use the abbreviation U(W) for 


EU(Z, 2 w,X,). Thus for each W, U(W) is a number and 
H 


U(Z, Ziw,X,) is a random variable.) 
H ` 
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Lemma. If av, P) is ” small,” then Owe CP = Awd) and 
Yw*(0)) = Vaw*( V, ie, if U and V are "close," an optimal strate- 
gy for one is " nearly optimal" for the other. 


Proof. Let a, P «€ so V(t) tem Ot). Then for any alloca- 
tion W, P'(Z, 2 wx) * e» U(Z, Zw) and E(P(Z, 2 w,X;) + 


+ e) = E(P(z, Zwx)ee > EŬ(Z, Z wx), or PW) e» Hw). 


Interchanging Ü and P inthe argument yields Tw) te XW) so 
| Ütw) — Yw) I< e. The choices for W of w*(0) and w*(P yield 
the conclusion of the lemma. 


The lemma and the example show us what may happen if we replace 
a U by a nearby V which inay have more desirable properties, such as 
differentiability (of various orders), strictly increasing, etc.: The optimal 
strategies may change drastically but the maximum utility over all strate- 
gies changes only slightly. 


Note added in proof: The authors have since extended the central re- 
sult of the paper, Theorem 3, as follows. 


Theorem. Let U and V be continuous non-decreasing functions 
defined on an arbitrary interval I of the real line. Then if U and V 
are inequivalent, there is a one-period two security investment setting such 
that U and V have distinct optimal strategies if either (a) U and V 
are in the class of all functions which are either concave or convex, or 
(b) U and V are in the class of all functions with a second derivative 
which exists and is continuous, except perhaps for a set of isolated points. 


Thus the Theorem includes the utility functions generally encount- 
ered. 
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Medium Term Simulations of The Full Kelly and Fractional 
Kelly Investment Strategies 


Leonard C. MacLean* Edward O. Thorp! Yonggan Zhaotand William T. Ziemba$ 


Abstract 


Using three simple investment situations, we simulate the behavior of the Kelly and fractional 
Kelly proportional betting strategies over medium term horizons using a large number of sce- 
narios. We extend the work of Bicksler and Thorp (1973) and Ziemba and Hausch (1986) to 
more scenarios and decision periods. The results show: 

(1) the great superiority of full Kelly and close to full Kelly strategies over longer horizons with 
very large gains a large fraction of the time; 

(2) that the short term performance of Kelly and high fractional Kelly strategies is very risky; 
(3) that there is a consistent tradeoff of growth versus security as a function of the bet size 
determined by the various strategies; and 

(4) that no matter how favorable the investment opportunities are or how long the finite horizon 
is, a sequence of bad results can lead to poor final wealth outcomes, with a loss of most of the 


investor's initial capital. 


1 Introduction 


The Kelly optimal capital growth investment strategy has many long term positive theoretical 
properties (MacLean, Thorp and Ziemba 2009). It has been dubbed “ fortunes formula" by Thorp 
(see Poundstone, 2005). However, properties that hold in the long run may be countered by 
negative short to medium term behavior because of the low risk aversion of log utility. In this 
paper, three well known experiments are revisited. The objectives are: (i) to compare the Bicksler - 
Thorp (1973) and Ziemba - Hausch (1986) experiments in the same setting; and (ii) to study them 
using an expanded range of scenarios and investment strategies. The class of investment strategies 
generated by varying the fraction of investment capital allocated to the Kelly portfolio are applied 
to simulated returns from the experimental models, and the distribution of accumulated capital is 
described. The conclusions from the expanded experiments are compared to the original results. 
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2 Fractional Kelly Strategies: The Ziemba and Hausch (1986) 


example 


We begin with an investment situation with five possible independent investments where one wagers 
$1 and either loses it with probability 1 — p or wins $(O + 1) with probability p, where O is the 
odds. The five wagers with odds of O — 1,2,3,4 and 5 to one all have expected value of 1.14. The 
optimal Kelly wagers are the expected value edge of 1496 over the odds. So the wagers run from 
1496, down to 2.896 of initial and current wealth at each decision point. Table 1 describes these 
investments. The value 1.14 was chosen as it is the recommended cutoff for profitable place and 
show racing bets using the system described in Ziemba and Hausch (1986). 


Win Probability | Odds | Prob of Selection in Simulation | Kelly Bets 


Table 1: The Investment Opportunities 


Ziemba-Hausch (1986) used 700 decision points and 1000 scenarios and compared full with half Kelly 
Strategies. We use the same 700 decision points and 2000 scenarios and calculate more attributes 
of the various strategies. We use full, 3/4, 1/2, 1/4, and 1/8 Kelly strategies and compute the 
maximum, mean, minimum, standard deviation, skewness, excess kurtosis and the number out 
of the 2000 scenarios that the final wealth starting from an initial wealth of $1000 is more than 
$50, $100, $500 (lose less than half), $1000 (breakeven), $10,000 (more than 10-fold), $100,000 
(more than 100-fold), and $1 million (more than a thousand-fold). Table 2 shows these results 
and illustrates the conclusions stated in the abstract. The final wealth levels are much higher on 
average, the higher the Kelly fraction. With 1/8 Kelly, the average final wealth is $2072, starting 
with $1000. Its $4339 with 1/4 Kelly, $19,005 with half Kelly, $70,991 with 3/4 Kelly and $524,195 
with full Kelly. So as you approach full Kelly, the typical final wealth escalates dramatically. This 
is shown also in the maximum wealth levels which for full Kelly is $318,854,673 versus $6330 for 
1/8 Kelly. 
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Kelly Fraction 
Statistic 1.0k 0.75k 0.50k 0.25k | 0.125k 
Max 318854673 | 4370619 1117424 27067 | 6330 
Mean 524195 70991 19005 4339 | 2072 
Min 4 56 111 513 587 
St. Dev. | 8033178 | 242313 41289 2951 650 
Skewness 35 11 13 2 1 
Kurtosis 1299 155 278 9 2 
>5 x 10 1981 2000 2000 2000 | 2000 
10 1965 1996 2000 2000 | 2000 
>5 x 10? 1854 1936 1985 2000 | 2000 
> 103 1752 1855 1930 1957 | 1978 
> 107 1175 1185 912 104 0 
> 10° 479 284 50 0 0 
>10 111 17 1 0 0 


Table 2: Final Wealth Statistics by Kelly Fraction: Ziemba-Hausch (1986) Model 


Figure 1 shows the wealth paths of these maximum final wealth levels. Most of the gain is in the 
last 100 of the 700 decision points. Even with these maximum graphs, there is much volatility in 
the final wealth with the amount of volatility generally higher with higher Kelly fractions. Indeed 
with 3/4 Kelly, there were losses from about decision point 610 to 670. 


x 10 Full Kelly 
4 Tt T T T T 
2r J 
0 d L 1 L 
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x 10° 3/4 Kelly 
5 T m T T T T 
0 —t 
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x 108 1/2 Kelly 
2 <7 T Y T C T T 
eek tee a = o 
0 : i 
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x 10 1/4 Kelly 
4Ar- a — aE —T T T 
er d 
0 —— L 4 H xk 
0) 100 200 300 400 500 600 700 800 
1/8 Kelly 
10000, ns iii T a T Y T T 
5000r 4 
oc L i r 4 
0 100 200 300 400 500 600 700 800 


Maximum Wealth Path 


Figure 1: Highest Final Wealth Trajectory: Ziemba-Hausch (1986) Model 


Looking at the chance of losses (final wealth is less than the initial $1000) in all cases, even with 
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1/8 Kelly with 1.1% and 1/4 Kelly with 2.15%, there are losses even with 700 independent bets 
each with an edge of 14%. For full Kelly, it is fully 12.4% losses, and it is 7.25% with 3/4 Kelly and 
3.5% with half Kelly. These are just the percent of losses. But the size of the losses can be large as 
shown in the >50, >100, and >500 and columns of Table 2. The minimum final wealth levels were 
587 for 1/8 and 513 for 1/4 Kelly so you never lose more than half your initial wealth with these 
lower risk betting strategies. But with 1/2, 3/4 and full Kelly, the minimums were 111, 56, and 
only $4. Figure 2 shows these minimum wealth paths. With full Kelly, and by inference 1/8, 1/4, 
1/2, and 3/4 Kelly, the investor can actually never go fully bankrupt because of the proportional 
nature of Kelly betting. 


Full Kelly 
2000 i Eom d T T T T T "up 
1000 Miis RENE | 
0 —L- = mE T i. i. if 
0 100 200 300 400 500 600 700 800 
3/4 Kelly 
2000 |—— TÉ FT T E T Ui T | 
1000 
0 L d. 3 AL vh 
ie) 100 200 300 400 500 600 700 800 
1/2 Kelly 
1000 T T T T T me T 
500r 4 
0 L bs Lj n LL E 
ie) 100 200 300 400 500 600 700 800 
. 1/4 Kelly 
2000 1 pem ote T OT A (quM 
1000 pm IL 1 
Ü —— L oe uw "A. L L L 
0 100 200 300 400 500 600 700 800 
1/8 Kelly 
1500 r T T UE: E H T 
1000 4 
500^ at L ES rs i 
100 200 300 400 500 600 700 800 


Minimum Wealth Paths 


Figure 2: Lowest Final Wealth Trajectory: Ziemba-Hausch (1986) Model 


If capital is infinitely divisible and there is no leveraging than the Kelly bettor cannot go bankrupt 
since one never bets everything (unless the probability of losing anything at all is zero and the 
probability of winning is positive). If capital is discrete, then presumably Kelly bets are rounded 
down to avoid overbetting, in which case, at least one unit is never bet. Hence, the worst case with 
Kelly is to be reduced to one unit, at which point betting stops. Since fractional Kelly bets less, 
the result follows for all such strategies. For levered wagers, that is, betting more than one’s wealth 
with borrowed money, the investor can lose more than their initial wealth and become bankrupt. 


Medium Term Simulations of the Full Kelly and Fractional Kelly Investment Strategies 547 


3 Proportional Investment Strategies: Alternative Experi- 


ments 


The growth and risk characteristics for proportional investment strategies such as the Kelly depend 
upon the returns on risky investments. In this section we consider some alternative investment 
experiments where the distributions on returns are quite different. The mean return is similar: 
14% for Ziemba-Hausch, 12.5%for Bicksler-Thorp I, and 10.2% for Bicksler- Thorp II. However, the 
variation around the mean is not similar and this produces much different Kelly strategies and 
corresponding wealth trajectories for scenarios. 


3.1 The Ziemba and Hausch (1986) Model 


The first experiment is a repeat of the Ziemba - Hausch model in Section 2. A simulation was per- 
formed of 3000 scenarios over T' — 40 decision points with the five types of independent investments 
for various investment strategies. The Kelly fractions and the proportion of wealth invested are 
reported in Table 3. Here, 1.0k is full Kelly, the strategy which maximizes the expected logarithm 
of wealth. Values below 1.0 are fractional Kelly and coincide in this setting with the decision from 
using a negative power utility function. Values above 1.0 coincide with those from some positive 
power utility function. This is overbetting according to MacLean, Ziemba and Blazenko (1992), 
because long run growth rate falls and security (measured by the chance of reaching a specific 
positive goal before falling to a negative growth level) also falls. 


| Kelly Fraction: f | 
Opportunity | 1.75k | 1-5k | 1.25k 1.0k 0.75k | 0.50k | 0.25k 
A 0.245 0.210 0.175 0.140 0.105 0.070 0.035 
B 0.1225 0.105 0.0875 0.070 0.0525 0.035 0.0175 
C 0.08225 | 0.0705 | 0.05875 0.047 0.03525 | 0.0235 | 0.01175 
D 0.06125 | 0.0525 | 0.04375 0.035 0.02625 | 0.0175 | 0.00875 
E 0.049 0.042 0.035 | 0.028 0.021 0.014 0.007 


Table 3: The Investment Proportions (A) and Kelly Fractions 


The initial wealth for investment was 1000. Table 4 reports statistics on the final wealth for T = 40 


with the various strategies. 
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Fraction 


1.5k 1.25k 1.0k 
6632.08 | 3044.34 | 1854.53 
1279.32 | 117274 | 1085.07 
Min 42.77 80.79 83.55 193.07 | 281.25 
849.73 
Skewness 4.72 3.49 
Kurtosis 38.22 21.94 
3000 3000 
2995 2998 


1556 | 11606 | 1762 
43 24 4 
9 | 0 | 0 | 
0 


Table 4: Wealth Statistics by Kelly Fraction: Ziemba-Hausch Model (1986) 


Statistic 0.75k 0.50k 0.25k 


6.66 2.17 


Since the Kelly bets are small, the proportion of current wealth invested is not high for any of 
the fractions. The upside and down side are not dramatic in this example, although there is a 
substantial gap between the maximum and minimum wealth with the highest fraction. Figure 3 
shows the trajectories which have the highest and lowest final wealth for a selection of fractions. The 
log-wealth is displayed to show the rate of growth at each decision point. The lowest trajectories 


are almost a reflection of the highest ones. 
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Figure 3: Trajectories with Final Wealth Extremes: Ziemba-Hausch Model (1986) 


The skewness and kurtosis indicate that final wealth is not normally distributed. This is expected 
since the geometric growth process suggests a log-normal wealth. Figure 4 displays the simulated 
log-wealth for selected fractions at the horizon T = 40. The normal probability plot will be linear 
if terminal wealth is distributed log-normally. The slope of the plot captures the shape of the log- 
wealth distribution. In this case the final wealth distribution is close to log-normal. As the Kelly 
fraction increases the slope increases, showing the longer right tail but also the increase in downside 


risk in the wealth distribution. 
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(b) Normal Plot 


Figure 4: Final Ln(Wealth) Distributions by Fraction: Ziemba-Hausch Model (1986) 


On the inverse cumulative distribution plot, the initial wealth In(1000) — 6.91 is indicated to show 
the chance of losses. The inverse cumulative distribution of log-wealth is the basis of comparisons 
of accumulated wealth at the horizon. In particular, if the plots intersect then first order stochastic 
dominance by a wealth distribution does not exist (Hanoch and Levy, 1969). The mean and standard 
deviation of log-wealth are considered in Figure 5, where the trade-off as the Kelly fraction varies 
can be understood. Observe that the mean log-wealth peaks at the full Kelly strategy whereas 
the standard deviation is monotone increasing. Fractional strategies greater than full Kelly are 
inefficient in log-wealth, since the growth rate decreases and the the standard deviation of log- 


wealth increases. 
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Figure 5: Mean-Std Tradeoff: Ziemba-Hausch Model (1986) 


The results in Table 4 and Figures 3 - 5 support the following conclusions for Experiment 1. 


l. The statistics describing end of horizon (T = 40) wealth are all monotone in the fraction 


of wealth invested in the Kelly portfolio. Specifically the maximum terminal wealth and 
the mean terminal wealth increase in the Kelly fraction. In contrast the minimum wealth 
decreases as the fraction increases and the standard deviation grows as the fraction increases. 
There is a trade-off between wealth growth and risk. The cumulative distribution in Figure 
4 supports the theory for fractional strategies, as there is no dominance, and the distribution 


plots all intersect. 


. The maximum and minimum final wealth trajectories clearly show the wealth growth - risk 
trade-off of the strategies. The worst scenario is the same for all Kelly fractions so that the 
wealth decay is greater with higher fractions. The best scenario differs for the low fraction 
strategies, but the growth path is almost monotone in the fraction. The mean-standard 
deviation trade-off demonstrates the inefficiency of levered strategies (greater than full Kelly). 


3.2 Bicksler - Thorp (1973) Case I - Uniform Returns 


There is one risky asset R having mean return of +12.5% , with the return uniformly distributed 
between 0.75 and 1.50 for each dollar invested. Assume we can lend or borrow capital at a risk free 
rate r = 0.0. Let A = the proportion of capital invested in the risky asset, where A ranges from 0.4 
to 2.4. So A = 2.4 means $1.4 is borrowed for each $1 of current wealth. The Kelly optimal growth 
investment in the risky asset for r = 0.0 is z = 2.8655. The Kelly fractions for the different values 
of A are shown in Table 3. (The formula relating A and f for this expiriment is in the Appendix.) 
In their simulation, Bicksler and Thorp use 10 and 20 yearly decision periods, and 50 simulated 
scenarios. We use 40 yearly decision periods, with 3000 scenarios. 
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[34 os | 12 [ 18 [| 29 | 24 


Table 5: The Investment Proportions and Kelly Fractions for Bicksler-Thorp (1973) Case I 


The numerical results from the simulation with T' — 40 are in Table 6 and Figures 7 - 9. Although 


the Kelly investment is levered, the fractions in this case are less than 1. 


Fraction 
Statistic 0.14k 0.28k 0.42k 0.56k 0. 70k 0.84k 
Max 34435.74 | 743361.14 | 11155417.33 | 124068469.50 | 1070576212.0 | 7399787898 
Mean 7045.27 | 45675.75 275262.93 1538429.88 7877534.72 | 36387516.18 
i 70.97 18.91 3.46 
6047844.60 44547205.57 | 272356844.8 
13.39 


674415.54 


2997 2991 


> 10 2998 2997 2995 2991 2980 2965 

> 107 529 2024 2808 2851 2847 2803 

> 10 0 293 1414 2025 2243 2290 
L——————— 

» 10 0 0 161 L 696 1165 1407 


Table 6: Final Wealth Statistics by Kelly Fraction for Bicksler-Thorp Case I 


In this experiment the Kelly proportion is high, based on the attractiveness of the investment in 
stock. The largest fraction (0.838k) shows strong returns, although in the worst scenario most of the 
wealth is lost. The trajectories for the highest and lowest terminal wealth scenarios are displayed 
in Figures 6. The highest rate of growth is for the highest fraction, and correspondingly it has the 
largest wealth fallback. 
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Figure 6: Trajectories with Final Wealth Extremes: Bicksler-Thorp (1973) Case I 


The distribution of terminal wealth in Figure 7 illustrates the growth of the f = 0.838% strategy. 
It intersects the normal probability plot for other strategies very early and increases its advantage. 
The linearity of the plots for all strategies is evidence of the log-normality of final wealth. The 
inverse cumulative distribution plot indicates that the chance of losses is small - the horizontal line 


indicates log of initial wealth. 
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(b) 
Figure 7: Final Ln(Wealth) Distributions: Bicksler-Thorp (1973) Case I 


As further evidence of the superiority of the f = 0.838k strategy consider the mean and standard 
deviation of log-wealth in Figure 8. The growth rate (mean In(Wealth)) continues to increase since 


the fractional strategies are less then full Kelly. 
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Figure 8: Mean-Std Trade-off: Bicksler-Thorp (1973) Case I 


From the results of this experiment we can make the following statements. 


1. The statistics describing end of horizon (T = 40) wealth are again monotone in the fraction 
of wealth invested in the Kelly portfolio. Specifically the maximum terminal wealth and 
the mean terminal wealth increase in the Kelly fraction. In contrast the minimum wealth 
decreases as the fraction increases and the standard deviation grows as the fraction increases. 
The growth and decay are much more pronounced than was the case in experiment 1. The 
minimum still remains above 0 since the fraction of Kelly is less than 1. There is a trade- 
off between wealth growth and risk, but the advantage of leveraged investment is clear. As 
illustrated with the cumulative distributions in Figure 7, the log-normality holds and the 
upside growth is more pronounced than the downside loss. Of course, the fractions are less 


than 1 so improved growth is expected. 


2. The maximum and minimum final wealth trajectories clearly show the wealth growth - risk 
of various strategies. The mean-standard deviation trade-off favors the largest fraction, even 


though it is highly levered. 


3.3 Bicksler - Thorp (1973) Case II - Equity Market Returns 


In the third experiment there are two assets: US equities and US T-bills. According to Siegel 
(2002), during 1926-2001 US equities returned of 10.2% with a yearly standard deviation of 20.3%, 
and the mean return was 3.9% for short term government T-bills with zero standard deviation. We 
assume the choice is between these two assets in each period. The Kelly strategy is to invest a 
proportion of wealth z = 1.5288 in equities and sell short the T-bill at 1 — x = —0.5228 of current 
wealth. With the short selling and levered strategies, there is a chance of substantial losses. For the 
simulations, the proportion: A of wealth invested in equities and the corresponding Kelly fraction 
f are provided in Table 7. (The formula relating A and f for this expiriment is in the Appendix.) 
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In their simulation, Bicksler and Thorp used 10 and 20 yearly decision periods, and 50 simulated 


scenarios. We use 40 yearly decision periods, with 3000 scenarios. 


24 | 


A] 0.4 | 0.8 | 12 | 1.6 | 20 
f | 0.26 | 0.52 | 0.78 | 1.05 | 1.31 | 1.57 


Table 7: Kelly Fractions for Bicksler-Thorp (1973) Case IT 


The results from the simulations with experiment 3 are contained in Table 8 and Figures 9, 10, and 
ll. This experiment is based on actual market returns. The striking aspects of the statistics in 


Table 8 are the sizable gains and losses. For the the most aggressive strategy (1.57k), it is possible 
to lose 10,000 times the initial wealth. This assumes that the shortselling is permissable through 


to the horizon. 


Table 8: Final Wealth Statistics by Kelly Fraction for Bicksler-Thorp (1973) Case II 


Fraction 
Statistic 0.26k 0.52k 0.78k 1.05k 1.31k 1.57k 
Max 65842.09 | 673058.45 | 5283234.28 | 33314627.67 | 174061071.4 769753090 
12110.34 | 30937.03 76573.69 182645.07 416382.80 895952.14 
2367.92 -4969.78 -133456.35 -6862762.81 | -102513723.8 
6147.30 174683.09 815091.13 3634459.82 15004915.61 
Skewness 1.54 4.88 13.01 25.92 38.22 45.45 
Kurtosis 4.90 51.85 305.66 950.96 1755.18 2303.38 
>5x 10 3000 3000 2998 2970. 2713 2184 
10 3000 3000 2998 2955 2671 2129 
»5x10 3000 3000 2986 2866 2520 1960 
»10 3000 2996 2954 2779 2409 1875 
> 10 1698 2276 2273 2112 1794 1375 
> 108 0 132 575 838 877 751 
> 10 0 9 116 216 270 


The highest and lowest final wealth trajectories are presented in Figures 9. In the worst case, the 


trajectory is terminated to indicate the timing of vanishing wealth. There is quick bankruptcy for 


the aggressive strategies. 
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Figure 9: Trajectories with Final Wealth Extremes: Bicksler-Thorp (1973) Case II 


The strong downside is further illustrated in the distribution of final wealth plot in Figure 10. The 
normal probability plots are almost linear on the upside (log-normality), but the downside is much 
more extreme than log-normal for all strategies except for 0.52k. Even the full Kelly is risky in this 
case. The inverse cumulative ‘distribution shows a high probability of large losses with the most 
aggressive strategies. In constructing these plots the negative growth was incorporated with the 


formula growth = [signW7] ln(|Wr|). 
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(b) Normal Plot 


Figure 10: Final Ln(Wealth) Distributions: Bicksler-Thorp (1973) Case II 


The mean-standard deviation trade-off in Figure 11 provides more evidence to the riskyness of the 
high proportion strategies. When the fraction exceeds the full Kelly, the drop-off in growth rate is 
sharp, and that is matched by a sharp increase in standard deviation. 
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Figure 11: Mean-Std Tradeoff: Bicksler-Thorp (1973) Case II 


The results in experiment 3 lead to the following conclusions. 


4 


1. The statistics describing the end of the horizon (T = 40) wealth are again monotone in the 


fraction of wealth invested in the Kelly portfolio. Specifically (i) the maximum terminal wealth 
and the mean terminal wealth increase in the Kelly fraction; and (ii) the minimum wealth 
decreases as the fraction increases and the standard deviation grows as the fraction increases. 
The growth and decay are pronounced and it is possible to have large losses. The fraction of 
the Kelly optimal growth strategy exceeds 1 in the most levered strategies and this is very 
risky. There is a trade-off between return and risk, but the mean for the levered strategies 
is growing far less than the standard deviation. The disadvantage of leveraged investment is 
clearly illustrated with the cumulative distribution in Figure 10. The log-normality of final 


wealth does not hold for the levered strategies. 


. The maximum and minimum final wealth trajectories clearly show the return - risk of levered 


strategies. The worst and best scenarios are the not same for all Kelly fractions. The worst 
scenario for the most levered strategy shows the rapid decline in wealth. The mean-standard 
deviation trade-off confirms the riskyness/folly of the aggressive strategies. 


Discussion 


The Kelly optimal capital growth investment strategy is an attractive approach to wealth creation. 
In addition to maximizing the rate of growth of capital, it avoids bankruptcy and overwhelms 
any essentially different investment strategy in the long run (MacLean, Thorp and Ziemba, 2009). 
However, automatic use of the Kelly strategy in any investment situation is risky. It requires some 
adaptation to the investment environment: rates of return, volatilities, correlation of alternative 


assets, estimation error, risk aversion preferences, and planning horizon. The experiments in this 
paper represent some of the diversity in the investment environment. By considering the Kelly 
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and its variants we get a concrete look at the plusses and minusses of the capital growth model. 
The main points from the Bicksler and Thorp (1973) and Ziemba and Hausch (1986) studies are 


confirmed. 


The wealth accumulated from the full Kelly strategy does not stochastically dominate frac- 
tional Kelly wealth. The downside is often much more favorable with a fraction less than 


one. 


There is à tradeoff of risk and return with the fraction invested in the Kelly portfolio. In cases 
of large uncertainty, either from intrinsic volatility or estimation error, security is gained by 
reducing the Kelly investment fraction. 


e The full Kelly strategy can be highly levered. While the use of borrowing can be effective 
in generating large returns on investment, increased leveraging beyond the full Kelly is not 
warranted. The returns from over-levered investment are offset by a growing probability of 


bankruptcy. 


e The Kelly strategy is not merely a long term approach. Proper use in the short and medium 


run can achieve wealth goals while protecting against drawdowns. 
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5 Appendix 
The proportional investment strategies in the experiments of Bicksler and Thorp (1973) have frac- 


tional Kelly equivalents. The Kelly investment proportion for the experiments are deveolped in this 


appendix. 
5.1 KELLY STRATEGY WITH UNIFORM RETURNS 
Consider the problem 

Maz; (E(In(1-- r c z(R —r)), 


where R is uniform on [a,b] and r =the risk free rate. 


We have the first order condition 


b R-r 1 
LT dR=0 
f Fr aR A patt ? 


which reduces to 


Ltr$28- 0) es [Erst T 


b-a)=(1 l =eltr 

a(d~ a) = ( eni (ITEM l+r+a(a-r) 

In the case considered in Experiment II, a = —0.25,b = 0.5,r = 0. The equation becomes 
Pu 

Few * = e975, with a solution z = 2.8655. So the Kelly strategy is to invest 286.5596 of 


wealth in the risky asset. 


5.2 Kelly Strategy with Normal Returns 


Consider the problem 
Maz, {E(inQlt+r+2(R-1)}, 


where R is Gaussian with mean 4g and standard deviation op, and r =the risk free rate. The 


solution is given by Merton (1990) as 


= HRT? 
oR — 


zr 


The values in Experiment III are ur = 0.102,05; = 0.203,r = 0.039, so the Kelly strategy is 
xz = 1.5288. 


19 


This page is intentionally left blank 


563 


39 


Good and Bad Properties of the Kelly Criterion* 


Leonard C. MacLean 
School of Business, Dalhousie University, Halifax, NS 


Edward O. Thorp 


E. O. Thorp and Associates, Newport Beach, CA 
Professor Emeritus, University of California, Irvine 


William T. Ziemba 
Professor Emeritus, University of British Columbia, Vancouver, BC 
Visiting Professor, Mathematical Institute, Oxford University, UK 


ICMA Centre, University of Reading, UK 
University of Bergamo, Italy 


Abstract 


We summarize what we regard as the good and bad properties of the Kelly cri- 
terion and its variants. Additional properties are discussed as observations. 


The main advantage of the Kelly criterion, which maximizes the expected value of 
the logarithm of wealth period by period, is that it maximizes the limiting expo- 
nential growth rate of wealth. The main disadvantage of the Kelly criterion is that 
its suggested wagers may be very large. Hence, the Kelly criterion can be very risky 
in the short term. 

In the one asset two valued payoff case, the optimal Kelly wager is the edge 
(expected return) divided by the odds. Chopra and Ziemba (1993), reprinted in 
Part II of this volume, following earlier studies by Kallberg and Ziemba (1981, 
1984) showed for any asset allocation problem that the mean is much more impor- 
tant than the variances and co-variances. Errors in means versus errors in variances 
were about 20: 2:1 in importance as measured by the cash equivalent value of final 
wealth. Table 1 and Figure 1 show this and illustrate that the relative importance 
depends on the degree of risk aversion. The lower is the Arrow-Pratt risk aver- 
sion, Ra = —u'(w)/w (w), the higher are the relative errors from incorrect means. 
Chopra (1993) further shows that portfolio turnover is larger for errors in means 


*Special thanks go to Tom Cover and John Mulvey for helpful comments on an earlier draft of 
this paper. 
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Table 1 Average Ratio of Certainty Equivalent Loss for Errors in Means, Vari- 
ances and Covariances. [Source: Chopra and Ziemba (1993)] 


Errors in Means Errors in Means Errors in Variances 


Risk Tolerance* vs Covariances vs Variances vs Covariances 
25 5.38 3.22 1.67 
50 22.50 10.98 2.05 
75 56.84 21.42 2.68 
l l l 
20 10 2 
Error Mean Error Var Error Covar 
20 2 J 
aia 
*Risk tolerance=Rr (w) = PRAG where RA(w) = — vs 
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Figure 1 Mean Percentage Cash Equivalent Loss Due to Errors in Inputs 


than for variances and for co-variances but the degree of difference in the size of the 
errors is much less than the performance as shown in Figure 2. 

Since log has R4(w) = 1/w, which is close to zero, the Kelly bets may be ex- 
ceedingly large and risky for favorable bets. In MacLean et al. (2009), we present 
simulations of medium term Kelly, fractional Kelly, and proportional betting strate- 
gies. The results show that, with favorable investment opportunities, Kelly bettors 
attain large final wealth most of the time. However, because a long sequence of 
bad scenario outcomes is possible, any strategy can lose substantially even if there 
are many independent investment opportunities and the chance of losing at each 
investment decision point is small. The Kelly and fractional Kelly rules, like all 
other rules, are never a sure way of winning for a finite sequence. 

In Part VI of this volume, we describe the use of the Kelly criterion in many 
applications and by many great investors. Two of them, Keynes and Buffett, were 
long term investors whose wealth paths were quite rocky but with good long term 
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Figure 2 Average turnover for different percentage changes in means, variances and covariances. 
[Source: Based on data from Chopra (1993)] 


outcomes. Our analyses suggest that Buffett seems to act similar to a fully Kelly 
bettor (subject to the constraint of no borrowing) and Keynes like a 80% Kelly 
bettor with a negative power utility function —w 975, see Ziemba (2003) and the 
wealth graphs reprinted in Part VI from Ziemba (2005). 

Graphs such as Figure 3 show that growth is traded off for security with the use 
of fractional Kelly strategies and negative power utility functions. Log maximizes 
the long run growth rate. Utility functions such as positive power that bet more 
than Kelly have more risk and lower growth. One of the properties shown below 
that is illustrated in the graph is that for processes, which are well approximated 
by continuous time, the growth rate becomes zero plus the risk free rate when one 
bets exactly twice the Kelly wager. 

Hence, it never pays to bet more than the Kelly strategy because then risk in- 
creases (lower security) and growth decreases, so Kelly dominates all these strategies 
in geometric risk-return or mean-variance space. See Ziemba (2009) in this volume. 
As you exceed the Kelly bets more and more, risk increases and long term growth 
falls, eventually becoming more and more negative. Long Term Capital is one of 
many real world instances in which overbetting led to disaster. See Ziemba and 
Ziemba (2007) for additional examples. 

Thus, long term growth maximizing investors should bet Kelly or less. We call 
betting less than Kelly “fractional Kelly”, which is simply a blend of Kelly and 
cash. Consider the negative power utility function ów? for ó « 0. This utility 
function is concave and when ó — 0, it converges to log utility. As ó becomes 
more negative, the investor is less aggressive since his absolute Arrow-Pratt risk 
aversion index is also higher. For the case of a stationary lognormal process and a 
given 6 for utility function ów? and a = 1/(1 — 6) between 0 and 1, they both will 
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Figure 3 Probability of doubling and quadrupling before halving and relative growth rates versus 
fraction of wealth wagered for Blackjack (2% advantage, p = 0.51 and g = 0.49). [Source: MacLean 
and Ziemba (1999)] 


provide the same optimal portfolio when « is invested in the Kelly portfolio and 
1—a is invested in cash. This handy formula relating the coefficient of the negative 
power utility function to the Kelly fraction is correct for lognormal investments and 
approximately correct for other distributed assets; see MacLean, Ziemba and Li 
(2005). For example, half Kelly is 6 = —1 and quarter Kelly is 6 = —3. So if you 
want a less aggressive path than Kelly, pick an appropriate 6. This formula does 
not apply more generally. For example, for coin tossing, where Pr(X = 1) = p, 
Pr(X -1)=4¢,p+¢=1, 
ft = pri —qr3 p*-g? 


which is not af*, where f* = p— q > 0 is the Kelly bet. 

We now list these and other important Kelly criterion properties, updated from 
MacLean, Ziemba, and Blazenko (1992), MacLean and Ziemba (1999), and Ziemba 
and Ziemba (2007). See also Cover and Thomas (2006, Chapter 16). 


The Good Properties 


Good Maximizing ElogX asymptotically maximizes the rate of asset growth. See 
Breiman (1961), Algoet and Cover (1988). 

Good The expected time to reach a preassigned goal A is asymptotically least as 
A increases without limit with a strategy maximizing ElogX y. See Breiman 
(1961), Algoet and Cover (1988), Browne (19972). 

Good Under fairly general conditions, maximizing ElogX also asymptotically max- 
imizes median logX. See Ethier (1987, 2004, 2010). 
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Good 
Good 
Good 


Good 


Good 


Good 


The ElogX bettor never risks ruin. See Hakansson and Miller (1975). 

The absolute amount bet is monotone increasing in wealth. 

The ElogX bettor has an optimal myopic policy. He does not have to con- 
sider prior nor subsequent investment opportunities. This is a crucially 
important result for practical use. Hakansson (1971) proved that the my- 
opic policy obtains for dependent investments with the log utility function. 
For independent investments and any power utility, a myopic policy is opti- 
mal, see Mossin (1968). In fact, past outcomes can be taken into account by 
maximizing the conditional expected logarithm given the past (Algoet and 
Cover, 1988). 

Simulation studies show that the ElogX bettor's fortune pulls ahead of other 
“essentially different" strategies’ wealth for most reasonable-sized samples. 
"Essentially different" has a limited meaning. For example, g* 2 g but 
g* — g = € will not lead to rapid separation if e is small enough. The key 
again is risk. See Bicksler and Thorp (1973), Ziemba and Hausch (1986), 
and MacLean, Thorp, Zhao and Ziemba (2009) in this volume. General 
formulas are in Aucamp (1993). 

If you wish to have higher security by trading it off for lower growth, then 
use a negative power utility function, ów?, or fractional Kelly strategy. See 
MacLean, Sanegre, Zhao and Ziemba (2004) reprinted in Part III, who show 
how to compute the coefficent to stay above a growth path at discrete points 
in time with given probability or to be above a given drawdown with a certain 
confidence limit. MacLean, Zhao and Ziemba (2009) add the feature that 
path violations are penalized with a convex cost function. See also Stutzer 
(2009) for a related but different model of such security. 

Competitive optimality. Kelly gambling yields wealth X" such that 
E(4&) < 1, for all other strategies X. This follows from the Kuhn Tucker 
conditions. Thus, by Markov’s inequality, Pr[X > tX*] < 1, for t > 1, 
and for all other induced wealths x. Thus, an opponent cannot outperform 
X* by a factor t with probability greater than 1, This inequality can be 
improved when t = 1 by allowing fair randomization U. Let U be drawn 
according to a uniform distribution over the interval [0,2], and let U be in- 
dependent of X*. Then the result improves to Pr[X > UX*] < 1 for all 
portfolios X. 'Thus, fairly randomizing one's initial wealth and then invest- 
ing it according to the Kelly criterion, one obtains a wealth U X* that can 
only be beaten half the time. Since a competing investor can use the same 
strategy, probability i is the best competitive performance one can expect. 
We see that Kelly gambling is the heart of the solution for this two-person 
zero sum game of who ends up with the most money. So we see that X* 
(actually U.X*) is competitively optimal in a single investment period (Bell 
and Cover 1980, 1988). 
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Good If X* is the wealth induced by the log optimal (Kelly) portfolio, then the 


expected wealth ratio is no greater than one, i.e., E( x) X 1, for the wealth 


X induced by any other portfolio (Bell and Cover, 1980, 1988). 


Good Super St. Petersburg. Any cost c for the St. Petersburg random variable X, 


Pr [X = 2*] = 27*, is acceptable. But the larger the cost c, the less wealth 
one should invest. The growth rate G* of wealth resulting from repeated 
such investments is 


G* = max Eln (1-74 4x) 
O<f<l C 


where f is the fraction of wealth invested. The maximizing f* is the Kelly 
proportion (Cover and Bell, 1980). The Kelly fraction f* can be computed 
even for a super St. Petersburg random variable Pr |Y = 2 -2-* k= 


1,2,..., where Eln Y = oo, by maximizing the relative growth rate 
—~f+4Y 
max Eln— 1 
o&f«1 5T axY 


This is bounded for all f in [0,1]. 

Now, although the exponential growth rate of wealth is infinite for all 
proportions f and it seems that all f € [0,1] are equally good, the max- 
imizing f* in the previous equation guarantees that the f* portfolio will 
asymptotically exponentially outperform any other portfolio f € [0,1]. Both 
investors’ wealth have super exponential growth, but the f* investor will ex- 
ponentially outperform any other essentially different investor. 


The Bad Properties 


Bad 


Bad 


The bets may be a large fraction of current wealth when the wager is favorable 
and the risk of loss is very small. For one such example, see Ziemba and 
Hausch (1986; 159-160). There, in the inaugural 1984 Breeders Cup Classic 
$3 million race, the optimal fractional wager, using the Dr. Z place and show 
system using the win odds as the probability of winning, on the 3-5 shot 
Slew of Gold was 64% (see also the 74% future bet on the January effect in 
MacLean, Ziemba and Blazenko (1992) reprinted in this volume). Thorp and 
Ziemba actually made this place, show bet and won with a low fractional 
Kelly wager. Slew finished third but the second place horse Gate Dancer was 
disqualified and placed third. Wild Again won this race and it was the first 
great victory of the masterful jockey Pat Day. 

For coin tossing, any fixed fraction strategy has the property that if the num- 
ber of wins equals the number of losses, then the bettor is behind. For n wins 
and n losses and initial wealth Wo, we have Wzn = Wo(1 — f?)^. 
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Bad The unweighted average rate of return converges to half the arithmetic rate 
of return. Thus, you may regularly win less than you expect. This is a 
consequence of weighting equally rather than by size of the wager. See Ethier 
and Tavaré (1983) and Griffin (1985). 


Some Observations 


e For an iid process and à myopic policy, which results from maximizing ex- 
pected utility in case the utility function is log or a negative power, the result 
is fixed fraction betting, hence fractional Kelly includes all these policies. 

e A betting strategy is “essentially different" from Kelly if Sn = $77 , Elog(1+ 
f: Xi) — 357.4 Elog(1 + f; X;) tends to infinity as n increases. The sequence 
Uf] denotes the Kelly betting fractions and the sequence {f;} denotes the 
corresponding betting fractions for the essentially different strategy. 

e The Kelly portfolio does not necessarily lie on the efficient frontier in a mean- 
variance model (Thorp, 1971). 

e Despite its superior long-run growth properties, Kelly, like any other strategy, 
can have a poor return outcome. For example, making 700 wagers, all of 
which have a 1496 advantage, the least of which has a 1996 chance of winning, 
can turn $1000 into $18. But with full Kelly 16.696 of the time, $1000 turns 
into at least $100,000, see Ziemba and Hausch (1996). Half Kelly does not 
help much as $1000 can become $145 and the growth is much lower with only 
$100,000 plus final wealth 0.196 of the time. For more such calculations, see 
Bicksler and Thorp (1973) and MacLean, Thorp, Zhao and Ziemba (2009) in 
this volume. 

e Fallacy: If maximizing Elog Xy almost certainly leads to a better outcome, 
then the expected utility of its outcome exceeds that of any other rule provided 
N is sufficiently large. Counter-ezample: u(x) = z,1/2 < p < 1, Bernoulli 
trials f = 1 maximizes EU (zx) but f = 2p — 1 < 1 maximizes ElogX y. See 
Samuelson (1971) and Thorp (1971, 2006). 

e It can take a long time for any strategy, including Kelly, to dominate an es- 
sentially different strategy. For instance, in continuous time with à geometric 
Wiener process, suppose Ha = 20%, ug = 1096, ga = 9g = 1096. Then in five 
years, A is ahead of B with 95% confidence. But if og = 20%, og = 10% with 
the same means, it takes 157 years for A to beat B with 95% confidence. As 
another example, in coin tossing, suppose game A has an edge of 1.0% and 
game B 1.1%. It takes two million trials to have an 84% chance that game A 
dominates game B, see Thorp (2006). 


'The theory and practical application of the Kelly criterion is straightforward 
when the underlying probability distributions are fairly accurately known. However, 
in investment applications, this is usually not the case. Realized future equity 
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returns may be very different from what one would expect using estimates based on 
historical returns. Consequently, practitioners who wish to protect capital above 
all, sharply reduce risk as their drawdown increases. 

Prospective users of the Kelly Criterion can check our list of good properties, 
bad properties and observations to test whether Kelly is well suited to their intended 
application. Given the extreme sensitivity of Elog calculations to errors in mean 
estimates, these estimates must be accurate and to be on the safe side, the size of 
the wagers should be reduced. 

For long term compounders, the good properties dominate the bad properties 
of the Kelly criterion, but the bad properties may dampen the enthusiasm of naive 
prospective users of the Kelly criterion. The Kelly and fractional Kelly strategies are 
very useful if applied carefully with good data input and proper financial engineering 
risk control. 


Appendix 


In continuous time, with à geometric Wiener process, betting exactly double the 
Kelly criterion amount leads to a growth rate equal to the risk free rate. This result 
is due to Thorp (1997), Stutzer (1998), Janacek (1998), and possibly others. The 
following simple proof, under the further assumption of the Capital Asset Pricing 
Model, is due to Harry Markowitz and appears in Ziemba (2003). 

In continuous time 


Ip = Ep — 2 Vp 


Ep, Vp, Jp are the portfolio expected return, variance and expected log, respectively. 
In the CAPM 


Ep = fo + (Em — To) X 
Vp = 0% X? 


where X is the portfolio weight and ro is the risk free rate. Collecting terms and 
setting the derivative of gp to zero yields 


X= (Eu — rg)/o2, 


which is the optimal Kelly bet with optimal growth rate 


. 1 
g* = ro + (Em — ro)? — 5 KEwro)/o] e 


1 
= ro + (Em — ro) [054 — z (Em — ro} on 


= ro + S|(E - M - neuf. 
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Substituting double Kelly, namely Y — 2X for X above into 


1 
2 v2 
59MY 


Ip — TQ + (Em = ro)Y m 9 


and simplifying yields 


4 
go — ro = 2(Em — ro)? /a54 — 5 (EM — rg)?/o$, = 0. 


Hence go = ro when Y = 25. 
The CAPM assumption is not needed. For a more general proof and illustration, 
see T'horp (2006). 
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Introduction to the Utility Foundations of Kelly 


The criticisms of the Kelly strategy based on utility are general. If preferences are 
identified with wealth characteristics such as growth and tail values, then there is 
a utility foundation for Kelly and fractional Kelly strategies. 

Hakansson and Ziemba (1995) survey their and others! works on capital growth 
theory. They discuss Hakansson's consumption-investment model over time in the 
expected log context and MacLean and Ziemba's growth and security tradeoffs and 
its relationship with fractional Kelly strategies. They discuss the properties of these 
models and relate capital growth to expected utility. 

Luenberger (1993) investigates the long term behavior of E log strategies. Mer- 
ton and Samuelson (1974) show that 

lim E [u(W,)] 


RCO 


is not an expected utility so the idea of using utility at a terminal time and taking 
the limit is not valid. Hence, the log mean-variance approach is not consistent 
with expected utility theory. Luenberger provides a new preference based approach 
to the problem assuming independent non-negative identically distributed returns. 
He establishes preferences on infinite sequences of wealth rather than wealth at a 
fixed (but later taken to the limit) terminal time. This approach avoids the limit 
problems and allows use of probability limit theorems. 

If the tail preference is defined by a simple utility function, then the utility 
must be the expected log of wealth. Tail preferences are appropriate if the goal of 
investment is long term wealth maximization. If preferences are represented by a 
compound utility function, then that function will be a function of the expected 
logarithm and the variance of the logarithm of wealth. So in that case, there is 
a Markowitz type efficient tradeoff frontier of ElogW and VarlogW that was 
originally suggested by Williams (1936). 

Stutzer (2003) provides an alternative behavioral foundation for an investor’s use 
of power utility and its defining risk aversion parameter. This involves an investor’s 
desire to minimize the probability that the wealth growth rate will not exceed an 
exogenous target growth rate. Stutzer uses the Gartner-Ellis large deviations theory 
to show that this leads to an equivalent power utility function. This means that 
the investor's risk aversion parameter depends on the investment opportunity set 
contrary to typical model assumptions. Stutzer's formulation uses à wealth variable 
in the utility function that is the ratio of current wealth to the level of wealth growing 
at the constant target rate. Also the power coefficient is not specified in advance 
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but rather is an output of the optimization. For ease of estimation, an independent 
identically distributed asset return assumption is made. 

Stutzer (2010) in a followup paper discusses further how to increase security 
against under-performance in dynamic investment analysis over finite horizons 
against a specified exogenous benchmark. Motivation is provided because it takes 
a long time to be fairly sure that Kelly strategies dominate, and the Kelly strategy 
has considerable risk of under-performing or having very low returns. He uses à 
Bayesian formulation of the Occam's Razor Principle to illustrate the unavoidable 
reduction of statistical testability (namely the ability to more easily falsify) inherent 
in objective functions that have non-directly observable parameters. The coefficient 
of the power utility function is endogenously determined. The setting used is the 
familiar blackjack example previously discussed by Thorp and by MacLean, Ziemba, 
and Blazenko (1992). 
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1. Introduction 


Even casual observation strongly suggests that capital growth is not just a 
catch-phrase but something which many actively strive to achieve. It is therefore 
rather surprising that capital growth theory is a relatively obscure subject. For 
example, the great bulk of today's MBÆ's have had little or no exposure to the 
subject, having had their attention focussed almost exclusively on the single-period 
mean-variance model of portfolio choice. The purpose of this essay is to review 
the theory of capital growth, in particular the so-called growth-optimal investment 
strategy, its properties, its uses, and its links to betting and other investment 
models. We also discuss several applications that have tended to refine the basic 
theory. 

The central feature of the growth-optimal investment strategy, also known as 
the geometric mean model and the Kelly criterion, is the logarithmic shape of 
the objective function. But the power and durability of the model is due to a 
remarkable set of properties. Some of these are unique to the growth-optimal 
strategy and the others are shared by all the members of the (remarkable) small 
family to which the growth optimal strategy belongs. 

Investment over time is multiplicative, not additive, due to the compounding 
nature of the process itself. This makes a number of results in dynamic investment 
theory appear nonintuitive. For example, in the single-period portfolio problem, 
the optimal investment policy is very sensitive to the utility function being used; 
the set of policies that are inadmissible or dominated across all utility functions is 
relatively small. The same observation holds in the dynamic case when the number 
of periods is not large. But as the number of periods does become large, the set of 
investment policies that are optimal for current investment tends to shrink drasti- 
cally, at least in the basic reinvestment case without transaction costs. As we will 
see, many strikingly different investors will, in essence, invest the same way when 
the horizon is distant and will only begin to part company as their horizons near. 
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It is tempting to conjecture that all long-run investment policies to which 
risk-averse investors with monotone increasing utility functions will flock, under a 
favorable return structure, insure growth of capital with a very high probability. 
Such a conjecture is false; many investors will, even in this case, converge 
on investment policies which almost surely risk ruin in the long run, in effect 
ignoring feasible policies which almost surely lead to capital growth. Similarly, 
the relationship between the behavior of capital over time and the behavior 
of the expected utility of that same capital over time often appears strikingly 
nonintuitive. 

Section 2 reviews the origins of the capital growth model while Section 3 
contains a derivation and identifies its key properties. The conditions for capital 
growth are examined in Section 4. The model’s relationship to other long-run 
investment models is studied in Section 5 and Section 6 contains its role in 
intertemporal investment/consumption models. Section 7 adds various constraints 
for accomplishing tradeoffs between growth and security, while Section 8 reviews 
various applications. A concluding summary is given in Section 9. 


2. Origins of the model 


The approach to investment commonly known as the growth-optimal investment 
strategy has a number of apparently independent origins. In particular, Williams 
[1936], Kelly [1956], Latane [1959], and Breiman [1960, 1961] seem to have 
been unaware of each other's papers. But one can also argue that Bernoulli 
(1738) unwittingly stumbled on it in 1738 in his resolution of the St. Petersburg 
Paradox — see the 1954 translation — and Samuelson's survey [1977]. 

Samuelson [1971] appears to be the earliest to have related the geometric mean 
criterion to utility theory — and to find it wanting. The growth optimal strategy's 
inviolability in the larger consumption-investment context when preferences for 
consumption are logarithmic was first noted by Hakansson [1970]. Finally, models 
considering tradeoffs between capital growth and security appear to have been 
pioneered by MacLean & Ziemba [1986]. 


3. The model and its basic properties 


The following notation and basic assumptions will be employed: 


w, = amount of investment capital at decision point t (the end of the tth 
period); 

M, = the number of investment opportunities available in period t, where 
M, < M; 

S, = the subset of investment opportunities which it is possible to sell short in 
period t; 


ry = rate of interest in period t; 
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rj; = return per unit of capital invested in opportunity i, where i = 2,..., M, 
in the tth period (random variable). That is, if we invest an amount 0 in i 
at the beginning of the period, we will obtain (1 + rj,)0 at the end of that 


period; 
zi = amount lent in period ¢ (negative zı; indicate borrowing) (decision vari- 
able); 
zi = amount invested in opportunity i, / = 2,..., M, at the beginning of the 
tth period (decision variable); 
Fia, Y3, -+ -, YMj) = Pr {ru S ya rai S yas rM S Yu b 
£t = (22, 2o TMt) 
Zi ; 
Xit = n , i —1,..., Mg 
W-1 
XQ = XU... XM, 
(Xj) = xi,..., Xt. 


The capital market will generally be assumed to be perfect, i.e. that there are no 
transaction costs or taxes, that the investor has no influence on prices or returns, 
that the amount invested can be any real number, and that the investor has full 
use of the proceeds from any short sale. 

The following basic properties of returns will be assumed: 


nz0, t=1,2,... (1) 
E[ri] 2 8+ ri, ó » 0, somei,t —1,2,... (2) 
Efra] x K, all i, t. (3) 


These assumptions imply that the financial market provides a ‘favorable game.’ 

We also assume that the (nonstationary) return distributions F; are either 
independent from period to period or obey a Markov process and they also satisfy 
the ‘no-easy-money condition’ 


M, M, 
P ju — ry); < | > ô for all t and all 6; such that 5D Bj| = 1, 
m and; > Oforalli g $, ^^ (4) 


where 6; < 0, 5) > 0. 

Condition (4) is equivalent to what is often referred to as the no-arbitrage 
condition. It is generally a necessary condition for the portfolio problem to have a 
solution. 

We also assume that the investor must remain solvent in each period, i.e., that 
he or she must satisfy the solvency constraints 


Pr{w,>O0}=1, ¢=1,2,.... (5) 
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The amount invested at time t — 1 is 
M, 
> Zit = Wi-1 
i=1 


and the value of the investment at time t, broken down between its risky and 
riskfree components, is 


M, M, 
W, = xe! + Fizi ++ no(1 — Yu). 
i=2 i=2 
which together yield the basic difference equation 
Mi 
Wr = $ (ra nga + wall + rw), t=1,2,... 
i=2 (6) 
= w-1 Rix) = woR (44)... Re (xr), t=1,2,... 
where 
Mi 
Rx) = yu > ru)Xii t lo ry. (7) 
i=2 


Let us now turn to the basic reinvestment problem which (ignores capital 
infusions and distributions and) simply revises the portfolio at discrete points in 
time. In view of (5), (6) may be written 


t 
v; = wo exp | “In Rn(xn)}, t=1,2,.... (8) 
n=1 ` 
Defining 


| vin Rain) | 


G,((x:)) = = (9) 


(8) becomes 


wi = wo[exp(G(Gx))]]' (10) 
= wo(1 + gi), 


where g, = exp G;((x,)) — 1 is the compound growth rate of capital over the first t 
periods. 
By the law of large numbers, 


Gi((x1)) > E[GiQQ)] 
under mild conditions. Thus, it is evident that for large T, 


w, > Oif E[G] <6 <0, tT, (11) 
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w, > œ if E[G,] 3 8620, t-T (12) 
& > exp E[G;] - 1. (13) 


Under stationary returns and policies(x;), (11) and (12) simplify to 


w, —0 if E(In Ra(x,)] < 0 


any n. 
w, > œ if E[In Ra (x)| > 0 


There is nothing intuitive that would suggest that the sign of E[In R,,(x,)] is the 
determinant of whether our capital will decline or grow in the (stationary) simple 
reinvestment problem. What is evident is that the expected return on capital, 
E[R,] — 1, is not what matters. As (6) reminds us, capital growth (positive or 
negative) is a multiplicative, not an additive process. 

To illustrate the point, consider the case of only two assets, one riskfree yielding 
5% per period, and the other returning either —60% or +100% with equal proba- 
bilities in each period. Always putting all of our capital in the riskfree asset clearly 
gives a 5% growth rate of capital. The expected return on the risky asset is 20% 
per period. Yet placing all of our funds in the risky asset at the beginning of each 
period results in a capital growth rate that converges to —10.55%! It is easy to see 
this. We will double our money to 200% roughly half of the time. But we will also 
lose 60% (bringing the 200% to 80%) of our beginning-of-period capital about half 
the time, for a ‘two-period return’ of —20% on average, or —10.55% per period. 
Expected capital E[w,], on the other hand, has a growth rate of 20% per period. 

What this simple example demonstrates is that there are many investment 
strategies for which, as t — oo, 


E[w,] > co 

Median [w,] > 0 

Mode [w,] > 0 

Pr (w, < $1) 1. 
The coexistence of the above four measures results when E[G,] x 8 < 0 fort > T 
and a long (but thin) upper tail is generated as w, moves forward in time. 


In view of (7), (9) and (10), we observe that to ‘maximize’ the long-run growth 
rate g,, it is necessary and sufficient to maximize E[G,({x,))], or 


Max ( E[In R1(x))] + E[In R3(x2)] + ...} (14) 


Whenever returns are independent from period to period or the economy obeys a 
Markov process !, it is necessary and sufficient to accomplish (14) to 


Max E[In R;(x;)] sequentially at each ż — 1. (15) 
Xr 


! Algoet & Cover [1988] show formally that the growth-optimal strategy maintains its basic 
properties under arbitrary returns processes. 
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Since the geometric mean of R,(x;) = exp{E[In R, (x;)]}, we observe that (15) is 
also equivalent to maximizing the geometric mean of principal plus return at each 
point in time. 


3.1. Properties of the growth-optimal investment strategy 


Since the solution (x) to (15), in view of (10) and (13), almost surely leads 
to more capital in the long run than any other investment policy which does 
not converge to it, (x7) is referred to as the growth-optimal investment strategy. 
Existence is assured by the no-easy-money condition (4), the bounds on expected 
returns (1)-(3), and the solvency constraint (5). The strict concavity of the 
objective function in (15) implies that the optimal payoff distribution R;(x7) is 
unique; the optimal policy x; itself will be unique only if, for any security i, there 
is no portfolio of the other assets which can replicate the return pattern rj;. 

It is probably not surprising that the growth-optimal strategy never risks ruin, i.e. 


Pr(R;(xP) 20) 20 


— because to grow you have to survive. But this need not mean that the solvency 
constraint is not binding: E[In R,(x;)] may exist even when R, touches 0 as long 
as the lower tail is very thin. The conditions (1)-(3) imply that positive growth is 
feasible. Another dimension of the consistency between short-term and long-term 
performance was observed by Bell & Cover [1988]. 

As shown by Breiman [1961], the growth-optimal strategy also has the property 
that it asymptotically minimizes the expected time to reach a given level of capital. 
This is not surprising in view of the characteristics noted in the previous two 
paragraphs. 

It is also evident from (15) that the growth-optimal strategy is myopic even 
when returns obey a Markov process (Hakansson 1971c). This property is clearly 
of great practical significance since it means that the investor only needs to 
estimate the coming period's (joint) return structure in order to behave optimally 
in a long-run sense; future periods’ return structures have no influence on the 
current period's optimal decision. No other dynamic investment model has this 
property in a Markov economy; only a small set of other families have it when 
returns are independent from period to period (see Section 5). 

The growth-optimal strategy implies, and is implied by, logarithmic utility of 
wealth at the end of each period. This is because at each t — 1 


Max E[In R;(x:)] 
~ Max (E[In R; (xj) + In w,1]) 
= Max E[In(w, i R: (x:))] = Max E[In wi (z))]. 


Since every utility function is unique (up to a positive linear transformation), 
it also follows that the growth-optimal strategy is not consistent with any other 
end-of-period utility function (more on this in the next subsection). 
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The relative risk aversion function 


(2o wu"(w) 
q(w)z uw) 
equals 1 when u(w) — In(w) (it is 0 for a risk-neutral investor). Thus, we observe 
that to do 'the best' in the long run in terms of capital growth, it is not only 
necessary to be risk averse in each period. We must also display the ‘right’ amount 
of risk aversion. The long-run growth rate of capital will be lower either if one 
invests in a way which is more risk averse than the logarithmic function or relies 
on an objective function which is less risk averse. 

The growth-optimal investment strategy is not only linear in beginning-of-period 
wealth but proportional as well since definitionally 


* * 
Zr = Wi -1X, . 


Both of these properties are shared by only a small family of investment models. 

Since the growth-optimal strategy is consistent with a logarithmic end-of-period 
utility function only, it is clearly not consistent with the mean-variance approach 
to portfolio choice — which in turn is consistent with quadratic utility for 
arbitrary security return structures, and, for normally distributed returns, with 
those utility functions whose expected utilities exist when integrated with the 
normal distribution, plus a few other cases, as shown by Ziemba & Vickson [1975] 
and Chamberlain [1983]. This incompatibility is easy to understand; in solving for 
the growth-optimal strategy, all of the moments of the return distributions matter, 
with positive skewness being particularly favored. When the returns on the risky 
assets are normally distributed, no matter how favorable the means and variances 
are, the growth-optimal strategy cooly places 100% of the investable funds in the 
riskfree asset. l 

The preceding does not imply that the growth-optimal portfolio necessarily is 
far from the mean-variance efficient frontier (although this may be the case [see 
e.g. Hakansson, 1971a]). It will generally be close to the MV-efficient frontier, 
especially when returns are fairly symmetric. And as shown in Section 8, the 
mean-variance model can in some cases be used to (sequentially) generate a close 
approximation to the growth-optimal portfolios. 

Other properties of the Kelly criterion can be found in MacLean, Ziemba & 
Blazenko [1992, table 1]. 


3.2. Capital growth vs. expected utility 


Based on (10), the uniqueness properties implied by (15), and the law of large 
numbers, it is undisputable, as noted in the previous subsection, that the growth- 
optimal strategy almost surely generates more capital (under basic reinvestment) 
in the long run than any other strategy which does not converge to it. At the 
same time, however, we observed that the growth-optimal strategy is consistent 
with logarithmic end-of-period utility of wealth only. This clearly implies that 
there must be 'reasonable' utility functions which value almost surely less capital 
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in the long run more than they value the distribution generated by the Kelly 
criterion. 
Consider the family 


1 
ulw) = yU^ y <l, (16) 


to which u(w) = In(w) belongs via y = 0, and let (x, (y)) be the optimal portfolio 
sequence generated by solving 


1 
Max E E at each t — 1. 
X; y 


For simplicity, consider the case of stationary returns. Since x, (y) Æ x, (0) = xf, 
it is evident that 


1 1 
Max E [Zwe] >E [w], y #0 (17) 
even though there exist numbers a > 1 and T (e) such that 


Pr {w (y) < woa < w(x} z1-e, t» Tle) (18) 


for every (1 >)e > 0. 

Many a student of investment has stubbed his toe by interpreting (18) to mean 
that (xf) generates higher expected utility than, say, (xi(y)) . (17) and (18) may 
seem like a paradox but clearly implies that the geometric mean criterion does not 
give rise to a ‘universally best’ investment strategy. 

The intuition behind this truth is as follows. For y « 0 in (16), (17) and 
(18) occur because, despite the fact that the wealth distribution for (x,(y)) lies 
almost entirely to the left of the wealth distribution for (x7) , the lower tail of the 
distribution for (x,(y)) is shorter and (imperceptibly) thinner than the (bounded) 
left tail of the growth-optimal distribution. Thus, for negative powers, very small 
adverse changes in the lower tail overpower the value of almost surely ending up. 
with a higher compound return. Conversely, for y > 0, it is the longer (though 
admittedly very thin) upper tail that gives rise to (17) in the presence of (18) even 
though, again, the wealth distribution for (x; (y)) lies almost entirely to the left of 
the wealth distribution for (x7). 


4. Conditions for capital growth 


As already noted, the determinants of whether capital will grow or decline 
(almost surely) in the long run are given by (12) and (11). Conditions (1)- 
(2) insure that (12) is feasible; in the absence of (1)-(2), positive growth may 
be infeasible. If a positive long-run growth rate (bounded away from zero) is 
achievable, then the growth-optimal strategy will find it. Thus we can state: 
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Theorem. /n the absence of (1) and (2), a necessary and sufficient condition for 
long-run capital growth to be feasible is that the growth-optimal strategy achieves a 
positive growth rate, i.e. that for some € > 0 and large T 


E[nR,(xp)]»e t>T (19) 


For y < 0, the objective functions in (16) attain long-run growth rates of capital 
between those of the risk-free asset and of the growth-optimal strategy. But for 
y > 0, the long run growth-rate may be negative. Consider for a moment the util- 
ity function u(w) = w!/?, one of the most frequently cited examples of ‘substantial’ 
risk aversion since Bernoulli's time. Even this venerable function may, however, 
lead to (almost sure) ruin in the long-run: suppose, for example, that the riskfree 
asset yields 2% per period and that there is only one risky asset, which gives either 
a loss of 8.2% with probability 0.9, or a gain of 206% with probability 0.1. The opti- 
mal policy then calls for investing the fraction 1.5792 in the risky asset (by borrow- 
ing the fraction 0.5792 of current wealth to complete the financing) in each period. 
But the average compound growth rate g, in (10) will now tend to —0.00756, or 
—3/4%. Thus, expected utility ‘grows’ as capital itself almost surely vanishes. 

What this example illustrates is that risk aversion plus a favorable return 
structure [see (1)-(3)] are not sufficient to insure capital growth in the basic 
reinvestment case. 


5. Relationship to other long-run investment models 


As shown in Section 3, the growth-optimal investment strategy has its traditional 
origin in arguments concerning capital growth and the law of large numbers. But 
it can also be derived strictly from an expected utility perspective — but only as a 
member of a small family. 

Let n be the number of periods left to a terminal horizon point at time 0. 
Assume that wealth at that point, wo, has utility Uo(wo), where Uj > 0 everywhere 
and Uj < 0 for large wo. Then, with one period to go, we have the single-period 
portfolio problem 


Ui(wi) = Max E[Uo(wo(z1))] 


where Uj(w}) is the induced, or derived, utility of wealth w; at time 1 and the 
difference equation (6) has been trivially modified to 


. M, 
Wn-1 = Y H Fin)zin + ws + rin), n= 1, 2,.... (20) 
i=2 


Thus, with n periods to go, we obtain 


Un (Wn) = Max E[Us-iQo i). no 1,2... Q1) 


‘where (21) is a standard recursive equation. 
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The induced utility of current wealth, Up (wn), of course, generally depends on 
all the inputs to the problem, that is the utility of terminal wealth Uo, the joint 
distribution functions of future returns F,,..., F;, and the future interest rates 
rin, ---, r11- But there are two rather interesting special cases. The first is the case 
in which the induced utility functions Up (wn) depend only on the terminal utility 
function Ug. This occurs when the returns are independent from period to period 
and Uo(wo) is isoelastic, i.e. 


1 
Ug(wo) = y some y « 1. 
As first shown by Mossin [1968], (21) now gives 
Us (Wa) = GnUo(wn) + b, 
~ Up(wn) 


(where ~ means equivalent to) since a, and b, are constraints with a, positive. 
The optimal investment policy is both myopic and proportional, i.e. 
z% (Un) = Xin(y)wn, alli 


where the xj; (y) are constants. 

The second special case obtains when returns are independent from period to 
period, interest rates are deterministic, and the terminal utility function reflects 
hyperbolic absolute risk aversion, that is (Hakansson 1971c) 


+ wo + 4)", y <1, 
Y . 
Or 
Uo(wo) — $ — wo)’, y » 1, ¢ large; (22) 
or 
—exp{—dwo} > 0. 


In the first subcase 


Un(un) == (w +4) (23) 
my NO (Ar)... n) 


where (23) holds globally for ¢ < 0 and locally for ¢ > 0, i.e. for wn > La > 0. 
The optimal investment policy is 


$ ) 
* (Wn) = Xi Ua + —————————— ] , > 2. 
Zin ( n) «o n (ru)... Qr) ti? 
In the other two subcases, a closed form solution holds only locally. 
But the most interesting result associated with (21) is surprisingly general. 
Under mild conditions on Ug(wo), and independent (but nonstationary) returns 
from period to period, we obtain [Hakansson, 1974; see also Leland, 1972; Ross, 
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1974; Huberman and Ross 1983]: 
1 
Un(Wn) > yun (24) 
and, if returns are stationary, 


Zin (Wn) > Xin (VY) Wn. (25) 


Thus, the class of utility functions 
1 
u(w) = y^ y <1, (16) 


the only family with constant relative risk aversion (ranging from 0 to infinity) 
and exhibiting myopic and proportional investment policies, is evidently applicable 
to a large class of long-run investors. The optimal policies above are not mean- 
variance efficient, but for reasonably symmetric return distributions, they come 
close to MV efficiency. 

Since y = 0 in (24) corresponds to logarithmic utility of wealth, the growth- 
optimal strategy is clearly a member of this elite family of long-run oriented 
investors. In other words, the geometric mean investment strategy has a solid 
foundation in utility theory as well. 


6. Relationship to intertemporal consumption-investment models 


Up to this point, we have examined the basic dynamic investment problem, 
ie. without reference to cash inflows or outflows. Under some conditions, the 
inclusion of these factors is straightforward and does not materially affect the 
optimal investment policy. But a realistic model incorporating noncapital in- and 
outflows typically complicates the model substantially. 

The basic dynamic consumption-investment model incorporates consumption 
and a labor income into the dynamic reinvestment model. Following Fisher [1936], 
wealth is viewed as a means to an end, namely consumption. 

The basic difference equation (6) now becomes 


M, ` 
W, = NUT — ru)z ct -dri(w--c0)ty, t=1,...,T, (26) 
i=2 


where c, is the amount consumed in period ¢ (set aside at the beginning of the 
period) and y, is the labor income received at the end of period t. 
Consistent with the foregoing, the individual's objective becomes 


Max E[U (ci, ..., c7)] 
subject to 
UR all t 
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where U is assumed to be monotone, strictly concave, and to reflect impatience, 
i.e. considering the two consumption streams 


(a, b, c3,..-, cr) 
(b, a, c3,...,Cr), a»b 
the first is preferred to the second. 
In order to attain tractability, several strong assumptions are usually imposed: 
1) the individual’s lifetime (horizon) is known, 


2) interest rates are viewed as deterministic, 
3) the labor income y, is deterministic; its present value is thus 


Mi YT 
Y, 1 = =— +... + ; 
"n Flr (1-F ru)... (d rur) 


4) the utility function is assumed to be additive, i.e. 


U (ci, ..., cr) = uie) + aqua(c2) +... +01... or-aóur(er), (27) 


where u; > 0, ui’ < 0, and typically œ; < 1, for all t, which implies that preferences 
are independent of past consumption. 


Let 
fr-1(Qur1-1) = maximum expected utility at £ — 1 given w;- 1. 
This gives 
fr-1(wi-1) = Max Uu) ro E[fiQeD] ¢=1,...,T7, (28) 
where fr(wr) =Q or br(wr) 
subject to e = 0 (29) 
Pr{w, > —-Y;}=1 (30) 
a 20, igs . (31) 


for each t, where br (wr) represents a possible bequest motive. It is apparent 
that f, 4 (w;-1) represents the utility of wealth and that it is induced or derived; it 
clearly depends on everything in the model. Solving (28) recursively, it is evident 
that, under our assumptions concerning labor income and interest rates, Y, can be 
exchanged for cash in the solution. 

Suppose that in (27) 


1 
ule) =>, y«1t-1...,T. (32) 
Y 
Then [Hakansson, 1970] 


Ji-1(Wwi-1) = Aia + YLAY + Bia 
ct (wr-1) = C, (wii + Yi a), 
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zy (w1) = 0- Cr) xj, Qu +Y) i22, 


and 
M, 
zi; (w1) = wi — Cf — 3 zh noi) 
i=2 


where the A;, B,, and C, are constants. Thus, the optimal consumption and 
investment policies are again proportional, not to w;_; but to w;-, + Y;-1. The 
latter quantity is sometimes referred to as permanent income. 

Note that when y = 0 in (32), the consumer-investor does indeed employ the 
growth-optimal strategy to invested funds. 

Finally, the model (28)-(31) has been extended in a number of directions, 
to incorporate a random lifetime, life insurance, a subsistence level constraint 
on consumption, a Markov process for the economy, and an uncertain income 
stream from labor — with limited success [see Hakansson 1969, 1971b, 1972; 
Miller, 1974]. In general, closed-form solutions do not exist when income streams, 
payment obligations, and interest rates are stochastic. In such cases, multi-stage 
stochastic programming models are helpful [see e.g. Mulvey & Ziemba, 1995]. 


7. Growth vs. security 


Empirical evidence suggests that the average investor is more risk averse than 
the growth-optimal investor, with a risk-tolerance corresponding to y % —3 in 
(16) [see e.g. Blume & Friend, 1975]. While real-world investors exhibit a wide 
range of attitudes towards risk, this means that the majority of investors are in 
effect willing to sacrifice a certain amount of growth in favor of less variability, or 
greater ‘security’, 


7.1 The discrete-time case 


In view of the convergence results (24) and (25), it is evident that repeated 
employment of (16) for any y < 0 attains an efficient tradeoff between growth and 
security, as defined above, for the long-run investor. The concept of ‘efficiency’ is 
thus employed in a sense analogous to that used in mean-variance analysis. 

A number of more direct measures of the sacrifice of growth for security have 
also been examined. In particular, MacLean, Ziemba & Blazenko [1992] analyzed 
the tradeoffs based on three growth and three security measures. The three growth 
measures are: 

1. E(w; ((x,))], the expected wealth level after ¢ periods; 

2. E[g,], the mean compound growth rate over the first t periods; 

3. Eft : wi ((xi)) 2 y], the mean first passage time to reach wealth level y; 
while the three security measures are: 

4. Pr(wi((x))) = y), the probability that wealth level y will be reached in t 
periods; 
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5. Pr(wi((x))) > Bb, t = 1,2,...), the probability that the investor's wealth is 
on or above a specified path; 

6. Pr(w; ((x;)) > y before w,((x,)) x b, where b < wo < y], which includes the 
probability of doubling before halving. 

Tradeoffs were generated via fractional Kelly strategies, i.e. strategies involving 
(stationary) mixtures of cash and the growth-optimal investment portfolio. Applied 
to a stationary environment, these strategies were shown to produce effective 
tradeoffs in that as growth declines, security increases. However, these tradeoffs, 
while easily computable, are generally not efficient, i.e. do not maximize security 
for a given (minimum) level of growth. Other comparisons involving the growth- 
optimal strategy and half Kelly or other strategies may be found in Ziemba & 
Hausch [1986], Rubinstein [1991], and Aucamp [1993]. 


7.2. The continuous-time case 


Since transaction costs are zero under the perfect market assumption, it is natu- 
ral to consider shorter and shorter periods between reinvestment decisions. In the 
limit, reinvestment takes place continuously. Assuming that the returns on risky 
assets can be described by diffusion processes, we obtain that optimal portfolios 
are mean-variance efficient in that the instantaneous variance is minimized for 
a given instantaneous expected return. The intuitive reason for this is that as 
the trading interval is shortened, the first two moments of the security's return 
become more and more dominant [see Samuelson, 1970]. The optimal portfolios 
also exhibit the separation property — as if returns over very short periods were 
normally distributed. Over any fixed interval, however, payoff distributions are, 
due to the compounding effect, usually lognormal. In other words, all investors 
with the same probability assessments, but regardless of risk attitude, invest in 
only two mutual funds, one of which is riskfree [Merton, 1971]. See also Karatzas, 
Lehoczky, Sethi and Shreve [1986] and Sethi and Taksar [1988]. 

In view of the above, it is evident that the tradeoff between growth and security 
generated by the fractional Kelly strategies in the continuous-time model when 
the wealth process is lognormal is efficient in a mean-variance sense. Li [1993] 
has addressed the growth vs. security question for the two asset case while L1 & 
Ziemba [1992] and Dohi, Tanaka, Kaio & Osaki [1994] have done so when there 
are n risky assets that are jointly lognormally distributed. 


8. Applications 
6.1. Asset allocation 


In view of the myopic property of the optimal investment policy in the dynamic 
reinvestment problem [see (24) and (25)], it is natural to apply (15) for different 
values of y to the problem of choosing investment portfolios over time. In 
particular, the choice of broad asset categories, also known as the asset allocation 


Capital Growth Theory 59] 


Ch. 3. Capital Growth Theory 79 


problem, lends itself especially well to such treatment. Thus, to implement the 
growth-optimal strategy, for example, we merely solve (15) subject to relevant 
constraints (on borrowing when available and on short positions) at the beginning 
of each period. 

To implement the model, it is necessary to estimate the joint distribution 
function for next period's returns. Since all moments and comoments matter, 
one way to do this is to employ the joint empirical distribution for the previous 
n periods. This approach provides a simple and realistic means of generating 
nonstationary scenarios of the possible outcomes over time. The raw distribution 
may of course may be modified in any number of ways, for example via Stein 
estimators [Jorion, 1985, 1986, 1991; Grauer & Hakansson, 1995], an inflation 
adapter [Hakansson, 1989], or some other method. 

Grauer and Hakansson applied the dynamic reinvestment model in a number 
of settings with up to 16 different risk attitudes y under both quarterly and annual 
portfolio revision. In the domestic setting [Grauer & Hakansson, 1982, 1985, 
1986], the model was employed to construct and rebalance portfolios composed of 
U.S. stocks, corporate bonds, government bonds, and a riskfree asset. Borrowing 
was ruled out in the first article while margin purchases were permitted in the 
other two. The third article also included small stocks as a separate investment ' 
vehicle. On the whole, the growth-optimal strategy lived up to its reputation. On 
the basis of the empirical probability assessment approach, quarterly rebalancing, 
and a 32-quarter estimating period applied to 1934-1992, the growth-optimal 
strategy outperformed all the others — with borrowing permitted, it earned an 
average annual compound return of nearly 15%. 

In Grauer & Hakansson [1987], the model was applied to a global environment 
by including in the universe the four principal U.S. asset categories and up to four- 
teen non-U.S. equity and bond categories. The results showed that the gains from 
including non-U.S. asset classes in the universe were remarkably large (in some 
cases statistically significant), especially for the highly risk-averse strategies. With 
leverage permitted and quarterly rebalancing, the geometric mean strategy again 
came out on top, generating an annual compound return of 27% over the 1970- 
1986 period. A different study examined the impact from adding three separate 
real estate investment categories to the universe of available categories [Grauer & 
Hakansson, 1994b]. Finally, Grauer, Hakansson & Shen [1990] examined the asset 
allocation problem when the universe of risky assets was composed of twelve equal- 
and value-weighted industry components of the U.S. stock market. 

Mulvey [1993] developed a multi-period model of asset allocation which in- 
corporates transaction costs, including price impact. The objective function is a 
general concave utility function. A computational version developed by Mulvey 
& Vladimirou [1992] focused on the isoelastic class of functions in which the 
objective was to maximize the expected utility of wealth at the end of the planning 
horizon. This model, like those based on the empirical distribution approach, can 
handle assets possessing skewed returns, such as options and other derivatives, 
and can be extended to include liabilities [see Mulvey & Ziemba, 1995]. Based on 
historical data over the period 1979 to 1988, this research, based on multi-stage 
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stochastic programming, showed that efficiencies could be gained vis-à-vis myopic 
models in the presence of transaction costs by taking advantage of the network or 
linear structure of the problem. 


Mean-variance approximations. A number of authors have argued that, in the 
single period case, power function policies can be well approximated by MV 
policies, e.g. Levy & Markowitz [1979], Pulley [1981, 1983], Kallberg & Ziemba 
[1979, 1983], and Kroll, Levy & Markowitz [1984]. However, there is an opposing 
intuition which suggests that the power functions' strong aversion to low returns 
and bankruptcy will lead them to select portfolios that are not MV-efficient, e.g. 
Hakansson [1971a] and Grauer [1981, 1986]. It is therefore of interest to know 
whether the power policies differ from the corresponding MV and quadratic 
policies when returns are compounded over many periods. 

Let pi: be the expected rate of return on security i at time ¢ and ojj, be the 
covariance between the returns on securities i and j at time ¢. Then the MV 
investment problem is 


Max {T (1 + 14) = 397), 


subject to the usual constraints. The MV approximation to the power functions in 
(16) are obtained [Ohlson, 1975; Pulley, 1981] when 
1 
=y 
Under certain conditions this result holds exactly in continuous time [see Merton, 
1973, 1980]. 

With quarterly revision, the MV model was found to approximate the exact 
power function model very well [Grauer & Hakansson, 1993]. But with annual 
revision, the portfolio compositions and returns earned by the more risk averse 
power function strategies bore little resemblance to those of the corresponding 
MV approximations. Quadratic approximations proved even less satisfactory in 
this case. These results contrast somewhat with those of Kallberg & Ziemba [1983], 
who in the quadratic case with smaller variances obtained good approximations for 
horizons up to a whole year [see also MacLean, Ziemba & Blazenko, 1992]. 


8.2. Growth-security tradeoffs 


The growth vs. security model has been applied to four well-known gambling- 
investment problems: blackjack, horse race wagering, lotto games, and commodity 
trading with stock index futures. In at least the first three cases, the basic invest- 
ment situation is unfavorable for the average player. However, systems have been 
developed that yield a positive expected return. The various applications use a 
variety of growth and security measures that appear to model each situation well. 
The size of the optimal investment gamble also varies greatly, from over half to 
less than one millionth of one's fortune. 
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Blackjack. By wagering more in favorable situations and less or nothing when the 
deck is unfavorable, an average weighted edge is about 2%. An approximation to 
provide insight into the long-run behavior of a player's fortune is to assume that 
the game is a Bernoulli trial with a probability of success equal to 0.51. With a 
2% edge, the optimal wager is also 2% of one's fortune. Professional blackjack 
teams often use a fractional Kelly wagering strategy with the fraction drawn from 
the interval 0.2 to 0.8. For further discussion, see Gottlieb [1985] and Maclean, 
Ziemba & Blazenko [1992]. 


Horseracing. There is considerable evidence supporting the proposition that it is 
possible to identify races where there is a substantial edge in the bettor's favor 
(see the survey by Hausch & Ziemba [1995] in this volume). At thoroughbred 
racetracks, one can find about 2-4 profitable wagers with an edge of 10% or more 
on an average day. These opportunities arise because (1) the public has a distaste 
for the high probability—low payoff wagers, and (2) the public is unable to properly 
evaluate the worth of multiple horse place and show and exotic wagers because 
of their complexity; for example, in a ten-horse race there are 120 possible show 
finishes, each with a different payoff and chance of occurrence. In this situation, 
interesting tradeoffs between growth and security arise as well. 

The Kentucky Derby represents an interesting special case because of the 
long distance (1 1/4 miles), the fact that the horses have not previously run this 
distance, and the fame of the race. Hausch, Bain & Ziemba [1995] tabulated the 
results from Kelly and half Kelly wagers using the system in Ziemba & Hausch 
[1987] over the 61-year period 1934—1994. They also report the results from using 
a filter rule based on the horse's breeding. 


Lotto games. Lotteries tend to have very low expected payoffs, typically on the 
order of 40 to 50%. One way to ‘beat’ parimutuel games is to wager on unpopular 
numbers — see Hausch & Ziemba [1995] for a survey. But even when the odds 
are ‘turned’ favorable, the optimal Kelly wagers are extremely small and it may 
take a very long time to reach substantial profits with high probability. Often an 
initial wealth level in the seven figures is required to justify the purchase of even 
a single $1 ticket. Comparisons between fractional and full Kelly strategies can be 
found in MacLean, Ziemba & Blazenko [1992]. 


Commodity trading. Repeated investments in commodity trades can be modeled 
as a capital growth problem via suitable modifications for margin requirements, 
daily mark-to-the-market procedures, and other practical details. An interesting 
example is the turn-of-the-year effect exhibited by U.S. small stocks in January. : 
One way to benefit from this anomaly is to take long positions in a small stock 
index and short positions in large stock indices, because the transaction costs 
(commissions plus market impact) are less than a tenth of what they would be 
by transacting in the corresponding basket of securities. Using data from 1976 
through January 1987, Clark & Ziemba [1987] calculated that the growth-optimal 


594 N. H. Hakansson and W. T. Ziemba 


82 N.H. Hakansson, W.T. Ziemba 


strategy would invest 74% of one's capital in this opportunity. Hence fractional 
Kelly strategies are suggested. See also Ziemba [1994]. 


9. Summary 


Capital growth theory is useful in the analysis of many dynamic investment 
situations, with many attractive properties. In the basic reinvestment case, the 
growth-optimal investment strategy, also known as the Kelly criterion, almost 
surely leads to more capital in the long run than any other investment policy which 
does not converge to it. It never risks ruin, and also has the appealing property 
that it asymptotically minimizes the expected time to reach a given level of capital. 
The Kelly criterion implies, and is implied by, logarithmic utility of wealth (only) 
at the end of each period; thus, its relative risk aversion equals 1, which makes it 
more risk-tolerant than the average investor. As a result, tradeoffs between growth 
and security have found application in a rich set of circumstances. 

The fact that the growth-optimal investment strategy is proportional to begin- 
ning-of-period wealth is of great practical value. But perhaps the most significant 
property of the Kelly criterion is that it is myopic not only when returns are 
nonstationary and independent but also when they obey a Markov process. In the 
dynamic investment model with a given terminal objective function, the growth- 
optimal strategy is a member of the set to which the optimal policy converges as 
the horizon becomes more distant. Finally, the Kelly criterion is optimal in many 
environments in which consumption, noncapital income, and payment obligations 
are present. 
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The appropriate criterion for evaluating, and hence also for properly constructing, investment 
portfolios whose performance is governed by an infinite sequence of stochastic returns has long been 
a subject of controversy and fascination. A criterion based on the expected logarithm of one-period 
return is known to lead to exponential growth with the greatest exponent, almost surely; and hence 
this criterion is frequently proposed. A refinement has been to include the variance of the logarithm 
of return as well, but this has had no substantial theoretical justification. 

This paper shows that log mean-variance criteria follow naturally from elementary assumptions 
on an individual's preference relation for deterministic wealth sequences. As a first and fundamental 
step, it is shown that if a preference relation involves only the tail of a sequence, then that relation 
can be extended to stochastic wealth sequences by almost sure equality.. It is not necessary to 
introduce a von Neumann-Morgenstern utility function or the associated axioms. 

It is then shown that if tail preferences can be described by a 'simple' utility function, one that is of 
the form lim, S, PF, n) where W, is wealth at period n, this utility must under suitable conditions 
be a function of the expected logarithm of return, independent of the functional form of p. Finally, 
‘compound’ utility functions are introduced; and they are shown under suitable conditions to be 
functions of the expected value and variance of the logarithm of one-period return, again indepen- 
dently of the specific form of the underlying function. The infinite repetitions of the dynamic process 
essentially 'hammer' all utility functions into a log mean-variance form. 


I. Introduction 


Consider an investment situation wherein a fund is initially endowed with 
capital and, through returns from investment, grows (or decreases) with time. 
After the initial endowment, no additions or withdrawals are made. The initial 
endowment is simply transformed by repeated chance operations of investment 
as controlled by the portfolio structure. The management objective in this 
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situation, roughly, is to cause the investment value to increase as rapidly as 
possible, subject perhaps to considerations of security. Beyond this general 
statement it is, at least initially, difficult to define a concrete investment criterion. 
Yet this situation, even though highly idealized, closely approximates many real 
money management situations, such as those involving university endowment 
funds, mutual funds, gambling, expansion of a business enterprise, and personal 
wealth-building. It is a situation that has received a great deal of attention in the 
literature. 

To make the situation tractable, it is usually assumed that the investment 
environment is stationary in the sense that there is a fixed set of investment 
opportunities each of which are available in every period and that the returns of 
these investments are independent and identically distributed between periods 
(although there is dependency among the different investments). With this 
framework, the academic literature has traditionally taken three main ap- 
proaches to this idealized management situation. One approach is to argue, on 
the basis of very strong limit properties, that the best policy is the one that 
maximizes E log W,, where W, denotes the wealth in the fund at some terminal 
period n and E denotes the expectation operator [Kelly (1956), Brieman (1961), 
Latané (1959)]. Because of the stationarity assumption, this policy reduces to 
that of successively repeating the single-period policy of maximizing E log W,. 
The second approach is based on adherence to the expected utility framework of 
von Neumann and Morgenstern and suggests that one should maximize 
EU(W,) for some utility function U. This includes U(W,) = log W,, but there 
is no particular rationale for selecting that over any other utility function [see 
Samuelson (1971)]. A third approach, also based on the expected utility frame- 
work, suggests that one maximize E $;. , (f)'u(c;), where c; is (a monetary 
equivalent of) consumption drawn out of the wealth at period i, and f, 
0 < B < 1, is a discount factor [Samuelson (1969)]. This last approach has good 
economic justification within the von Neumann-Morgenstern framework, al- 
though the special form of additive discounted utility is quite specialized and 
does not really capture the spirit of the original (vague) objective of managing 
money which is not to be drawn down until the (distant) future. Our objective is 
not to argue that one approach is superior to another, but simply to provide 
a new preference foundation for the E log W, approach (or a generalization of it), 
so that it can be addressed by economic principles. 

There has been considerable interest in considering the limiting behavior of 
the EU(W,) criterion as n — oo, to obtain approximate and simple results. It 
has been hoped that this might provide some economic justification for the 
expected log approach. The principal technique in such investigations has been 
to impose restrictions on the utility function (usually through boundedness 
assumptions) in order to deduce that the investment policy derived from the 
Elog W, criterion also maximizes EU(W,) as n— co [cf. Markowitz (1959), 
Leland (1972)]. Such attempts have been of limited success, as elucidated in 
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Samuelson (1971) and Goldman (1974). No convincing argument has yet been 
made that reduces the general utility function approach (for a wide class of 
useful utility functions) to the E log W, (or equivalently Elog W,) approach. 

The E log W, approach has been brought closer to standard economic theory 
by generalizing it to a criterion based on a trade-off between Elog W, and 
var(log W,) [ Williams (1936)]. Specifically, one defines an efficient frontier for 
fixed n, where var(log W,) is minimized subject to Elog W, > m, for various 
values of m. An efficient policy corresponds to a point on this frontier. This 
procedure parallels the familiar mean-variance approach to single-period in- 
vestment [Markowitz (1952), Sharpe (1970), Tobin (1965), Lintner (1965)] and 
has the appeal of simplicity of that familiar procedure. Of course, this itself may 
not eliminate the conceptual gap between the asymptotic approach and classical 
utility theory, since even single-period mean-variance theory is not consistent 
with expected utility maximization except in special circumstances [Borch 
(1963)]. However, it has been shown that the log mean-variance approach! 
serves as an approximation to the terminal utility approach when uncertainty is 
‘small’ [Samuelson (1970) ]. 

Because of its intuitive appeal and simplicity, it is natural to look for stronger 
justification for log mean-variance criteria, and the central limit theorem might 
seem to provide the basis for such a justification. The distribution of the 
sum of the logarithms of n independent and identically distributed returns, 
when its mean is subtracted and the result is normalized by dividing by n!?, 
converges to a normal distribution. Ignoring the required normalization, one 
might argue that the distribution of the sum itself is close to a normal distribu- 
tion for large n. Then since this normal distribution is completely characterized 
by its mean and variance, it is plausible to use these two parameters as 
determinants for an investment criterion. This idea has been explored in 
Hakansson (19712, b), and it was found that the corresponding efficient frontier 
degenerates to a single point as n > oo , since the ratio of standard deviation to 
mean goes to zero. 

If the normalization is accounted for, the situation is far more complex, since 
there seems to be no valid limit argument. This was forcefully pointed out 
in Merton and Samuelson (1974) and observation of this limit fallacy led 
them once again to dismiss log mean-variance criteria as too simplistic. (Addi- 
tionally, the very idea of using utility at a terminal time and taking the limit 
also implicitly involves a fallacy since lim,..,, E( U(W,)) is not an expected 
utility.) 

So where does all of this leave us? The log mean criterion, or more generally, 
the log mean-variance approach is intuitively appealing, but apparently not 
consistent with standard expected utility theory. 


! The phrase ‘log mean-variance criteria’ is used throughout as shorthand for ‘criteria involving 
only the mean and variance of the logarithm of the return’. 
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In section 3 we set the stage for a new preference-based approach to the 
problem. We begin by establishing preferences on infinite sequences of wealth 
rather than wealth at a fixed (but later taken to the limit) terminal time. This 
approach circumvents the limit problems encountered by the earlier ap- 
proaches, and in fact provides a path whereby powerful limit theorems of 
probability theory can be applied. The preference relations on infinite determin- 
istic sequences are extended to stochastic sequences, not by imposition of a von 
Neumann-Morgenstern expected value criterion, but instead quite naturally by 
an 'almost sure' criterion, which is applicable when the original preference 
relation depends entirely on asymptotic properties, as is appropriate for this 
problem. It is then shown that a wide spectrum of these asymptotic preferences 
reduce to log mean-variance criteria. Thus, under quite broad conditions the log 
mean-variance approach is identical with an asymptotic preference approach. 
The justification for the variance comes about, by the way, not from the central 
limit theorem, but from its stronger cousin, the law of the iterated logarithm. 


2. The investment environment 


We begin by more explicitly defining the investment environment. There is an 
underlying vector random process Z = {Z,} defined on a probability space 
(Q, F, P). Each Z, = (Zik, Z2 . .., Zsų) is a vector of S random variables. An 
investment policy (%1, %2, . . .) isa sequence of mappings a: R? — R. This policy 
defines a new process X = {X,} of random variables by X, = a,(Z,). The 
X process is the single-period return process. The X process in turn generates 
a wealth process W according to 


W, = W,-iX;, 


where W is the (fixed) initial wealth. The investment problem, in this general 
setting, is that of selecting the policy a within a class of feasible policies that leads 
to the most desirable W process. 

It should be noted that systematic withdrawals from the accumulated wealth 
can be accommodated in this framework by incorporating them into the 
definition of a,. For instance, a withdrawal of a fraction fj, of the wealth at 
period k using investment policy sx, is treated by setting X, = x,(Zi) 
= (1 — fi)o«( Z4). 

A concrete example of the general framework is where there are S securities, 
each of which may be purchased in any amount at the beginning of each period. 


If one unit of wealth (one dollar, say) is invested in security j, j = 1,2,...,5, 
during period k, its value at the end of the period is Z;j,, where Z;, is a non- 
negative random variable. The random vector Z, = (Zi Za -- +> Zs.) thus 


describes the single-period returns for period k. At the beginning of any period k, © 
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the investment manager apportions the current wealth W,.., among the S se- 
curities by choosing a weighting vector a, = (1%, 2ks... nx) such that 
Diz: a = l, and, accordingly, invests W,.,aj in security i, The composite 
single-period return is 


X, = Qu Zig + Xu Eu oth s Zsy 07 Zr. 
Wealth at the end of the period is then 
W, = Wy Xk. 


A choice of a sequence of vectors a,,a2,... is an investment policy.” 


The analysis in this paper is based on the following: 


(4.1) Basic assumptions 


(a) The Z, are independent, identically distributed, and nonnegative on 
(Q, F, P). 

(b) The Z, are bounded above. 

(c) Policies (a,, a5, . . .) must be constant (that is, a, =o for all k = 1,2,...), 
and a: RS. > R , (no short sales). 

(d) Each a is measurable and bounded. In particular,? there is a C > O such that 
a(Z,) < C for all æ and k. 


Primarily we shall be concerned with the processes X and W derived from 
Z and a policy a. For this purpose we only require the following assumptions 
(the first two of which follow directly from above): 


(4.2) Properties of X and W 


(a) The X, are independent, identically distributed, and nonnegative random 
variables on the probability space (Q, F, P). 


For all k: 


(b X, «x C. 
(c) log X, has finite first and second moments. 
(d) W, = W,.,X,, with W = 1.* 


?]n this example a, is linear. Nonlinear mappings can accommodate options or other nonlinear 
functions of a security's return without introducing additional securities. 


*This uniform upper bound requirement is for convenience. [t can be relaxed. 


*W, = 1 is a normalization that is used for convenience only. 
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The max E log policy 


The wealth W, can be written 
W,- X,X,... X,. 
Taking the logarithm of this equation we obtain 
log W, = log X, + log X; +: + log X,, 


or equivalently, 


l 
log [W,]'" = - (log X, + log X, +++ + logX,). (1) 
Under our assumptions, as n approaches infinity the right-hand side of (1) 
approaches (almost surely) E log X, (or Elog X, for any k = 1,2,... since all 
are equal). Hence for large n 


log[ W,]'^ = Elog x, , 
and, in a formal manner, we write 


W, ~ expn(Elog X,). (2) 


Roughly speaking (2) says that wealth grows approximately exponentially 
with a coefficient in the exponent equal to Elog X,. This is an indication that 
maximization of E log X, might be desirable. This reasoning can in fact be made 
rigorous. [See Breiman (1961), Algoet and Cover (1985), and Bell and Cover 
(1980).] However, this argument is not based on a definitive concept of optimal- 
ity. Nevertheless, as shown in section 4 of this paper, the optimal log mean 
policy does in fact correspond to optimization of any 'simple' asymptotic utility 
function. 


3. Preferences on sequences 


Our approach is based on recognizing from the outset that one is selecting 
from a set of competing infinite sequences of random variables. Hence the 
preference relation should be constructed on these sequences directly. 

Consider first deterministic sequences. Let l be the set of al] such sequences 
wi, W2, .. . having the following properties: 


(i) w,20  forallk 
(ii) w,-x C* for all k, 


where C is the bound in assumptions (A.1-d) and (A.2-b). 
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Assume that on P xT there is a preference relation >. That is, = defines 


a preference relation on a pair of sequences. (If w, ve and wzzv, we say w is 
preferred to v.) Such a preference relation satisfies the following properties: 


(P.1) Completeness: For every pair v, wel, either wv or vZzw. 

(P.2) Reflexivity: For every wel, wzzw. 

(P.3) Transitivity: For every u, v, we P, if uzv and vzzw, then uz w. 
Examples of preference relations can be easily constructed by restricting 

attention to a finite set of indices k = 1,2, ...,n and using any standard 

finite-dimensional preference relation. In particular, one might define the rela- 


tion in terms of wealth at a fixed (surrogate terminal) time n; i.e., w>p if w, > Va. 
More interesting examples defining wzzv include? 


— 1 — 1 
lim 4198 w, = lim —-logu,, (3) 
— ie — dic 
lim — Y. logw, > lim = Y. logv,, (4) 
no n k=1 noo k=1 

P T (5) 
D QC) À (2C)" 
— i — | 
lim -logw,w,,; 2 lim -Og V, Vn +1 - (6) 


We assume an additional property that is really the heart of our analysis. 


(P.4) Tail Property: The preference relation depends only on the tail of the 
sequences in T. That is if w, ve T and w=», then w> for any w, vef that differ 
from w and v in at most a finite number of elements. 


The examples (3), (4), and (6) are tail preferences. 

The motivation for this assumption stems from the original proposed setting 
of the investment problem. The investor is assumed to care only about the 
long-term behavior of the investment; no finite portion is of concern. From 
a technical perspective the introduction of tail preferences is precisely the 
concept required to introduce the limit idea early, so that it need not be 


*The notation lim indicates lim sup (that is, limit superior). 
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introduced at a later stage of the logical development and then interchanged 
with some other operation. 


Preferences on stochastic processes 


Now let W be the family of stochastic wealth processes derived from the 
investment procedure of section 2. We wish to establish a preference relation on 
W xW. That is, we wish to be able to compare two different stochastic 
processes. This new preference relation should be an extension of the preference 
relation on deterministic sequences defined above. [ x F is a subset of R^ x R? 
and we let 4 be the corresponding subset of the Borel field on R? x R*. This 
leads to an additional technical requirement on the preference relation. 


(P.5) Measurability: The subset {(w, vy wzzv) of T xT is in 2. 


We now extend the preference relation to a preference relation zz, on 
stochastic wealth processes. The extension is by almost sure association. 


Axiom. Let W —(W,,W;,..., V=(V,, V2, ...) be stochastic processes in 
W. Then WZsSVif WZ V as. That is 


WzsV if P(WEV|)-1. (7) 


In other words, if for almost every realization of the processes the sequence 
generated by W is deterministically preferred to the sequence generated by V, 
then the stochastic process W is preferred to the stochastic process V. 

This is a very natural (and very weak) axiom. Notice that the usual axioms 
required to define preferences over stochastic events (through an expected utility 
criterion) are not imposed. Our stochastic preference criterion is not an ‘ex- 
pected’ criterion; it is an ‘almost sure’ criterion. Stochastic preference is induced 
by almost sure deterministic preference. In a very real sense then this preference 
structure is substantially weaker than that employed in traditional preference 
approaches to this problem. (To be sure, it is in another sense stronger because 
of the tail assumption.) 

A potential difficulty with this axiom is that it may not make sense. It might 
be that P{ W=V} = 4 in which case neither W zz; V nor Vzzs W would hold. 
A zero-one law insures that this is never a problem. 


Theorem 1. If (P.1)-(P.5) hold, the relation Zs is complete, reflexive, and 
transitive. E 


Proof. Given W and V in W, the sequence of pairs ((W;, V;)} is a process on 
(Q, £, P). Let X =(X,,X2,...)and Y 2 (Y,, Y;,...) be the corresponding 


A Preference Foundation for Log Mean-Variance Criteria in Portfolio Choice Problems 607 


D.G. Luenberger, Log mean-variance criteria in investment problems 895 


single-period return processes. Under our assumptions (A.1) and (A.2) the pairs 
(Xk, Ya) and (X,, Y;) are independent for k # £. Because X, and Y; are uniformly 
bounded, all realizations of W and V are in T. The event (W zz Vj) c Q is 
measurable by (P.5) and is a tail event on the process of pairs ((W, Vi)). 
However, any tail event in this process is a symmetric event [see Breiman (1968) 
or Shiryayev (1984)] for the process {(X;, Y;)] since such an event is not 
influenced by a permutation of any finite number of elements of the processes. 
This latter process is independent, and hence by the Hewitt-Savage Zero-One 
Law [see Breiman (1968), the probability of the event { W = V } is either zero or 
one. 


If PLW zz V) = 1, then W z-, V. Suppose, however, that P{W 2- V) — 0. We 
have 


12 P((WEV) o(VzW)) s P(WzV) + P(VzW), 


where the first equality follows from the completeness of the order >. Hence, in 
this case P(V Z W} = 1, and V zz, W. Therefore in any case either W zzs V, or 
Ves W. 

Reflexivity follows immediately from the reflexivity of =. Transitivity is fairly 
immediate also: Let U zs V, V5 W. Then U zz V and V> W except on subsets 
M, and Mz, respectively, of Q, each of measure zero. Thus by transitivity of =, 
UzW except on M, Ma. ] 


The key ingredient of this result is that measurable tail events on wealth 
processes have probability of either zero or one. It is for that reason we can 
define preference for stochastic wealth processes by almost sure association - the 
relation is always almost sure one way or the other. We exploit this property 
throughout the remainder of the paper. 


4. Simple utility functions 


A particularly useful way to describe a large class of preferences is through 
a utility function. Again, a utility function first can be introduced on determinis- 
tic sequences, that is, on T'. Then this utility, if it defines a tail preference relation, 
can be extended by almost sure association to a utility function on the set W of 
stochastic processes. 

A utility function on F is a measurable real-valued function U:T > R. It 
defines a preference relation through the definition wzzv if U(w) 2 U(v). We say 
that a utility function is a tail utility function if U(w) depends only on the tail of 
w =(w,,W2,...) [that is, U(w) = U(w) if w and w differ in at most a finite 
number of elements]. 
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We extend a tail utility function U from T to W by use of the fact that, by the 
zero-one law, if We W, then if U(W) is finite-valued, it is a degenerate random 
variable; that is, it is a constant almost surely. The value of this constant is taken 
to be the value of U(W). 

Although there is an infinite variety of possible tail utility functions, we focus 
first on what we call simple utility functions. These are defined in terms of 
limiting operations on functions of wealth at individual periods, with no interac- 
tion between periods. Specifically, simple utility functions have the form 


or 


or algebraic combinations of these forms. In these expressions, p is a continuous 
and increasing function of w, for each n. This appears to be a very general class 
of tail utility functions; and if we accept the idea of basing preferences on tail 
events, this form of utility is very natural. We shall show, however, that this form 
(with some minor restrictions) leads inevitably to the expected log return 
criterion. 

We note first that without loss of generality we may write 
p(w, n) = p(log w, n) by applying the one-to-one transformation log w, which 
maps between (0, oo) and ( — oo, oo). The resulting p(', n) is continuous and 
increasing since p(-, n) is. Finally, by adding a constant to U, if necessary, we 
may assume pí(1, n) 2 O or equivalently p(0, n) = 0. Thus we define a simple 
utility function as having the form 


U(w) = lim p(log w,, n), (8) 
or . 
U(w) = lim p(log w,, n), (9) 


or combinations of the forms (8) and (9), where p has the properties stated above. 

We extend these to the stochastic wealth process W by use of the fact, from the 
proof of Theorem 1, that tail events on W have probability of either zero or one. 
This means, in particular, that if f: C ^ R is measurable and f (w) depends only 
on the tail of the sequence w, then the random variable {(W) is constant almost 
surely. Using this fact we define 


U(W) = lim p(log W,, n), (10) 


nx 
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or 
U(W) = lim p(log Wn, n). (11) 
n-*oo 


The random variables U (W) defined above are measurable as extended random 
variables — see Shiryayev (1984, pp. 171, 358). If U(W) is finite it is constant a.s., 
and the utility is taken to be this constant. 

Not all functions p(-,-) will lead to meaningful utility functions. For example, 
if p(z, n) = z, the process W, = c" with c > t will lead to U(W) = œ. In general, 
in order to obtain finite utility for feasible wealth processes, it is necessary that 
p(z, n) satisfy appropriate growth-rate conditions with respect to n. In particu- 
lar, p must decrease with n to counterbalance possible increases in z. The 
following theorem gives suitable conditions. 


Theorem 2. For each n, let p(z, n) be continuous and increasing in z with 
p(O, n) = 0. Let X and W be processes satisfying (A.2). Assume E log X, = m, and 
define U by (10). 

(a) If lim,..,, p(nz, n) = sgn(z): oo, then 


+o, m0, 
—o, m«O. 


van - | 


(b) If lim, p(nz, n) = g(z) where g(-) is continuous, then U(W) = g(m). 


Proof. We have (1/n)log W, > m a.s. Let z, = (1/n)logw,,n = 1,2, ...,corre-- 
spond to a particular realization of the process where convergence of z, ^ m 
holds. Given £ > 0, there is an N such that |z, — m| <e for all n > N. By 
monotonicity of p it follows that 


p(n(m — £) n) < p(nz,, n) x p(n(m + £), n), 
for all n > N. Hence 


lim p(n(m ~— €),n) x lim p(nz,, n) € lim p(n(m + £), n). 
If m > 0, then for sufficiently small e > 0 both m—t» 0 and m + £ > 0. There- 
fore, if the hypothesis (a) holds, it follows that lim,.. , p(nz,; n) = oo. A similar 


argument shows that for m < Q there holds lim,.,,p(nz,,n) = — œ. If the 
hypothesis in (b) holds, then 


lim p(nz,, n) = g(m). 


n> 
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Finally, the conclusion follows since 


| 1g 
7, 08 W, =- Y logX,—m as. 


k=1 


by the strong law of large numbers, and hence the above values hold for 


lim p(log W,, n) = lim p(n tog W,, J 
n 


nc nr 


almost surely. Jj 


The parallel of Theorem 2 holds for the form (10), with lim,..,, replaced 
everywhere by lim,.,,. It is clear that in either case g(m) is an increasing 
function of m. 

Theorem 2, and the above remarks, say roughly that, to within a monotone 
transformation, the only real-valued tail simple utility function is m = E log X ,. 
We remark again, however, that although this 'appears' to be an expected value 
criterion, it is actually an 'almost sure' criterion. The expected value arises from 
the strong law of large numbers, rather than from an assumption of expected 
value utility. 

It is perhaps useful to relate the theorem more directly to our earlier 
discussion of other utility functions. For example, consideration of the 
popular utility function defined as w} on terminal wealth leads one to set 
p(logw,, n) = exp(vlogw,) in (8). However, clearly p(nz,n)— sgn(z): oo, 
leading to the infinite result of part (a) of Theorem 2. The idea can be 
salvaged by introduction of a l/n in the power, ie, wJ" or p(z, n) =e". 
This leads to U(W) — e'", which again gives the expected logarithm 
criterion. 


5. Compound utility functions 


It is apparent that there is a slight gap in Theorem 2. Part (a) covers the 
cases m > 0 and m < 0 but not m = 0. This is no real detraction from the 
theorem, for part (a) says that under its conditions the corresponding utility 
function is degenerate unless it is applied to a process with m — 0. However, 
this gap motivates the introduction of compound utility functions; since 
by subtracting out the mean m from the process, there is room for other 
possible forms. 

Another motivation for compound utility functions is purely structural. We 
let a compound utility function have a simple utility function as an argument. 
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However, by Theorem 2, we can take this simple utility function to be 
Us(W) = m. This motivates the specific compound forms 


U(W) = lim V (log W, — nm,m,n) a.s., (12) 
U(W) = lim v(log W, — nm,m,n) as., (13) 


where y is continuous and increasing in its first two arguments. As before, the 
equalities in (12) and (13) are to be interpreted in the almost sure sense. Such 
a utility, depending on m, is clearly still a tail function. 

Once we recognize the role of the mean log return m in compound utility 
functions, it is not necessary to carry it along in our analysis. Instead, we 
consider 


U(W) = lim (log W,, n) a.s., (14) 
U(W) = lim é(logg W, n) a.s., (15) 


where $(z, n) is increasing in z, for the special case where m = E log X, = 0. 
Then, later, we can simply put m back into the utility as an additional variable, 
as in (12) or (13). This restriction to the study of utility functions (14) and (15) 
amounts, really, to a further examination of simple utility functions when it is 
known that m = 0. 

A special situation is the degenerate one where W, = 1 for all n. We wish this 
to have finite utility, and hence without loss of generality we may take 
(0, n) = 0 for all n. 

As before the behavior of $ as a function of n is critical. To characterize the 
appropriate behavior of $ it turns out that the asymptotic behavior of pi /nz, n) 
is critical. 

The following proposition and its corollary rule out a large class of 
functions. 


Lemma 1. Suppose $(z, n) is increasing in z for each n with $(0, n) = 0 and 


lim lim $( /nz &) = A. 


zo no 


Let X and W be processes satisfying (A.2) and assume Elog x, =0, 
var(log X,) = o? > 0. Then, for U defined by (14), U(W) > A. 
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Proof. Since $(0,n) = 0, it follows that A > 0. In order to treat the case 
A= œ as well as A < oc, we prove that if 


lim lim pinz, n)zA', 


with A’ € co, then U(W) > A’. The lemma follows from this. 


Let p(z) = lim, +. (nz, n). Select e > 0. Then there is a Z such that 
p(z) > A’ — e. Then, by the definition of p, there is N such that for all n > N, 


nZ, n) > A‘ — 2e. By the monotonicity of ġ{-, n) we have 


pl /nz, n) > (nz, n)> A’ — 2e, 


for all z > z and all n » N. 
Thus,? 


log W, 
Pal Va 08 ZE A' — 2€ io p} SE. Z io. =1, 
Vn Vn 


where the last equality follows from the law of the iterated logarithm, 


lim eet as 
^. (267 nloglogn)? — UU 


which shows that lim, -... (log W,)/s/n = oo a.s. Therefore U(W)> A’ — 2e. 
Since this is true for all ¢ > 0, it follows that U(W) >A". Jj 


Corollary. If for all z > 0, 


lim $( /nz, n) 2 x, 


then, for U defined by (14), U(W) = x. 
Proof. We have A= œ in the above lemma. J 


Let us consider some examples. The function $(z, n) = z/n corresponds, 
according to Theorem 2 of the previous section, to the simple utility function 
with value U( W) = m; and hence in this context, with m = 0, U(W) =0. A =0 
for this function, so Lemma 1 gives U(W) > 0. The reverse inequality can be 
established by a symmetric argument (see Lemma 3), and together these agree 
with what is known from Theorem 2. 


*i.o. denotes ‘infinitely often’. 


A Preference Foundation for Log Mean-Variance Criteria in Portfolio Choice Problems 613 


D.G. Luenberger, Log mean-variance criteria in investment problems 90t 


z 


Fig. 1. An unusual function that is decreasing in n above 4. 


In general, if $(z, n) = z/q(n), then clearly q(n) must go to infinity faster than 
Jn in order that A be finite and hence for the corresponding compound utility 


function to be real-valued. For example, the function plz, n)  z/ fn has 
A= o. 


Definition. Let A be a constant. The function of two variables pi nz, n) is 
said to be decreasing with respect to n above A if $ (nz, n) A implies 


pln + lzn+lb< e nz, n). 


The monotonicity property with respect to n is quite reasonable even if A > 0. 
An example is shown in fig. I. 

Corresponding to a function $(z, n) that is continuous and strictly increasing 
in z for each n, let f(-,n): D, + R [where D, is the range of $(-, 1)] be the 
inverse function of $(^, ny; that is, 


f(ó(z, n, n) =z, 


for every z. This inverse exists and is also strictly increasing. We need the 
following result which shows that if $(A/nz,n) is decreasing with respect to n, 
then f/vn is increasing with respect to n. 


Lemma 2. Suppose $(./n + 1z,n+1)< d nz, n) — r. Then, 


f(r, ny fn x fir, n fn a1. 
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Proof. We have 
é( nz, n) =r, 
d(./n+lznt+ las, 


with s < r. By applying f to these we have 
z= f(r nn, 
z=f(snt+l//nt 1 &f(r,n 9 D/i/n- 1, 


where the inequality follows from the monotonicity of f(:,n + 1). Therefore 


fir, n fn x fon ynn M 


We are now prepared to state the basic result of this section. We show, 
essentially, that any compound utility function of wealth (14), derived from 
a process with zero mean (of the logarithm), is a function only of 
c? = var(log X,) (or, equivalently, of ø). 


Proposition 1. Let U be defined by (14) and assume that $(z, n) is continuous and 
strictly increasing with respect to z for each n, and that $(0, n) = 0. Suppose 


further that ó/n z, n) is decreasing with respect to n above A as defined in Lemma 
1. Then there is a function h on R , possibly taking values — oo or + co such that 


(i) h(a) is nondecreasing with respect to o; 


(ii) for any process X and W satisfying (A.2) with m = E(log X1) « 0, 
var(log X,) = o? > 0, there holds 


U(W) = h(o). 


Proof. Defining A as in Lemma 1, we know that U(W) > A. For any r > A, we 
have U(W) >r if and only if 


Píó(logW, n) »r ioj-—l. 


This is equivalent to 


pte m, , finm ie] Ll. 


on o n | 
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According to the assumptions on @ and the above lemma, y, = f(r, n)/(o Jn) is 
increasing for r > A. Furthermore, f(r,n) » 0 for r> A z 0. Therefore the 
extended law of the iterated logarithm [Feller (1943), Breiman (1968, p. 297)] 
states that the probability above is either zero or one, depending on whether the 
sum 


converges or diverges, respectively. If the sum converges for a given sequence 
{yn}, then it will converge also for {y} with y, > ya. This is because (d/dx) 
x [xe 7] = [1 — x?]e^*7? < 0 for x > 1. Thus, for sufficiently large n, each 
term in the sum for | y;) is smaller than the corresponding term in the sum for 
{yn}. Therefore, if the sum converges for a given value of r, it also converges for 
r' >r. It follows that 


U(W) = ilr Y, Gne? converges. 


n=) 


In a similar way, if the series converges for a given ø, it will also converge for 
a’ « c. Hence U(W) = h(c) where h(-) is nondecreasing. J 


A useful $ for consideration is 


Z 


$an) = c loglogn)i? 


In this case the standard law of the iterated logarithm states that 


log W, 


U(W) = lim ———>—"— = 
(W) = lim (2n log log n)*/? 


noo 


G as., 


and hence this $ gives h(a) = c. 

Proposition 1 shows that an investor with a utility function of the general 
form (14), with the lim operation, prefers increased variance. Increased variance 
increases the extent of upward variations, and presumably the investor values 
these peaks. 

The comparison to Próposition ! treats utilities of form (15), with the lim 
operation. In this case the ibvestor wants to increase the lowest deviations, and 
hence such an investor values low variances more than high variances. 

Specifically, we have the following results. 
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Lemma 3. Suppose $(z, n) is increasing in z for each n with (0, n) = 0 and 


lim lim é( /nz, n) = B. 


T>- nx 


Let X and W be processes satisfying (2) and assume Elog X, — 0, 
var(log X,) = o? > 0. Then, for U defined by (15). U(W) < B. 


Proposition 2. Let U be defined by (15) and assume that dz, n) is continuous and 
strictly increasing with respect to z for each n, and that (0, n) = 0. Suppose 


further that ln z, n) is increasing with respect to n below B as defined in Lemma 
3. Then there is a function g on R , possibly taking values — oc or + oo such that 


(i) g(a) is nonincreasing with respect to a; 


(ii) for any processes X and W satisfying (4.2) with m = E(log X,) — 0, 
var(log X,) = a? > 0, there holds 


U(W) = g(a). 


Finally, we may combine Propositions ! and 2 and insert the explicit depen- 
dence of a general compound utility function on the mean m. We let j(z, m, n) be 
continuous and strictly increasing with respect to z and increasing with respect 
to m. 


Theorem 3. Let U be defined by either 
(a) U(W) = lim, (log W, — nm, m, n) a.s., or 
(b) U(W) = lim, (log W, — nm, m, n) a.s. 


Correspondingly, assume that for each m, $,(z, n) = y(z, m, n) satisfies the condi- 
tions of Propositions | or 2. Then there is a function f on R x R, possibly taking 
values — oo or + oo such that 


(i) f(m, a) is increasing with respect to m and either increasing or decreasing with 
respect to o corresponding to (a) or (by; 


(ii) for any processes X and W satisfying (4.2) with m= Elog X,, 
var(log X ,) = c? > 0, there holds U(W) = f(m, c). 


For a given process Z and a given family of feasible investment policies, one 
can define the set S of attainable (m, c) pairs (achieved as the policy ranges over 
all possibilities). If the set $ is compact, the right frontier of the set S is the 
right-most boundary of this set; that is, the points (m, a) having the largest a for 
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a given m. Then, according to the above theorem, an investor with a utility 
function of the form (a) will select a policy corresponding to a point on this 
frontier. Similarly, an investor with a utility function of the form (b) will select 
a policy corresponding to a point on the left frontier. 


6. Conclusion 


We have approached infinite-horizon investment situations by consider- 
ing preference orders on infinite sequences of wealth, and we suggested that 
tail preferences are appropriate if the goal of investment is long-term 'wealth 
building. We showed that if the tail preference takes the form of a simple 
utility function, then utility must be equivalent to the expected logarithm 
of return. If preferences are represented by a compound utility function, that 
function will be equivalent to a function of the expected logarithm and 
variance of the logarithm. 

Although our results require that certain technical assumptions be satisfied, 
they are robust enough to support, at least roughly, the statement that: a tail 
utility function involving the limits of total return must be equivalent to a 
log mean-variance criterion. We do not anticipate, however, that the con- 
troversy over the use of such a criterion will subside. Indeed it should not. 
Our result is based on the tail preference concept, which is an idealization 
of what is generally a complex economic problem. However, since the log 
mean-variance approach is so intuitively appealing, it is nice to have a 
framework in which it is validated. 
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Abstract 


This paper provides an alternative behavioral foundation for an investor’s use of power utility 
in the objective function and its particular risk aversion parameter. The foundation is grounded in 
an investor’s desire to minimize the objective probability that the growth rate of invested wealth 
will not exceed an investor-selected target growth rate. Large deviations theory is used to show 
that this is equivalent to using power utility, with an argument that depends on the investor’s 
target, and a risk aversion parameter determined by maximization. As a result, an investor’s risk 
aversion parameter is not independent of the investment opportunity set, contrary to the standard 
model assumption. 
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1. Introduction 


What criterion function should be used to guide personal investment decisions? Per- 
haps the earliest contribution was Bernoulli’s critique of expected wealth maximization, 
which led him to advocate maximization of the expected log wealth as a resolu- 
tion of the St. Petersburg Paradox. This was resurrected as a long-term investment 
strategy in the 1950s, and is now synonymously described as either the log optimal, 
growth-maximal, geometric mean, or Kelly investment strategy. As also noted in their 
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excellent survey on this portfolio selection rule, Hakansson and Ziemba (1995, 
pp. 65-70) argue that “...the power and durability of the model is due to a remarkable 
set of properties", e.g. that it “almost surely leads to more capital in the long run than 
any other investment policy which does not converge to it". ! 

But even as a long-term investment strategy, the log optimal portfolio is problem- 
atic. It often invests very heavily in risky assets, which has led several researchers to 
highlight the possibilities that invested wealth will fall short of investor goals, even 
over the multi-decade horizons typical of young workers saving for retirement. For 
example, MacLean et al. (1992, p. 1564) note that "the Kelly strategy never risks ruin, 
but in general it entails a considerable risk of losing a substantial portion of wealth". 
Findings like these motivated Browne (1995, 1999a) to develop a variety of alter- 
native, shortfall probability-based criteria, in specific continuous-time portfolio choice 
problems. Browne (1999b) considers these ideas in the context of the simplest possi- 
ble investment decision, which will also be utilized to illustrate the criterion developed 
herein. Further discussion of his work is included in Section 2.3. Another similarly 
motivated criterion for continuous time portfolio choice is developed in Bielecki et al. 
(2000), which will be discussed further in Section 2.2. 

The problem is exacerbated when investors have specific, short to medium term val- 
ues for their respective investment horizons. If so, some criteria will lead to horizon- 
dependent optimal asset allocations, but others will not. For example, Samuelson (1969) 
proposes the criterion of intertemporal maximization of expected discounted, time- 
additive constant relative risk aversion (CRRA) power utility of consumption. He 
proves that when asset returns are IID, portfolio weights are independent of the hori- 
zon length. So in that case, long horizon investors should not invest more heavily 
in stocks than do short horizon investors. Samuelson (1994) provided caveats to this 
investor advice, citing six modifications of this specification that will result in horizon 
dependencieg. ? 

But an investment advisor, hired to help an investor formulate asset allocation ad- 
vice, may have difficulty determining a specific value for the investor's horizon. The 
advisor may be unable to determine an’ investor’s exact horizon length when it ex- 
ists, while other investors may not have a specific investment horizon length at all. 
A considerable simplification results when an infinite horizon is assumed, as has also 
been done when deriving many, but not all, consumption-based asset pricing models.? 
An exception to the infinite horizon formulations is found in Detemple and Zapatero 
(1991). Of course, the cost of this simplification is the inability to model horizon 
dependencies. 

While the time horizon parameter is irrelevant for Samuelson's intertemporal power 
utility investor with IID returns, the optimal asset allocation is still very sensitive to the 
specific risk aversion parameter adopted, so an advisor would have to determine it with 


l See the analysis of Algoet-and Cover (1988) and the lucid exposition of Cover and Thomas (1991, 
Chapter 15) for more information on the growth maximal portfolio problem. For a spirited normative defense 
of the growth maximal portfolio criterion, see Thorp (1975). 

? However not all of these modifications would support the oft-repeated advice to invest more heavily in 
stocks when the investor's horizon is longer. 

3 For a survey, see Kocherlakota (1996). 
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precision. An even more basic consideration is specification of the utility functional 
form and its argument. Should it be a power function, or an exponential function, or 
perhaps some function outside the HARA class? Should the argument be a function of 
current wealth, current consumption, or some function of the consumption path (as in 
habit formation models)? As a first step toward answering these questions, Section 2 of 
this paper develops a new criterion of investor behavior. It starts from the observation 
that the realized growth rate of investor wealth is a random variable, dependent on 
the returns to invested wealth and the time that it is left invested (i.e. the investment 
horizon). To obviate the need to specify a value for the latter, first assume that an 
investor acts as-if she wants to ensure that the (horizon-dependent) realized growth 
rate of her invested wealth will exceed a numerical target that she has, e.g. 8% per 
year. By choosing a portfolio that results in a higher expected growth rate of wealth 
than the target rate, the investor can ensure that the probability of not exceeding the 
target growth rate decays to zero asymptotically, as the time horizon 7 — oo. But 
the probability that the realized growth rate of wealth at finite time 7 will not exceed 
the target might vary from portfolio to portfolio. Which portfolio should be chosen? 
Without adopting a specific value of 7, a sensible strategy is to choose a portfolio that 
makes this probability decay to zero as fast as possible as T — oo. This will ensure 
that the probability will be minimized for all but the relatively small values of 7. In 
other words, the decay rate maximizing portfolio will maximize the probability that the 
realized growth rate will exceed the target growth rate at time T, for all but relatively 
small values of T. In fact, this turns out to be true for all values of T in the special 
IID cases studied in Sections 2.1 and 3. mE 

Calculation of the decay rate maximizing portfolio is enabled by use of a simply 
stated, yet powerful result from large deviations theory, known as the Gártner-Ellis 
Theorem. Straightforward application of it in Section 2.2 provides an expected power 
utility formulation of the decay rate criterion. But there are two important differences 
between this formulation and the standard expected power utility problem. First, the 
argument of the utility function is the ratio of invested wealth to a level of wealth 
growing at the constant target rate. Second, the value of the power, i.e. the risk aversion 
parameter, is also determined by maximization. As a result, a decay rate maximizing 
investor's degree of relative risk aversion will depend on the investment opportunity 
set, an effect absent in extant uses of power utility. 

Because this endogenous degree of risk aversion is greater than 1, the third deriva- 
tive of the utility is positive, so there is also an endogenous degree of skewness 
preference. This is fortunate, as some have argued that skewness preference helps 
explain expected asset returns. To see why, note that in the standard CAPM, in- 
vestor aversion to variance makes an asset return’s covariance with the market re- 
turn a risk factor, so it is positively related to an asset's expected return. Kraus and 
Litzenberger (1976) argue that investor preference for positively skewed wealth dis- 
tributions (ceteris paribus) should make market coskewness an additional factor, that 
should be negatively related to an asset's expected retum. They thus generalized the 
standard CAPM to incorporate a market coskewness factor. The estimated model sup- 
ports this implication of investor skewness preference. Harvey and Siddique (2000) ex- 
tended this approach by incorporating conditional coskewness, concluding that “a model 
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incorporating coskewness is helpful in explaining the cross-sectional variation of asset 
returns", ^ 

The decay rate maximization criterion also nests Bernoulli's expected log maximiza- 
tion (a.k.a. growth optimal) criterion. An investor who has a target growth rate suitably 
close to the maximum feasible expected growth rate has an endogenous degree of risk 
aversion slightly greater than 1. As a result, the associated decay rate maximizing port- 
folio approaches the expected log maximizing portfolio. If the investor's target growth 
rate is lower, the investor uses a higher degree of risk aversion, and the associated 
decay rate maximizing portfolio is more conservative, with a lower expected growth 
rate, but a higher decay rate for the probability of underperforming that target growth 
rate (and hence a higher probability of realizing a growth rate of wealth in excess of 
that target). The (perhaps unlikely) presence of an unconditionally riskless asset, i.e. 
one with an intertemporally constant return, provides a floor on the attainable target 
growth rates. When the target growth rate is sufficiently near that floor, the investor's 
risk aversion will be quite high, and the associated decay rate maximizing portfolio 
will be close to full investment in the unconditionally riskless asset. The relationship 
between the target growth rate and the associated (maximum) decay rate of the prob- 
ability that it will not be exceeded quantifies the tradeoff between growth and shortfall 
risk that has concerned analysts studying the expected log criterion. 

Exact calculation of the decay rate (or equivalently, the expected power utility) 
requires the exact portfolio return process. In practice, the distribution is not exactly 
known. Even if its functional form is known, its parameters must still be estimated. To 
cope with this lack of exact knowledge, Section 3 adopts the assumption that portfolio 
log returns are IID with an unknown distribution, and follows Kroll et al. (1984) in 
estimating expected utility by substitution of a time average for the expectation operator. 
The estimated optimal portfolio and endogenous risk aversion parameter are those that 
jointly maximize the estimated expected power utility. An illustrative application of this 
estimator is included, contrasting decay rate maximization to both Sharpe Ratio and 
expected log maximization when allocating funds among domestic industry sectors. In 
it, decay rate maximization selects portfolios with higher skewness than Sharpe ratio 
maximization does. The IID assumption that underlies the estimator also permits the 
use of both a relative entropy minimizing, Esscher transformed log return distribution 
and a cumulant expansion to help interpret the empirical findings. 

Section 4 summarizes the most important results, and concludes with some good 
topics for future research. 


2. Porfolio analysis 


Following Hakansson and Ziemba (1995, p. 68), the wealth at time T resulting from 
investment in a portfolio is Wr = Wo inem R ji, where R p is the gross (hence positive) 


^ Hence, it is possible that an asset pricing model incorporating decay rate maximizing investors could 
outperform the CAPM, which incorporates Sharpe ratio maximizing investors. This topic is left for another 


paper. 
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rate of return between times t — 1 and ¢ from a portfolio p. Note that Wr does not 
depend on the length of the time interval between return measurements, but only on 
the product of the returns between those intervals. Dividing by Wo, taking the log of 
both sides, multiplying and dividing the right-hand side by T and exponentiating both 
sides produces the alterative expression 


Wr = Wy [eb enar" = Wole! oT. (1) 


From (1), we see that Wr is a monotone increasing function of the realized time 
average of the log gross return, denoted log Rp, which is the realized growth rate of 
wealth through time T. When the log retum process is ergodic in the mean, this will 
converge to a number denoted E[log Rp], as T — oo. Accordingly, there was early (and 
still continuing) interest in the portfolio choice that maximizes this expected growth 
rate, i.e. selects the portfolio arg max , E[log Rp], also known as the “growth optimal" 
or "Kelly" criterion. As noted by Hakansson and Ziemba (1995, p. 65) “...the power 
and durability of the model is due to a remarkable set of properties", e.g. that it “almost 
surely leads to more capital in the long run than any other investment policy which 
does not converge to it".? 

But maximizing the expected log return often invests very heavily in assets with 
volatile returns, which has led several researchers to highlight its substantial downside 
performance risks. Specifically, we will now examine the probability of the event that 
the realized growth rate of wealth log R, will not exceed a target growth rate logr 
specified by the investor or analyst. This is an event that will cause Wr in (1) to fail 
to exceed an amount equal to that earned by an account growing at a constant rate 
logr. The following subsection uses a simple and widely analyzed portfolio problem 
to calculate this downside performance risk for the growth optimal portfolio and a 
portfolio chosen to minimize it. 


2.1. The normal case 


A simple portfolio choice problem, used in Browne (1999b), requires choice of a 
proportion of wealth p to invest in single stock, whose price is lognormally distributed 
at all times, with the rest invested in a riskless asset with continuously compounded 
constant return i. In this case, log Rp ~ HD M (Ellog Rp], Var[log R5]). We now com- 
pute the probability that log Rp < logr. Because the retums are independent, log Rp ~ 
A (Ellog Rp], Var[log Rp]/T). The elementary transformation to the standard normal 
variate Z shows that the desired probability is 


log r — Eflog Rp] 


V Var[Ry/T (2) 


Prob[log Rp € logr] = Prob |Z < 


5 See the analysis of Algoet and Cover (1988) and the lucid exposition of Cover and Thomas (1991, 
Chapter 15) for more information on the expected log criterion. For a spirited normative defense of this 
criterion, see Thorp (1975). 
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In order to minimize (2), i.e. to maximize the complementary probability that 
log Rp > logr, one must choose the proportion of wealth p to minimize the expres- 
sion on the right-hand side of (2). This is equivalent to maximizing —1 times this 
expression. Independent of the specific value of T, this portfolio stock weight is 


E[log R,] — 
arg max [log Rp] — logr 


x Varlog A, ©) 


Portfolio (3) will differ considerably from the following growth optimal portfolio 


arg max Eflog Rp] (4) 


because of the presence of the target logr in the numerator of (3) and the standard 
deviation of the log portfolio return in its denominator. Portfolio (3) will also differ 
from the following Sharpe Ratio maximizing portfolio: 
are max E[R,] - i 
max ——— —— 
d PoX/Var[(R,] 
because of the presence of the presence of the target log r in (3) in place of the riskless 
rate i in (5), and because of the presence of log gross returns in (3) in place of the 
net returns used in (5). 
It will soon prove useful to reformulate the rule (3) in the following way. Note that 
Prob[log Rp < logr] will not decay asymptotically to zero unless the numerator of (3) 
is positive, so we need only consider portfolios p for which 


(5) 


E[log Rp] > logr, (6) 


in which case the objective in problem (3) can be equivalently reformulated by squar- 
ing, and dividing by 2. The result is the following criterion: 


2 
1 ( EllogR,] — logz 
max D,(logr) = 24 JVarlogR;] |` ' 
arg max p(logr) «mni Var[log Rp] T 


In order to quantitatively compare criteria (4), (5), and (7), it is useful to follow 
Browne (1999b) in using a parametric stochastic stock price process that results in 
the stock price being lognormally distributed at all times ¢, so that log Ry, ~ UD 
(E[log Rp], Var(log R5]) as assumed above. Specifically, the stock price S follows 
the geometric brownian motion with drift dS/S = mdt + vdW, where m denotes the 
instantaneous mean parameter, v denotes the instantaneous volatility parameter, and W 
denotes a standard Wiener process. The bond price B follows dB/B = idt. Denoting 
the period length between times ¢ and t+ 1 by At, Hull (1993, p. 210) shows that 


E[log Rp] = (pm + (1 — p)i — pv /2)At, (8) 


Var[log Rp] = p^v' AL. (9) 


Now substitute Eqs. (8) and (9) into (7), and write down the first-order condition 
for the maximizing stock weight p. You can verify by substitution that the following 
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p solves it: 


2(log r — i) 
D, =, = 
arg max p(logr) . (10) 


Using (8), the growth optimal criterion (4) yields the portfolio 


(m — i) 
E[log R,] — . 
arg max [log Rp] 3 (11) 


Using Browne's (1999b, p. 77) parameter values m = 1596, v = 30%, i = 7%, and 
a target growth rate log r = 8%, the outperformance probability maximizing rule (10) 
advocates investing a constant p=47% of wealth in the stock, while the growth optimal 
rule (11) advocates p=89%. Of course, (10)'s p=47% minimizes the probability that 
the realized growth rate log Rp < 8%. Fig. 1 illustrates the phenomena, by graphing 
Prob[log Rp < 8%] for the two portfolios, and a third portfolio with just 33% invested 
in the stock. It shows that Prob[log R, < 8%] decays to zero for all three portfolios, 
but decays at the fastest rate when (10)'s p=47% is used. Section 2.2 will show that 
the rate of probability decay rate in Fig. 1 is D,(logr) in (7). Fig. 1 also shows that 
even though investors can invest in a riskless asset earning 7%, and can try to beat 
the modest 8% target growth rate by also investing in a stock with an instantaneous 
expected return of 1596, there is still almost a 2096 probability that the investor's 
realized growth rate of wealth after 50 years will be less than 8%! 

Table 1 contrasts performance statistics for the outperformance probability maximiz- 
ing portfolios and the growth optimal portfolio p = 89% over the feasible range of 
target growth rates logr. Because the nskless rate of interest is only 796, the prob- 
ability of earning more than a target rate logr > 7% is always less than one. If the 
target rate logr < 7%, the investor could always ensure outperforming that rate by in- 
vesting solely 1n the riskless asset. Hence the lower limit of the feasible target growth 
rates is the 7% riskless rate. Line 1 in Table 1 shows that in order to maximize 
the probability of outperforming a target growth rate one basis point higher than this, 
i.e. logr = 7.01%, the investor need invest only p = 5% of wealth in the stock. As a 
result of this conservative portfolio, this investor will have a relatively low probabil- 
ity of not exceeding this target; the decay rate of the underperformance probability is 
max p D5(7.01) — 3.1996. But by investing 89% of wealth in the stock, the growth opti- 
mal investor will have a higher probability of not exceeding this 7.0196 target, because 
its associated decay rate is just 0.88%. This occurs despite its much higher expected 
growth rate E[log Rp] (10.6% vs. 7.4%) and higher expected net return u= pm-+(1— p)i 
(14.1% vs. 7.4%). Of course, the major reason for this is its higher volatility o = pv 
(26.7% vs. 1.5%), which increases the probability that a bad series of returns will drive 
the growth optimal portfolio's realized growth rate below log r = 7.01%. Also note in 
line 1 that in order to maximize the probability of outperforming the 7.0196 target, the 
investor must choose a portfolio with a higher expected growth rate (7.4%) than the 
target, as explained earlier. 


6 Of course, if a riskless rate does not exist, it would not provide a floor on the feasible target rates. 
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Underperformance Probabilities 
0.5 log r = 8% 
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Fig. 1. The probability of not exceeding the 8% target growth rate approaches zero at a portfolio dependent 
rate of decay. The rate of decay is highest for the portfolio with p = 47% invested in the stock. 


There is an important tradeoff present in Table 1. Note from columns | and 3 
that investors with successively higher growth targets logr have successively lower 
underperformance probability decay rates max, D,(logr). This implies that investors 
with higher targets will be exposed to a higher probability of realizing growth rates 
of wealth that do not exceed their respective targets. This occurs despite the fact that 
they did the best they could to minimize the probability of that happening. This is a 
consequence of the successively more aggressive portfolios needed to ensure asymptotic 
outperformance of their successively higher targets. 

This tradeoff is analogous to the tradeoff between mean and standard deviation asso- 
clated with the efficiency criterion that selects the portfolio with the smallest standard 
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Table | 
Performance Statistics for the maximum expected log portfolio and maximum decay rate portfolios associated 
with feasible target growth rates logr, when portfolios are formed from a lognormally distributed stock with 
m=15% instantaneous mean return and v=30% instantaneous volatility, and a riskless asset with instantaneous 
riskless rate i = 796 
ee 
Performance of portfolio (11) vs. portfolio (10) 
— 


log r% Stock weight p% Dp(log r)% E[iog Rp]% u% o% 
o LLL 
7.01 89 (5) 0.88 (3.19) 10.6 (7.4) 14.1 (7.4) 26.7 (1.5) 
7.5 89 (33) 0.66 (1.39) 10.6 (9.2) 14.1 (9.6) 26.7 (9.9) 
8.0 89 (47) 0.46 (0.78) 10.6 (9.8) 14.1 (10.8) 26.7 (14.1) 
8.5 89 (58) 0.30 (0.44) 10.6 (10.1) 14.1 (11.6) 26.7 (17.4) 
9.0 89 (67) 0.17 (0.22) 10.6 (10.3) 14.1 (12.4) 26.7 (20.1) 
9.5 89 (75) 0.08 (0.09) 10.6 (10.5) 14.1 (13.0) 26.7 (22.5) 
10.0 89 (82) 0.02 (0.02) 10.6 (10.5) 14.1 (13.6) 26.7 (24.6) 
10.6 89 (89) 0.00 (0.00) 10.6 (10.6) 14.1 (14.1) 26.7 (26.7) 


deviation of return, once the investor fixes a mean return. Here, the criterion selects the 
portfolio with the highest underperformance probability decay rate, once the investor 
fixes a target growth rate. In this way, the tradeoff between logr and max, D,(logr) 
can be thought of as an alternative efficiency frontier, which yields the growth opti- 
mal portfolio on one extreme and full investment in the constant interest rate (when 
it exists) on the other. The efficiency frontier is graphed in Fig. 2, which shows it to 
be a convex curve in this example. In Section 2.2, we will see that this is true more 
generally, 1.e. with multiple risky assets, whose log returns are not necessarily normal 
nor IID. 

Finally, there is just one risky asset used to form the optimal portfolios in Table 1, 
so the mean-standard deviation efficiency frontier is just swept out by varying the stock 
weight and calculating the mean p and standard deviation c of the net returns. For 
comparison purposes, this is reported in the last two columns of Table 1. Reading down 
the last three columns of the table, note that the difference between x and the expected 
growth rate E[logR,] of wealth grows wider as the standard deviation of portfolio 
returns c gets larger. The mean return increasingly overstates the expected growth 
rate of wealth as portfolio volatility increases. This is due to (1); as Hakansson and 
Ziemba (1995, p. 69) note, “...capital growth (positive or negative) is a multiplicative, 
not an additive process”.’ Here, due to lognormality, there is a precise relationship 
between the two: E[log Rp] = 4 — o?/2 (see Hull, 1993, p. 212). 

The following section will show that a simple, yet powerful result from large devia- 
tions theory permits us to rigorously characterize D,(logr) in Table 1 as the decay rate 
of the portfolios’ underperformance probabilities graphed in Fig. 1. More importantly, 
the result also shows how to correctly calculate the decay rate and associated decay 
rate maximizing portfolios when portfolio returns are not lognormally distributed. 


7 fn this regard, see Stutzer (2000) for a simpler model of fund managers who use arithmetic average net 
returns rather than average log gross returns, under the assumption that net returns are IID. 
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Fig. 2. The convex tradeoff between the target growth and underperformance probability decay rates for the 
optimal portfolios in Table 1. The convexity is generic. 


2.2. The general case 


As shown in the last section, when a portfolio's log returns were HD normally 
distributed, exact underperformance probabilities of the realized growth rate could 
be easily calculated using (2). But it is widely accepted that stock returns are of- 
ten skewed and leptokurtotic. Even if they weren't, the skewed returns of derivative 
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securities like stock options are inherently non-normally distributed. Hence there is 
an important need to rank portfolios according to their underperformance probabilities 
Prob|log R, < logr] in non-IID, non-normal circumstances. It is now shown how to 
calculate the decay rate of this probability in more general cases. We will then apply 
the general result to prove that D,(logr) in (7) is indeed the correct decay rate for 
the IID normal case. 

As in the previous section, we seek to rank portfolios p for which the underperfor- 
mance probability Prob[log R, < logr] — 0 as T — oo. Calculation of this probabil- 
ity's decay rate D,(logr) is facilitated by use of the powerful, yet simple to apply, 
Gártner-Ellis Large Deviations Theorem, e.g. see Bucklew (1990, pp. 14-20). For a 
log portfolio return process with random log return log R, at time t, consider the 
following time average of the partial sums’ log moment generating functions, i.e.: 


z 0 
$(8) = lim. = log Efe?" log p] = Jim 7 loge l3 | : (12) 
where the last expression is found by usipg (1) to compute Wr/W) = [T, R pts Sub- 
stituting log(W7/Wo) for the sum of the logs in (12), and simplifying. Hence (12) 
depends on the value of the random Wr, and so does not depend on the particular 
discrete time intervals between the log returns log Rp. We maintain the assumptions 
that the limit in (12) exists for all 0, possibly as the extended real number +00, 
and is differentiable at any 0 yielding a finite limit. From the last expression in (12), 
these assumptions must apply to the asymptotic growth rate of the expected power of 
Wr/Wo. Some well-analyzed log return processes will satisfy these hypotheses, as will 
be demonstrated shortly by example. However, these assumptions do rule out some pro- 
posed stock return processes, e.g. the stable Levy processes with characteristic exponent 
X < 2 and hence infinite variance, used in Fama and Miller (1972, pp. 261—274). 
The calculation of the decay rate D, (log) is the following Legendre—Fenchel trans- 
form of (12): 


D,(logr) = max 0logr — (0). (13) 


When log returns are independent, but not identically distributed, (12) specializes to 
D [s 
Lm OlogRui.. J; 0 
$(0) = jim T 3 log Efe” ?£^»] = jim 7 2 log E[R;,]. (14) 


When log returns are additionally identically distributed (IID), (12) simplifies to 
(0) = log Efe? ^57] = log FIRS], (15) 


which when substituted into (13) yields the decay rate calculation for the IID case. 
This result will form the basis for the empirical application in Section 3. It is known 
as Cramer's Theorem (Bucklew, 1990, pp. 7—9). 

To illustrate these calculations, let us return to the widely analyzed case where the 
log portfolio return log Ry, is a covariance-stationary normal process with absolutely 
summable autocovariances. Then the partial sum of log returns in (12) is also normally 
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distributed. The mean and variance of it can be easily calculated by adapting Hamilton's 
(1994, p. 279) calculations for the distribution of the sample mean, i.e. the partial sum 
divided by T. One immediately obtains 


T 
log(Wr/Wo) = log Ry, ~ NY (TE[logR,], T Covo 
p 


t=1 


T-\ 
+ M - 1)(Cov, + 22) ; (16) 


t=1 


where E[log Rp] denotes the log return process’ common mean and Cov, denotes its 
t-lagged autocovariance. Formula (12) is the limiting time average of the log moment 
generating functions of these normal distributions. Now remember from elementary 
statistics that a normal distribution’s log moment generating function is linear in its 
mean and quadratic in its variance. As a result, use (12) to calculate 


T-\ 
$(0) = lim. > (retos n (r Covo + 5 (T — 1)(Cov, + 22 rn) 


1-0 
t=+00 
— E[log R;]O + $` Cov,0?/2. (17) 
tT=—00 
Now substitute (17) into (13) and set its first derivative with respect to @ equal to zero 
to find that the maximum in (13) is attained by the following maximizer: 


TS=+00 


Omax = (logr ~ EflogRp]) / ÑO Cov, . (18) 


Substituting (18) back into (17) and rearranging yields the underperformance proba- 
bility decay rate 

2 
E[log R5] — logr 


YS Cov, 


t=- 


D,(logr) = ; (19) 


Note that maximization of the decay rate (19) rewards portfolios with a high expected 
growth rate E[log Rp] (in its numerator) and a low asymptotic variance rere Cov, = 
limr—oo Var{log(Wr/Wo)\/T (in its denominator). This differs from the criterion in 
Bielecki et al. (2000), which is approximately the asymptotic expected growth rate 
minus a multiple of the asymptotic variance. For the IID case used in Section 2.1, 
all covariance terms in (19) are zero except Covg = Var[logR,], so the decay rate 
function (19) reduces to the expression (7) used in Section 2.1 and Table 1. Fig. 3 
depicts this decay rate function over a range of logr, for each of the three portfolios 
whose underperformance probabilities are graphed in Fig. 1. There, we see that the 
portfolio p=47% from (10) does indeed have the highest decay rate when logr = 8%, 


Portfolio Choice with Endogenous Utility: A Large Deviations Approach 631 


M. Stutzer I Journal of Econometrics 116 (2003) 365—386 377 


Underperformance Probability Decay Rates 


3.0095 


—€— p - 8996 
—i— p = 33% 
—fh—p = 47% 


0.90% 


0.80% 


0.70% 


0.60% 


0.50% 


Decay Rate Dp(log r) 


0.40% 


0.30% 


0.20% 


0.10% 


0.00% : ^ 
Te) 

o oe o b D D o i2] e eo o 

$ 8 g 


Target Rate log(r) 


Fig. 3. The decay rate function Dy(logr) is convex, with a minimum at logr = E(log Rp]. The portfolio 
p — 4*6 attains the highest decay rate when log r = 8%. 


Note that the decay rate function D, (logr) in (19) for a covariance stationary Gaus- 
sian portfolio log return process is non-negative, and is a strictly convex function of 
logr, achieving its global minimum of zero at the value logr = E[log Rp]. These 
properties are true for more general processes (for a discussion, see Bucklew, 1990). 
As a result, remember from (6) that the decay rate criterion ranks portfolios with 
E[log R5] > logr, and apply the envelope theorem to the general rate function (13) to 


yield 


dD (log r) _ 
dlogr ^ Ologr 


max 0logr — $(0) = Omax < 0 (20) 
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as seen in the special case (18). Now differentiate (20) to find 


dD*?(logr) _ dO snax 
dlog r2 dlogr 


(21) 
due to convexity of D,(logr). Again due to the envelope theorem, (20) and (21) con- 


tinue to hold for max, D,(logr) as well. Fig. 2 depicts the convexity of max y D,(logr) 
over the relevant range of logr in the example of Table 1. 


2.3. Analogy with power utility 


The general decay rate criterion is a generalization of the expected power utility 
criterion. To uncover the generalization, substitute the right-hand side of (12) into 
(13), to derive 


l D 
D,( = — lim = — 
max p(log r) max max logr ium 7 log E ( ) | 


1 Wry’ 
= max max logr? — Jim 7 log E LG) | 


= max max ~ lim Lio E Wr ’ (22 
p a Too T E Wor? ? ) 
which yields the following large T approximation: 
Bmax (p) 
W. 
eme e). e) 


where we write Omax{p) in (23) to stress dependence (through the joint maximization 
(22)) of 8 on the portfolio p. The left-hand side of (23) increases with Dp, so a large 
T approximation of the portfolio ranking is produced by use of the expected power 
utility on the right-hand side of (23). 

There are both similarities and differences between the right-hand side of (23) and a 
conventional expected power utility E[ — (W7)?]. From (20), Omax(p) < 0. Evaluating 
it at the investor's decay rate maximizing portfolio p, note that the power function 
in (23) with the form U = —(e)'»«(?) increases toward zero as its argument grows 
to infinity, is strictly concave, and has a constant degree of relative risk aversion 
y= 1 — Ogx( p) > 1. Furthermore, O4 (p) « 0 implies that the third derivative of U 
is positive, so the criterion exhibits positive skewness preference. But there are two 
important differences between the concepts. First, the argument of the power function 
in (23) is altered; it is the ratio of invested wealth to a "benchmark" level of wealth 
accruing in an account that grows at the geometric rate r. While absent from tradi- 
tional criteria, this ratio is also present in other non-standard criteria, such as Browne's 
(1999a, p. 276) criterion to “maximize the probability of beating the benchmark by 
some predetermined percentage, before going below it by some other predetermined 
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percentage". Browne (1999a, p. 277) notes that ‘‘...the relevant state variable is the ra- 
tio of the investor's wealth to the benchmark". * Second, conventional portfolio theory 
assumes that the risk aversion parameter Ó is a preference parameter that is independent 
of the investment opportunity set. But in (23), 0=Omax(p) is determined by maximiza- 
tion, and hence is not independent of the investment opportunity set. Investors could 
utilize different investment opportunity sets, either because of differential regulatory 
constraints, such as hedge funds' greater ability to short sell, or because of different 
opinions about the parameters of portfolios' log return processes. When this happens, 
investors will have different decay rate maximizing portfolios p, and different degrees 
of risk aversion y — 1 — Ogax( p), even if they have the same target growth rate logr. 

Assuming that asset returns are generated by a continuous time, correlated geometric 
Brownian process, Browne (19992, p. 290) compares the formula for the optimal port- 
folio weights resulting from his criterion, to the formula resulting from conventional 
maximization of expected power utility at a fixed terminal time T. In this special case, 
he finds that the two formulae are isomorphic, i.e. there is a mapping between the 
models' parameters that equates the two formulae. He concludes that "there is a con- 
nection between maximizing the expected utility of terminal wealth for a power utility 
function, and the objective criteria of maximizing the probability of reaching a goal, or 
maximizing or minimizing the expected discounted reward of reaching certain goals". 
Connection (23) between decay rate maximization and expected power utility is quite 
specific, yet does not depend on a specific parametric model of the assets’ joint return 
process. 

Critics such as Bodie (1995, p. 19) have argued that "the probability of a shortfall 
is a flawed measure of risk because it completely ignores how large the potential 
shortfall might be". It is possible that this is a fair assessment of expected power 
utility maximization of wealth at a fixed horizon date T, subject to a “Value-At-Risk” 
(VaR) constraint that fixes a low probability for the event that terminal wealth could 
fall below a fixed floor. This problem was intensively studied by Basak and Shapiro 
(2001, p. 385), who concluded that “The shortcomings...stem from the fact that the VaR 
agent is concerned with controlling the probability of a loss rather than its magnitude". 
They proposed replacing the VaR constraint with an ad. hoc expected loss constraint, 
resulting in fewer shortcomings. The investor's target growth rate serves a similar 
function in the horizon-free, unconstrained criterion (22). 


3. Non-parametric implementation 


In the IID case, there is a simple, distribution-free way to estimate D,(logr) for a 
portfolio p. Following the comparative portfolio study of Kroll et al. (1984), we replace 
the expectation operator in (15) by an historical time average operator, substitute into 
(13), and numerically maximize that.? This estimator eliminates the need for prior 


8 While Browne considers a stochastic benchmark, the constant growth benchmark here can be modified 
to consider an arbitrary stochastic benchmark, at the cost of fewer concrete expository results. 

?]t is important to remember that the log moment generating function of the log return distribution 
necessarily has to exist near Omax in order for this technique to work here. 
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knowledge of the log return distribution's functional form and parameters. Specifically, 
n : . . 

let Rot) doo p;Rj(t) denote the historical return at time t of a portfolio comprised 

of n+l assets with respective returns Rj(t), with constantly rebalanced portfolio weights 

2^; Dj = 1. The estimator is 


T n 
D (logr) = max flog r — log 73 Se pRO (24) 
t=1 X je0 
and the optimal portfolio weights are estimated to be 
p=arg max max 0logr 
mu n n ld 
-log | = » 3 pjR((t) - | 1— 2n ROD) |. (25) 


The maximum expected log portfolio was similarly estimated, by numerically finding 
the weights that maximize the time average of log R,(). 

Let us now contrast the estimated decay rate maximizing portfolio (25) to both 
the expected log and Sharpe ratio maximizing, constantly rebalanced portfolios formed 
from Fama and French's 10 domestic industry, value-weighted assets, !° whose annual 
returns run from 1927 through 2000. The sample cross-correlations of the 10 indus- 
tries' gross returns range from 0.32 to 0.86, suggesting that diversified portfolios of 
them will provide significant investor benefits. The sample covariance matrix is in- 
vertible, permitting estimation of the Sharpe ratio maximizing “tangency” portfolio, by 
multiplying this inverse by the vector of sample mean excess returns over a riskless 
rate, and then normalizing the result. We assume that it was possible to costlessly 
store money between 1927-2000, with no positive constant nominal rate riskless asset 
available.!! Hence we assume a zero constant riskless rate when computing the Sharpe 
ratio maximizing tangency portfolio of the 10 industry assets. ' 

The results are seen in Table 2. 

The performance statistics in Table 2 show that the Sharpe ratio maximizing portfolio 
has almost no skewness. But the decay rate maximizing portfolios all have a skewness 
of about 1, as does the expected log maximizing portfolio. This reflects the skewness 
preference inherent in the generalized expected power utilities with degrees of risk 
aversion greater than (in the log case, equal to) one. '? In fact, these investors prefer 
all odd order moments and are averse to all even order moments. To see this, note 
that (15) is the cumulant generating function for the (assumed) ITD log portfolio return 


10 The data are currently available for download from a website maintained by Kenneth French at MIT. 

U Treasury Bills are not a constant rate riskless asset, like the one used to form portfolios in Section 2.1. 
A fixed percentage of wealth invested in Treasury Bills is just like any other risky asset. 

U See Kraus and Litzenberger (1976) and Harvey and Siddique (2000) for evidence that investors prefer 
skewness. 
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Table 2 

Comparison of estimated Sharpe ratio, expected log, and decay rate maximizing portfolios from Fama-French 
10 industry indices, 1927-2000 
m 


Industries Asset moments Portfolio weights 

Max logr logr logr Max 

H a Skewness Sharpe 5% 10% 15% Log 

ee 
NoDur 0.130 0.198 ~0.12 0.80 0.92 1.0 1.11 1.15 
Durbl 0.166 0.328 0.86 —0.01 0.27 0.52 0.97 1.22 
Oil 0.137 0220 0.01 0.75 0.77 0.96 1.24 1.36 
Chems 0.146 0,225 0.63 0.14 0.35 0.52 0.89 1.15 
Manuf 0.136 0.254 0.21 0.03 —0.10 0 0.11 0.20 
Telcm 0.123 0.200 0.07 0.35 0.48 0.38 0.30 0.28 
Utils 0.118 0.225 0.25 0.05 —0.20 —0.34 —0.61 —0.76 
Shops 0.141 0256 —0.25 0.13 0.44 0.60 0.96 12 
Money 0.142 0.245 —043 —0.23 —0.20 0.07 0.48 0.70 
Other 0.106 0.242 —0.04 0.76 0.86 1.5 2.52 3.09 


Performance statistics 


Mean 0.148 0.162 0.195 0.248 0.278 
Std. dev. 0.153 0.181 0.240 0.368 0.446 
Skewness —0.02 1.05 1.07 1.06 1.05 
Decay rate Dy(logr) 0.18 0.04 0.004 0 
Risk aversion 1 — Omax(p) 53 2.5 1.3 I 


distribution. Substituting it into (13) and evaluating it at @max(p) yields the following 
cumulant expansion: 


Var[log R 
D,(logr) = (logr — Elo Rp] mal p) — 155 2 in (p 
-Y F Omal PY, (26) 
i=3 7 


which uses the facts that E[log Rp] is the first cumulant of the log return distribution and 
that Var[log Rp] is its second cumulant, while x; denotes its ith order cumulant. Because 
Omax( p) < 0, we see that the decay rate increases in odd-order cumulants and decreases 
in even-order cumulants. With normally distributed log returns, all the cumulants in the 
infinite sum are zero. But with non-normally distributed returns, increased skewness will 
increase the decay rate (due to x3). The relative weighting of the mean, variance and 
skewness in (26) is determined by their sizes, the sizes of the higher order cumulants, 
the target growth rate logr, and the value of Omax(p) < 0 associated with log r. 

The top panel of Table 2 contains the 10 industry weights in each portfolio. As is 
typical of estimated Sharpe ratio maximizing portfolios with more than a few assets, it 
is heavily long in just three industries (Non-durables, Oil, and Telecommunications ). 
The decay rate maximizing portfolio for the target growth rate logr — 0.10 is also 
heavily invested in these industries, but in addition it has considerable long positions 
in the two most positively skewed industries (Durables and Chemicals). The Sharpe 
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ratio maximizing portfolio is heavily short in one industry (Other). The decay rate 
maximizing portfolios are heavily short in both this industry and as well as two others 
(Shops and Utilities). The differences between Sharpe ratio and decay rate maximizing 
portfolios are due to the presence of the target growth rate in decay rate maximization, 
its use of log gross returns rather than net returns when calculating portfolio means 
and variances, and the presence of higher order moments. It is difficult to assess the 
impact of higher order moments on the differences in portfolio weights. Bekaert et al. 
(1998, p. 113) were able to produce only a two percentage point difference in an asset 
weight, when simulating the effects of its return’s skewness over the range —1 to 2.0, 
on the portfolio chosen by an expected power utility maximizing agent whose degree 
of risk aversion was close to 10. This suggests that the use of a target growth rate, and 
the use of log gross returns rather than arithmetic net returns, account for most of the 
differences between the decay rate and Sharpe ratio maximizing portfolios’ weights. 

The convergence of decay rate maximizing portfolios to the expected log maximizing 
portfolio is seen when reading across the last four columns of Table 2. The last two 
rows in the bottom panel of Table 2 show the relationship between the target growth 
rates, their respective efficient portfolios’ maximum decay rates, and their respective 
endogenous degrees of risk aversion. Despite the fact that Omax( p) is determined by 
maximization in (25), we see that the degree of risk aversion 1 — Omax(p) is not 
unusually large in any of the decay rate maximizing portfolios tabled, '? and converges 
toward 1 as logr — max, E[log Rp]. An alternative interpretation of this is enabled by 
computing the first order condition for Omax(p) in the IID case. To do so, substitute 
(15) into (13) and differentiate to find 


E [log de = logr, (27) 


where the Esscher transformed probability density 


Omax 
do Rp? (28) 


is used to compute the expected log return (i.e. growth rate) in (27). ^ Furthermore, a 
result known as Kullback's Lemma (1990) shows that the Esscher transformed density 
(28) is the solution to the following constrained minimization of relative entropy, whose 
minimized value is the decay rate, i.e. 


d d 
D,(logr) = min E E: log A s.t. (27). (29) 

From (27), an efficient portfolio has the highest decay rate among those with a 
fixed transformed expected growth rate equal to logr. As logr —> max, E[log Rp], 


13 Of course, it can get unusually large when the target growth rate is unusually low, i i.e. when the investor 


is unusually conservative. 
1^ See Gerber and Shiu (1994) for option pricing formula derivations that use the Esscher transform to 


calculate the risk-neutral density required for option pricing. 
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Fig. 4. Bootstrap estimated underperformance probabilities for portfolios in Table 2, when logr = 10%. 


Mmax( p) — 0, density (28) concentrates at unity and the minimal relative entropy in 
(29) approaches zero, i.e. the transformed probabilities approach the actual probabilities. 
As a result, the transformed expected log return in (27) approaches the actual expected 
log return, so constraint (27) collapses the portfolio constraint set onto the log optimal 


portfolio. 


In order to determine if a decay rate maximizing portfolio in Table 2 will have 
lower underperformance probabilities than the Sharpe ratio and expected log maximiz- 
ing portfolios do, the probabilities were estimated by resampling the portfolios’ log 
returns 5000 times for each investment horizon length 7, and then tabulating the em- 
pirical frequency of underperformance for each T. The results for the target decay rate 
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log he 10% are graphed in Fig. 4. Fig. 4 shows that the estimated decay rate max- 
imizing portfolio in Table 2 had lower underperformance probabilities for all values 
of T. 


3.1. More general estimators 


The empirical estimates above were made under the assumption of IID returns. There 
is little evidence of serially correlated log returns in many equity portfolios, and what 
evidence there is finds low serial correlation. Hence there is little benefit in using an 
efficient estimator for the covariance stationary Gaussian rate function (20), e. g. using 
a Newey—West estimator of its denominator. But the presence of significant GARCH 
(perhaps with multiple components) effects (see Bollerslev, 1986) in log returns, as 
described in Tauchen (2001, p. 58), motivates the need for additional research into 
efficient estimation of (12) and (13) under specific parametric process assumptions. 
Alternatively, it may be possible to find an efficient nonparametric estimator for (12) 
and (13) by utilizing the smoothing technique exposited in Kitamura and Stutzer (1997, 
2002) to estimate the expectation in (12). 


4. Conclusions and future directions 


A simple large deviations result was used to show that an investor desiring to maxi- 
mize the probability of realizing invested wealth that grows faster than a target growth 
rate should choose a portfolio that makes the complimentary probability, i.e. of wealth 
growing no faster than the target rate, decay to zero at the maximum possible rate. A 
simple result in large deviations theory was used to show that this decay rate maxi- 
mization criterion is equivalent to maximizing an expected power utility of the ratio 
of invested wealth to a "benchmark" wealth accruing at the target growth rate. The 
risk aversion parameter that determines the required power utility, and the investor's 
degree of risk aversion, is also determined by maximization and is hence endogenously 
dependent on the investment opportunity set. Yet it was not seen to be unusually large 
in the applications developed here. 

The highest feasible target growth rate of wealth is that attained by the portfolio 
maximizing the expected log utility, i.e. that with the maximum expected growth rate 
of wealth. Investors with lower target growth rates choose decay rate maximizing 
portfolios that are more conservative, corresponding to degrees of risk aversion that 
exceed 1. As the target growth rate falls, it is easier to exceed it, so the decay rate of 
the probability of underperforming it goes up. The relationship between possible target 
growth rates and their corresponding maximal decay rates form an efficiency frontier 
that replaces the familiar mean-variance frontier. An investor's specific target growth 
rate determines the specific decay rate maximizing portfolio chosen by her. A decay 
rate maximizing investor does not choose a portfolio attaining an expected growth rate 
of wealth equal to her target growth rate (instead it is higher than her target). But there 
is an Esscher transformation of probabilities, under which the transformed expected 
growth rate of wealth is the target growth rate. 
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Researchers choosing to work in this area may select from several interesting topics. 
First, it is easy to generalize the analysis to incorporate a stochastic benchmark. This 
would be helpful in modelling an investor who wants to rank the probabilities that a 
group of similarly styled mutual funds will outperform their common style benchmark. 
Second, one could calculate the theoretical decay rate function using a multivariate 
GARCH model for the asset return processes, and then estimate the resulting function. 
Third, one could extend the decay rate maximizing investment problem to the joint 
consumption/portfolio choice problem, enabling the derivation of consumption-based 
asset pricing model with a decay rate maximizing representative agent. If it is possible 
to construct a model like this, the representative agent's degree of risk aversion will 
depend on the investment opportunity set—an effect heretofore unconsidered in the 
equity premium puzzle. 
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Abstract 


The expected log utility of wealth (i.e., the growth-optimal or Kelly) criterion 
has been oft-studied in the management science literature. It leads to the highest 
asymptotic growth rate of wealth, and has no adjustable “preference parameters” 
that would otherwise need to be precisely “adjusted” to a specific individual’s 
needs. But risk-control concerns led to alternative criteria that stress security 
against underperformance over finite horizons. Large deviations theory enables 
a straightforward generalization of log utility’s asymptotic analysis that incorpo- 
rates these security concerns. The result is a power utility criterion that (like 
log utility) is free of an adjustable risk aversion parameter, because the latter 
is endogenously determined by expected utility maximization itself! A Bayesian 
formulation of the Occam’s Razor Principle is used to illustrate the unavoidable 
reduction of scientific testability (i.e., the ability to more easily falsify) inherent 
in criterion functions that introduce additional adjustable parameters that are 
not directly observable. 


1 A Simple Repeated Betting Problem 


All concepts and results are illustrated using the simplest repeated betting problem 
analyzed in the management science literature. This is the popular “Blackjack” 
example, discussed by Thorp (1984), MacLean, Ziemba, and Blazenko (1992), and 
MacLean and Ziemba (1999). The agent must choose a fraction p of wealth to bet on 
an ITD Bernoulli process that has a gross return per bet Rp = 1+ p with probability 
m > 1/2, and Rp = 1 — p with probability 1 — m. In other words, the bettor either 
wins or loses the fraction p of accumulated wealth each try. For example, if the 
bettor’s initial wealth is Wo = $500 and chooses p = 5%, the bettor will either win 
or lose $25 on the first try. If the bettor wins, the second try will either win or 
lose $525 x .05 = $26.25. But if the better lost the first time, the second try will 
either win or lose 475 * .05 = $23.75. This is isomorphic to a binomially distributed 
stock portfolio initially worth $500, and that returns +5% in each subsequent time 
period. 
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The accumulated wealth after T' bets is Wr = Wo ees Ry. The bettor is 
interested in the security against underperforming some benchmark wealth path 
Wor = Wo Mh Ry. Taking the log of both establishes that: 


T Fu 
yr log Rye X logR 
Prob|Wr < Wyr] = Prob Stal ae _ fae <0 (1) 


while taking the exponentials and rearranging shows: 


ELa log Rpt T 
EM 


Wr = Woe (2) 

By laws of large numbers valid for this binomial and more realistic processes, 
Lites los Rot TY p [log Rp]. So (2) shows that the expected growth rate Eflog Rp] 
resulting from a bet p is realized only asymptotically. Hence arg max, Ellog Rp] 
is the bet yielding the highest asymptotic growth rate of wealth. For illustrative 
purposes only, let us assume that the bettor's *edge" is 60 — 40 — 2096. It is easy 
to verify that betting argmax, Ellog Rp] = .20 yields the maximum asymptotic 
growth rate E[log R.29| = 0.60 log 1.20 + 0.4010g 0.80 ~ 0.0201. 

As Hakansson and Ziemba (1995) noted, this maximum expected log, a.k.a. 
growth-optimal or “Kelly”, criterion “almost surely leads to more capital in the long 
run than any other investment policy which does not converge to it”, and that “the 
growth-optimal strategy also has the property that it asymptotically minimizes the 
expected time to reach a given level of capital”. Other interesting characterizations 
of the criterion are found in Bell and Cover (1988) as well as in the textbook of 
Cover and Thomas (1991). 

But some other researchers, e.g., Aucamp (1993), argued that in practice it 
is quite possible that the asymptotic outperformance won’t be realized with high 
probability until after a very, very long time — perhaps hundreds of years. Over 
finite horizons, MacLean, Ziemba and Blazenko (1992) note that “the Kelly strategy 
never risks ruin, but in general it entails a considerable risk of losing a substantial 
portion of wealth”, and define three measures of security against underperforming 
a benchmark. Figure 1 graphs the probability (1) of underperforming a benchmark 
wealth path growing at a constant target growth rate denoted logr = 1% per bet, 
i.e., substitute log Ry, = 1% for all t in (1). Money growing at a constant rate of 
logr = 196 will double every log 2/.01 ~ 70 bets. Figure 1 shows that the probability 
of underperformance achieved by each of the three betting fractions p approaches 
zero as T — oo, as the law of large numbers guarantees (because E[log Rp] > 196 
for all three values of p). But Figure 1 shows that the convergence to zero is quite 
slow, and that betting either too high a fraction of wealth (e.g., the growth-optimal 
p = 20% of wealth) or too low a fraction (e.g., p = 8%) results in lower security 
against underperformance than betting p = 14.1% of wealth per bet. This following 
section will show how that fraction is found. 
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Figure 1 Probability of underperforming Logr = 1% benchmark. 


2 Asymptotics of Benchmark Underperformance Probabilities 


The underperformance probabilities (1) graphed in Figure 1 decay to zero exponen- 
tially as T — oo. Our emphasis on the security after suitably large T differs from the 
security notions in MacLean, Ziemba, and Blazenko’s (1992), who define security at 
either a fied T (their definition S1) or at all T both small and large (their definition 
52}. But Figure 1 also suggests that the betting fraction with the relatively best 
large-T security will also have relatively good security when T is smaller, which is 
usually the case in other problems studied by the author. For each betting fraction 
p, large deviations theory can be used to calculate the exponential decay rate of the 
underperformance probability curve associated with it. The optimal betting fraction 
is the one associated with the most rapidly decaying curve. This criterion focuses 
attention on the asymptotics of the realized growth rate (i.e., 5 ", log Ry;/T) relative 
to a benchmark's realized growth rate (5^, log Ry:/T), rather than just the asymp- 
totics of the realized growth rate itself (i.e., E[log Rp). When the benchmark is the 
growth-optimal policy (i.e., arg max, E[log Rp|), the two criteria coincide. Hence 
the criterion exposited herein is a natural generalization of the asymptotic reasoning 
used by growth-optimal policy advocates, that additionally incorporates the objective 
fear of underperforming a benchmark (fixed or stochastic) over finite horizons. 

To find the optimal policy in our example, one must first determine the bet- 
ting fractions p that make those probabilities decay to zero asymptotically (i.e., 
Ellog Rp] — logr > 0), like the three fractions in Figure 1 do when logr = 1%. This 
restriction defines both minimum (denoted p) and maximum (denoted p) acceptable 
betting fractions. In our example, p % 5.9% < p < p z 33.6%. 

Within this range of p, it is à consequence of the Gartner-Ellis Large Deviation 
Theorem (e.g., see Bucklew (1990)) that under fairly general conditions, the under- 
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performance probabilities approach zero at an asymptotic exponential rate of decay 
denoted D,, computed below: 


= Las das Ove eso eer) 
Dp = max Jm T log E le 1 (3) 


But there are more insightful ways of restating this criterion. It turns out that 
when p < p < p, the maximizing 0 < 0 in (3) (Stutzer, 2003). So without loss of 


generality we can replace 0 in (3) by y = —@ > 0, yielding the equivalent formulation 
1 oT A 

p= NE rus | = fey (log Rp. — log an] 4 

d er E e 


Using the right hand side of (1) to calculate the summed exponent in (4), a bit 
of algebra (Stutzer, 2003) yields the following restatements of the optimal choice 


problem: 

Gm) | © 
ey] . 
con 2 


Blackjack 1 x 1-pN ^? 
“SIS arg max max — LE (=) +0.40 (=) | (8) 
p Yy20 im T 


: 1 
Dopt = argmaxD, = argmaxmax lim ——log E 
ap & p p > p 0 Too T š 


arg max max — log E 
p y>0 


arg max max E 
p Yy»0 


where Wer = Wo m, Rp = Wor? when the benchmark is just a constant target 
growth rate. 

Fixing the choice vector or scalar (in our example) denoted p, the inner maxi- 
mization over y in (5) is the most general expression for the asymptotic decay rate 
of its underperformance probability curve (see Figure 1), valid in both IID or non- 
IID situations with either constant or stochastic benchmark log returns. The inner 
maximization over y in (6) is the decay rate in IID situations when the benchmark 
is a constant target growth rate. 

There are two major differences between (4) or (5) or the IID special case (7), 
and conventional maximization of the following expected power utility with constant 
relative risk aversion 1 + 7: 


max E[- R57] (9) 


First, the argument in our criterion (7) is the ratio of the bet's return to the bench- 
mark return, rather than just the former in (9). The second major difference is that 
^ is not adjustable by the analyst to "fit" a bettor’s supposedly fixed, exogenous 
degree of relative risk aversion (1 + y). Instead, for a fixed p, y is determined by 
the inner maximization, and thus varies across the values of p considered. T'he 
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endogenous degree of risk aversion exhibited by the agent is 1+ (Pope), which arises 
from the subsequent outer maximization over p that determines the agents optimal 
choice pop;. This will depend on both the opportunity set of alternatives evaluated 
by the agent, and the constant target growth rate (or stochastic benchmark) that 
the agent wants to beat. In other words, the bettor jointly maximizes expected 
power utility by jointly varying both the feasible betting fraction and the utility's 
curvature or risk aversion "coefficient". 

This agent's rank ordering of each betting fraction in the range of p whose 
underperformance probability curves decay to zero (i.e., 5.9% = p « p « p = 33.6%) 
is determined by numerical solution of (8). A summary of the optimization is seen 
in Table 1 below: 


Table 1 The best security against underperforming a logr = 1% target growth 
rate of wealth per bet is achieved by betting pop: = 14.1%, resulting in the fastest 
decay rate (0.18% per bet) of underperformance probabilities. 


Bettor With logr — 1.096 Per Bet Growth Target 
p% Value of (8) | Dp % from (6) | E[log Rp] % | Risk Aversion 1+ 7 
5.9 £z: —1 £0 z 1.0 awl 
8.0 -.9994 0.06 1.28 1.45 
14.1 -.9982 0.18 1.83 1.43 
20.0 -.9987 0.13 2.01 1.25 
33.6 £z: —]1 c0 z 1.0 %1 


Table 1 shows that the highest attainable value of the decay rate is found by 
solving (8), producing pop; = 14.1% and (14.1%) = .43. The asymptotic growth 
rate of wealth associated with the optimal bet is E[log Ry4.1%] = 1.8396. This is 
higher than the agent's target growth rate of log r = 1%, as it must be in order to 
maximize the probability of outperforming that target (equivalently, to minimize 
the probability of underperforming it) over finite horizons T, as depicted in Figure 1, 
and is of course lower than the 2.0196 asymptotic growth rate associated with the 
growth-optimal p = 20%. The bettor exhibits an endogenous degree of relative risk 
aversion 1 J- (14.196) = 1.43, but uses different values of y to evaluate the expected 
utility of different p values, listed in the last column of Table 1. 

The optimization problem can be reformulated as a constrained entropy prob- 
lem of the sort analyzed in the management science literature, e.g., Dinkel and 
Kochenberger (1979). Let D(Q | 1) = Eg [log 2| denote the familiar Kullback- 
Leibler discrimination statistic (a.k.a. relative entropy, see Cover and Thomas, 
1991) measuring the difference between a variable probability measure Q and the 
fixed measure m. Here, Eg is used to denote the expectation taken with re- 
spect to the measure Q. There is a close connection between entropy and the 
large deviations rate function, so it is not surprising that the optimal betting 
problem can be reformulated as a constrained entropy problem. That connec- 
tion (e.g., Kitamura and Stutzer, 2002) shows that the optimal solution to the 
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Table 2 A bettor with a lower target growth rate of wealth will use different 
p-dependent coefficients of risk aversion to evaluate the same bets, will choose to 
bet less, and will exhibit a higher degree of endogenous risk aversion. 


Bettor With logr — 0.196 Per Bet Growth Target 
p% | Value of (9) | Dp % from (6) | Ellog Rp] % | Risk Aversion 1d-y 
[0.507 | x-1 | wo | #01 [| wl | 
1.0 -.9954 0.46 0.20 10.73 
4.5 -.9877 1.23 0.79 4.53 
14.1 -.9924 0.77 1.83 1.88 
20.0 -.9954 0.46 2.01 1.48 
33.0 -.9996 0.04 1.09 1.09 
38.5 æ% —l MU g 0.1 £1 


betting problem can also be computed by solving the constrained entropy problem: 
maxp ming D(Q | 7) s.t. Egllog Rp] € logr. In the above example, this problem 
reduces to finding scalar values for 0 < Q < 1 and 0 € p < 1 solving 


max min Q log(Q/0.60) + (1 — Q)log((1 — Q)/0.40) 
s.t. Qlog(1 +p) + (1 — Q)log(1— p) € 0.01 


It is easy to verify that a numerical solution to this problem is pop, = 14.1% and 
Q = 57%, so that the relative entropy Dy,,, = 0.18% at the saddlepoint. Fur- 
thermore, use of the notation Dp,,, for both the optimal relative entropy and the 
underperformance probability decay rate is not an accident: The bold faced row in 
Table 1 shows that the latter (0.18% per bet) is the same as the former. 

The following Table 2 summarizes the decision process for a more conservative 
agent, who would be satisfied by beating a lower, and hence more easily beaten 
target growth rate of log r = 0.1% (i.e., wealth doubles only every 700 bets or so), 
over the (now slightly different) range of bets 0.50796 — p « p « p — 38.596 that 
yield underperformance probabilities decaying to zero: H 
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Figure 2 Probability of underperforming Log r = 0.1% benchmark. 
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Table 2 shows that this agent should bet only pog, = 4.5% of wealth per bet to 
maximize (minimize) the probability of outperforming (underperforming) the more 
conservative (and hence more easily beaten) growth target logr — 0.196 per bet. 
Comparing Dy,,, in the bold faced row of Tables 2 and 1 shows that the lower 
target enables more rapid decay of the optimal bet's underperformance probability 
curve, seen by comparing the lowest curves in Figures 2 and 1. Table 2 shows that 
this bettor exhibits a higher (but still plausible) endogenous degree of relative risk 
aversion equal to 4.53. 

The following Table 3 shows the optimal betting fractions and degrees of relative 
risk aversion (solving (8)) exhibited by bettors satisfied with beating lower target 
growth rates than the growth-optimal 2.0196. 

Finally, it is easy to show that the relationship between the first and third 
columns in Table 3 is convex, as it will be in other betting or investment problems. 
This relationship shows the tradeoff between an index of desired growth (i.e., the 
target growth rate) and an index of security against underperformance (i.e., the 
decay rate). This efficiency frontier will shift out when the betting opportunity set 
is more favorable (i.e., a higher probability of winning 7), as depicted in Figure 3. 


Table 3 A Bettor who adopts a lower (than growth-optimal) target logr < 2.01% 
to attain more security against underperforming it (i.e., a higher decay rate Dp > 0) 
should choose a smaller betting fraction pope < 20% than growth-optimal in- 
vestors do, and will exhibit a higher endogenous degree of relative risk aversion 
1+ y¥(Popt) > 1 than growth-optimal investors do. 


Bettors With Other Growth Targets 
logr% | Popt% | Dpop(logr)% | E[log E5,,,]76 | Risk Aversion 1 + y(popt) 
2.01 20.0 0 2.01 1 

1.0 14.1 0.18 1.83 1.43 
0.5 10.0 0.51 1.50 2.02 
0.1 4.5 1.23 0.79 4.53 

4 

3.5 

3 


— Prob[Win]=60% 
— Prob[Win] = 70% 


Decay Rate Ip % 
N 


1.5 " 
1 j 
0.5 T 
CIC e 
0 
0 1 2 3 4 


Benchmark Growth Rate Log r % 


Figure 3 Efficiency frontiers. 
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3 A Brief Discussion of Closely Related Literature 


There is a management science literature advocating that the growth-optimal policy 
be combined with a riskless asset to strike an agent's desired tradeoff of growth 
for security against underperformance. Such fractional Kelly strategies have been 
studied by MacLean, Ziemba, and Blazenko (1992), MacLean and Ziemba (1999), 
Li (1993), and others. The criterion embodied in (5) permits the policy p to be any 
adapted process. For example, in portfolio choice over N-assets, the policy p would 
be a (generally time-varying) N-vector of portfolio weights varying over time. The 
only restriction on p is that the underperformance probabilities decay to zero as 
T — oo, in accord with the motive to minimize such probabilities. There is (as yet) 
no basis for restricting the strategy space to fractional Kelly strategies. 

Samuelson (1974, 1979) challenged the accuracy and/or relevance of some of 
the outperformance probability-based asymptotic reasoning advanced by those who 
advocated the use of log utility, i.e., the limiting case of exogenous y | 0 in (9). But 
an analysis in Stutzer (2004) showed that Samuelson's critiques do not apply to the 
criterion developed in the previous section. Furthermore, an important risk-scaling 
paradox highlighted by Rabin (2000) applies to the expected concave utility function 
maximization (like (9)) that Samuelson preferred to outperformance probability- 
based criteria. The Blackjack problem can be used to illustrate the nature of Rabin’s 
critique. The optimal betting fraction when conventionally maximizing the expected 
power utility E[Wr] = E[-W;.?] with exogenous y does not depend on the number 
of bets T. Suppose you were offered the opportunity to bet p — 10.896 of your 
initial wealth T' — 10,000 times, the end result of which is determined in a split 
second by computer simulation. The same calculations used to produce Figures 1 
and 2 (see footnote 3) show that after betting p = 10.8% of accumulated wealth 
T = 10,000 times, there is almost no chance that you could underperform a log r = 
0.696 growth rate per bet. That is, there is near-certainty that your wealth would 
be multiplied by a factor of at least e:006*10000 ~ $1026 in the split second it takes 
for the computer-determined outcome. Yet conventional use of expected constant 
relative risk aversion utility (9) — arguably the most commonly made behavioral 
assumption in the economics literature — implies that a bettor with an exogenous 
degree of risk aversion greater than just 1 + y = 3.76 would rather not play at all 
than have to bet p = 10.8%, no matter the size of T' nor the initial wealth Wo, 
even though it is always impossible to lose more than Wo in the Blackjack problem. 
Nonetheless maintaining the assumption that each individual acts as-if he/she does 
have an exogenous constant degree of relative risk aversion, Barsky et al. (1997) 
designed a questionnaire to measure it, which was given to thousands of individuals 
in person by Federal interviewers. About 2/3 of those surveyed had a degree of 
relative risk aversion higher than 3.76, so their maintained assumption implies that 
all of them would have refused to bet p — 10.896 once, twice, ten thousand, or ten 
million times. Would you? Fortunately, a bettor using the criterion in Section 2, 
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and who has the relatively modest growth target used above (ie., logr = 0.690), 
would indeed be willing to bet this much. 

Most importantly, Rabin (2000) also showed that analogous expected utility 
paradoxes are constructable when concave utility functions other than (9) are as- 
sumed. His concerns about expected concave utility of wealth criteria are not easily 
dismissed. 

Finally, there is of course a large literature proposing other ways to integrate 
shortfall probability into portfolio analysis, e.g., Browne (2000) and the other papers 
cited by him, or maximization of non-log expected utility subject to a "Value- 
At-Risk” constraint (2001). A full exploration of the connection between these 
alternatives and the criterion in Section 2 is too lengthy to be attempted in this 
note on the growth-optimal literature. 


4  Occam's Razor Critique of Alternative Decision Criteria 


One long-held disideratum was expressed by Albert Einstein: “Everything should be 
made as simple as possible, but not simpler". This is the principle of parsimonious 
parameterization that has historically been termed Occam’s (or Ockham’s) Razor. 
There are other ways to generalize the standard CRRA (9) and its limiting log utility 
criteria. But they have more adjustable parameters than the criterion described in 
Section 2. Each of the alternatives introduced additional adjustable parameters 
when modifying a conventional CRRA power utility (such as log utility), gaining 
flexibility that is helpful when confronting individual behavioral observations. But 
by impeding the ability to make as sharp a prediction about what could potentially 
be observed, the flexibility enabled by the additional adjustable parameters comes 
at a cost. This tradeoff is quantified in the Bayesian analysis of Jefferys and Berger 
(1992), summarized as follows: 


Bayesian analysis can shed new light on what the notion of the “simplest” 
hypothesis consistent with the data actually means... By choosing prior 
probabilities of hypotheses, one can quantify the scientific judgment that 
simpler hypotheses are more likely to be correct. Bayesian analysis also 
shows that a hypothesis with fewer adjustable parameters automatically 
has an enhanced posterior probability, because the predictions it makes 
are sharp. 


To formalize this, suppose we are trying to assess the plausibility (a.k.a. subjec- 
tive probability, as defined and interpreted by Bayesians) of one hypothesized deci- 
sion criterion Ho with fewer adjustable parameters, e.g., that described in Section 2, 
relative to another criterion H4. A Bayesian assesses prior probabilities u( Ho) and 
“(AH A), finds the likelihoods L of each potential individual decision under the two 
hypotheses, and finally calculates the following ratio of posterior probabilities (i.e., 
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the posterior odds in favor of Ho): 


Prob|Ho] L(Dovs | Ho) L(A) Def. n 
ee ee LU INC —' Bx Prior Odds 10 
Prob|Ha] | L(Dovs | Ha) u(HA) ie 


where Doo, is the individual decision that is actually observed and B is the Bayes 
factor, i.e., the ratio of the actual observation's likelihood under each hypothesis. 
Because Ho has fewer adjustable parameters, it is possible that the likelihood of 
a large subset of potential observations will be smaller under Ho than under Hy. 
But if so, the likelihood of the complementary (possibly smaller) set of potential 
observations will be higher under Ho. If the actually observed individual decision 
Doy, lies in this complementary set, the Bayes factor in (10) will favor Ho, even 
though its general fitting ability is lower than an alternative with more adjustable 
parameters. Unless there is some reason to place higher prior probability on the 
more profligately parameterized alternative H 4, the posterior odds ratio (10) will 
thus favor the "simpler", sharper hypothesis Ho over the more flexible H 4. 

The crucial claim that the Hp developed in section 2 makes a sharper prediction 
than the conventional (without benchmark) CRRA hypothesis H4 hinges on the 
ability to more accurately determine an agent's benchmark than direct measure- 
ments can accurately determine the hypothesized exogenous curvature parameter ^y 
(e.g., the aforementioned problematic questionnaire used by Barsky, et al. (1997)). 
It is fair to question this ability in the context of our betting example, where the 
analyst would have had to more accurately determine a value for the bettor's target 
growth rate logr than a value for y, before observing the fraction of wealth bet. 
But the ability to more accurately determine the benchmark is easy to envision 
in the more important context of investment behavior. A typical fund manager, 
and an investor who uses the manager, expressly try to outperform an identified 
benchmark portfolio tailored to the fund's style. For example, a recent advertise- 
ment by Standard and Poors cited independently gathered data indicating that $3.5 
trillion of funds are professionally invested in attempts to beat the S&P 500 index 
benchmark. Hence when considering fund managers (and their voluntary clients), 
the benchmark required for the criterion in Section 2 is a directly observable port- 
folio (e.g., the S&P 500), in which case the criterion in Section 2 has no adjustable 
parameters, and hence makes the sharpest conceivable predictions. 


Notes 


!Ergodic processes that are not IID still have limiting infinite time averages, that would 
need to be denoted differently than E[log Rp]. 

?MacLean, Ziemba, and Blazenko (1992) assumed that a value of r = 51% might be 
achievable by card counting techniques. 

3For each value of T and p, the event of underperformance will occur when there are x 
or fewer wins, where [xlog(14-p)--(T—x)log(1—p)]/T < 0.01. Hence the probability of this 
event is computed by evaluating the cumulative binomial distribution (with probability of 
winning equal to 7 = 0.60) at the highest integer x satisfying this inequality. 
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“In brief, define the partial sum Yr = YZ log Rpt — NL. log Ry. Define the time- 
averaged log moment generating function ¢r(@) = + log E[e?""]. Bucklew (1990) lists 
the following two implicit restrictions on the process needed to ensure that (3) is the 
aforementioned decay rate: (i) limroo $7 (0) exists for all 0 — but where oo is allowed 
both as a limit value as well as member of the sequence and (ii) this limit function is 
differentiable at all values of 0 for which it is finite. As shown in Bucklew (1990), these 
restrictions hold for à wide variety of both independent and dependent stochastic processes 
used in practice to model Yr, including our example’s IJD Bernoulli return process with 
return Rp and constant benchmark return log r. Stutzer (1995) used a vector version of the 
result to characterize an asset pricing model error diagnostic. I recently discovered that 
Sornette (1998) applied the IID version (Cramer’s Theorem) in several portfolio choice 
settings. With specific parametric processes, Pham (2003) found solutions for related 
continous time, dynamic portfolio choice problems. Dembo, Deuschel, and Duffie (2004) 
also used Cramer’s Theorem to approximate loss probabilities for suitably large numbers 
of loans. 

?Probably the earliest hint of a result like this is in Ferguson (1965). He studied a class 
of betting problems, and noted that if “the probability of ruin goes to zero exponentially 
as the fortune tends to infinity”, maximization of expected exponential utility would be 
“approximately valid” for some value of its 0 coefficient “when the fortune is large". A 
precise finite T' result of this nature was developed by Browne (1995), for a problem where 
wealth is generated by a specific parametric controlled stochastic differential equation. 
The more general, nonparametric results (5)-(7) were enabled by formulating shortfall 
probabilities using (1), and then applying the powerful Gártner-Ellis Theorem to simply 
compute the asymptotics. 

$Perhaps the first use of constrained entropy in finance was Cozzolino and Zahner 
(1973). 

"'The constraint is binding at the solution, so (in this scalar p example!) the constraint 
equation can be used to write Q as a function of p in closed form. Substitute this for Q in 
D(Q | 7) and numerically maximize over p to find pope = 14.1%. 

58ee Stutzer (2004) for a proof applicable to this and more general situations. 

?For example, consider the following “habit formation" modification of the CRRA 
utility (9): divide Rp by the exogenous parameter r as in (7), but leave + as an additional 
exogenous parameter. 

The likelihood functions must be calculated before the individual's decision Dobs) 
is observed. This is to avoid the following ersatz calculation: observe an individual's 
Pobs, then “fit” the adjustable parameters in Ha by working backwards to find adjustable 
parameter values that exactly “explain” Pobs, and then assert that L(poss | Ha) = 1 in 
(10)! This procedure would make it impossible to reject this or any other hypothesis that 
is flexible enough to generate all possible data that could be observed (and hence any 
particular poss. According to the widely accepted view of Popper (1977), an hypotheses 
that can't be rejected (in his terms, falsified) is not a scientific hypothesis. 
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Introduction to the Evidence of the Use of Kelly Type 
Strategies by the Great Investors and Others 


We know that in the long run, full Kelly strategies dominate other strategies, but 
they are very risky short term. Hence, practical applications to long sequences of 
wagers are especially appropriate. Current hedge fund trading that enters and exits 
in a few seconds is such an application. Thorp (1960) coined the term Fortunes's 
Formula in his application of Kelly to the game of blackjack using his card counting 
system. The count provides an estimate of the mean so that with more favorable 
counts the player should wager more. Typically, blackjack players can play about 
60-- hands per hour. So this application works well for full Kelly. But the risk 
is high that even after a few hours, with say a 1% edge, a player may be behind. 
Hence, blackjack teams typically use fractional Kelly strategies, with fractions of 
0.2 to 0.8 being common. Gottlieb (1984, 1985) describes the early use of these 
fractional Kelly strategies. 

It is no coincidence that many of the greatest hedge fund and portfolio man- 
agers are former blackjack players. Billionaires like trend follower Harold McPike, 
bond guru Bill Gross, options trader Blair Hull, and racetrack guru Bill Benter are 
some examples. Other top investors like Renaissance Medallion hedge fund trader 
Jim Simons and Warren Buffett, the legendary greatest investor in the world, use 
strategies that employ Kelly-type ideas to attain long term wealth growth. So did 
the great economist John Maynard Keynes in managing the King's College Cam- 
bridge endowment. As we have seen the size of Kelly wagers is largely dependent 
upon the probability of losing and it is important not to overbet. Since, the effect 
of errors in means are risk aversion dependent, these wagers for Kelly bettors are 
very sensitive to good mean estimates. Otherwise, it is easy to overbet. 

The book Fortune's Formula written by William Poundstone (2005) brought 
the Kelly ideas to a wider audience. So now many investment newsletters such 
as Morningstar and The Motley Fool suggest Kelly strategies (see Fuller (2006) 
and Lee, 2006). Unfortunately, most of these sources do not fully understand the 
computation of Kelly strategies in the multivariate case and when there are portfolio 
management imperfections (such as slippage). As well, they often miss the subtleties 
and danger of these Kelly applications. In this final part of the book, we present 
applications of Kelly and fractional Kelly modeling. 

Hausch, Ziemba, and Rubinstein (HRZ) (1981) present a weak market efficiency 
anomaly in the horserace place and show markets and how it can be exploited and 
wagered on using the Kelly criterion. The idea is to use probabilities derived in a 
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simple market (the win pool involving n horses) in a market that is more complex 
— the place pool involving n(n — 1) combinations and/or the show pool involving 
n(n — 1)(n — 2) combinations. Previous studies (see Hausch and Ziemba, 2008, 
for an up to date survey of results), indicate that the win market at racetracks is 
efficient except for a favorite-longshot bias. Favorites are underbet and longshots 
are overbet. This bias has been apparent for well over 50 years, although recent 
changes in the betting to allow rebates for large bettors and head to head wagering 
on a betting exchange have changed this shape to some extent; see graphs in Hausch 
and Ziemba (2008). This biases for win is in the opposite direction of the bias for 
place and show where favorites are overbet and longshots underbet so in the HZR 
analysis that was the probability of finishing one-two for place or one-two-three for 
show, the biases tend to cancel. So HZR assume that the probability that horse i 
wins is simply its betting fraction W;/W, and the probability that i is first and j is 
second is the chance that i wins and j wins a race that does not contain i, namely 
W/W 
ar) E | 

where W; are the wagers to win on horse i and W = $:W;. While its known that 
these Harville formulas are biased, the bias cancels in the HZR application, again 
see Hausch and Ziemba (2008) for current research on this bias. HZR compute the 
expected value of place and show bets and note that there frequently are favorable 
bets with expected values well above 1.0 per dollar bet. Applying the Kelly criterion 
leads to a complex non-concave non-linear programming model to compute optimal 
wagers that include the transactions costs or slippage that comes from the effect of 
our investor's bets on the odds. They show that the system yields positive profits 
at several racetracks. To make the system operational, they solved thousands of 
Kelly models and regressed the optimal bets and expected values on the ratio of 
the place and show pool bet on horse i. Then at the track, one only needs to look 
at W; and W and P; and P or S; and S, where P; are the wagers to place on horse 
i and S; are the wagers to show on horse i. A quick scan can indicate whether a 
bet is good or not. 

Hausch and Ziemba (1985) added further refinements to the basic system. These 
included equations for the optimal place and show bets for varying initial wealth and 
place or show betting pool size. These were subsequently used in a calculator that 
allows players to punch in the four parameters from above and compute quickly 
the expected value and Kelly bet. This includes the track take adjustment, and 
coupled entry adjustments if needed that Hausch and Ziemba worked out. They 
also looked into the effects of the track take and breakage (the rounding to 2.20 or 
2.10 for example from 2.25 and 2.18 payoffs to the nearest dime per $2 or $1 bet, 
respectively. Finally, they computed how many players can be using the system at 
the same time before the market will become efficient. Ziemba and Hausch (1984, 
1986, 1987) wrote two books that made the system popular and famous. Now 
twenty-five years late, the system is still in use although the conditions are rougher 
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in 2010 than in the 1980s because of the two factors mentioned above (rebates and 
betting exchanges) and the fact that there is so much cross track betting these days 
that about half the betting pool has not arrived when the horses begin running. 
But the clever punter can estimate this odds movement that occurs after the bets 
have to be made. 

Ziemba and Hausch (1994, 2008) show how to modify the place and show system 
developed in the HZR and HZ papers to a different betting system that is used 
in some other countries such as England. In the UK, there is no show betting, 
only place, but the number of horses that place is not necessarily two as in North 
America. Rather, it depends on the number of horses running in the race. There 
can be one, two, three or four horses that place when the number of horses running 
is at most four, 5-7, 8-15 or 16+, respectively. In the UK there are bookies offering 
fixed odds and the parimutuel tote as discussed here. In North America place 
and show bets pay back the stakes of the horses that place or show and then the 
net profit is divided equally among these two or three qualifiers. The the payoffs 
depend on the number of winning tickets on each qualifier. But in the UK, the 
entire pool, after the take, 1s divided into 1, 2, 3 or 4 equal portions which is then 
split into the individual payoffs. So you can see that with a large amount bet on 
one horse, the payoffs for a £1 bet can be as low as £1. With this minimum payoff 
there are arbitrages possible, see Hausch and Ziemba (1990) for the US and Jackson 
and Waldron (2003) for the UK and other locales. Ziemba and Hausch develop the 
expected value equation and Kelly criterion bet sizing. These equations yield simple 
graphs to apply the system in the UK. 

In all of these applications, the theory indictes that the calculations should be 
based, as Thorp has argued, on total liquid wealth not just the track wealth. Wilcox 
(2003ab, 2005) has argued for a related theory where decisions are made on excess 
wealth after liabilities are deducted. This is a fractional Kelly strategy. 

Grauer and Hakansson (1986) and in a number of other papers apply the Kelly 
modeling approach to asset-allocation problems. They use a simple asset return 
distribution procedure: it is assumed that the past will repeat itself. At the be- 
ginning of each period t, the investor chooses a portfolio on the basis of a utility 
function for gross returns, R, given by 


Y, fo y<l. 


The data used are the total monthly and yearly returns including interest and 
dividends on stocks, long-term government bonds, long-term corporate bonds and 
small capitalized stocks for 1926-1983 in the Ibbotson and Sinquefield CRSP file. 
They compute results for the no-leverage and leveraged cases with and without small 
capitalized stocks for y — 1 (positive power), 3/4, 1/2, 1/4 and 0 (full Kelly) and 
negative powers —1, —2,..., —75. They did quarterly and yearly revisions. The 
simple historical probability approach gives good results. Small stocks generally 
add value. 
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Mulvey, Bilgili and Vural (2010) apply Kelly type models to deal with vari- 
ous practical issues involving imperfections such as transaction costs, operational 
constraints and path dependencies, etc for use by investors such as defined ben- 
efit pension plans. Their model is modified to also provide downside protection 
in turbulent markets. They apply the theory to recent US stock market behavior 
including the 2008 crash period. They show how large endowments can modify 
their typical three step process: select benchmarks, revise portfolio allocation tar- 
gets and select active managers to deal with these issues. Fixed mix portfolio 
rates along with exchange traded ETFs provide volatility induced wealth growth. 
They show that gains are larger in upswings and losses lower in crash periods 
than the underlying indices. The increasing correlation of equity assets during 
crash periods is dealt with by developing a portfolio of low correlation investment 
strategies. 

Rudolf and Ziemba (2004) present a continuous time model for pension or insur- 
ance company surplus management over time. Such lifetime intertemporal portfolio 
investment models date to Ramsey (1928), Phelps (1962) and Samuelson (1969) in 
discrete time and Merton (1969) in continuous time. Rudolf and Ziemba use an 
extension of the Merton (1973, 1990) model that maximizes the intertemporal ex- 
pected utility of the surplus of assets net of liabilities using liabilities as a new vari- 
able. They assume that both the asset and the liability return follow Itó processes 
as functions of a risky state variable. The optimum occurs for investors holding four 
funds: the market portfolio, the hedge portfolio for the state variable, the hedge 
portfolio for the liabilities, and the riskless asset. This is a four fund CAPM, while 
Merton has a three fund CAPM, and the ordinary Sharpe-Lintner-Mossin CAPM 
has two funds. The hedge portfolio provides maximum correlation to the state vari- 
able, that is, it provides the best possible hedge against the variance of the state 
variable. In contrast to Merton's result in the assets only case, the liability hedge 
is independent of preferences and depends only on the funding ratio. With hyper- 
bolic risk aversion utility, which includes negative exponential, power and log, the 
investments in the state variable hedge portfolio are also preference independent, 
and with Kelly type log utility, the market portfolio investment depends only on 
the current funding ratio. 

Having surplus over time is what life and other insurance companies, pension 
funds and other organizations try to achieve. With both life insurance companies 
and pension funds, parts of the surplus are distributed to the clients usually once 
every year. Hence, optimizing their investment strategy is well represented by 
maximizing the expected lifetime utility of the surplus. 

A case study involves a US based investor investing in the stock and bond 
markets of the US, UK, Japan, the EMU, Canada and Switzerland. The investor 
faces an exposure to five foreign currencies euro, British pound, Japanese yen, 
Canadian dollar and Swiss franc, and can invest in eight funds. Five of them are 
hedge portfolios for the state variables, which are assumed to be currency returns 


Introduction to the Evidence of the Use of Kelly Type Strategies 661 


and the others by portfolio separation are the market portfolios, the riskless asset 
and the liability hedge portfolio. 

The holdings of the various funds depend only on the funding ratio and on 
the currency betas of the distinct markets. For a funding ratio of one, there is no 
investment in the market portfolio and only diminishing investments in the currency 
hedge portfolios. The portfolio betas against the five currencies are close to zero for 
all funding ratios. The higher the funding ratio is, the higher is the investment in the 
market portfolio, the lower are the investments in the liability and the state variable 
hedge portfolios, and the lower is the investment in the riskless fund. Negative 
currency hedge portfolios imply an increase of the currency exposure instead of à 
hedge against it. The increase of the market portfolio holdings and the reduction 
of the hedge portfolio holdings and the riskless fund for increasing funding ratios 
shows that the funding ratio is directly related to the ability to bear risk. Rather 
than risk aversion coefficients, the funding ratio provides an objective measure to 
quantify attitudes towards risk. 

Ziemba (2005) investigates the records of great investors and proposes a simple 
modification of the ordinary Sharpe ratio to evaluate them more fairly. In most 
cases these investors have mostly monthly or quarterly gains with few losses so the 
ordinary Sharpe ratio penalizes them for gains by increasing the portfolio standard 
deviation. The funds discussed are Windsor of George Neff, the Ford Foundation, 
the Harvard Endowment, Tiger of Julian Robertson, Quantum of George Soros 
and Berkshire Hathaway of Warren Buffett. 'The modified Sharpe ratio simply 
replaces the gains by the mirror image of the losses. So gains are not penalized but 
only losses. 

With the ordinary Sharpe ratio the Ford Foundation beats Berkshire Hathaway 
even though the geometric mean return is much less. So the question is does Berk- 
shire dominate with the modified DSSR. downside symmetric Sharpe ratio. Only 
Berkshire improves with the DSSR but the Ford Foundation and the Harvard en- 
dowment are still slightly better. This is because Berkshire has the most large 
gains of all the funds studied but also has a lot of large losses on a monthly or quar- 
terly basis and more than the other funds. So Berkshire is oriented towards a full 
Kelly long term valuation where intermediate monthly, quarterly and even yearly 
returns are not the goal, rather long terms growth is the goal. So we believe Buffett 
acts like a full Kelly bettor in his investment strategies. These funds have DSSRs 
about 1.0. 

Princeton-Newport, run by Edward O. Thorp, a co-editor of this book, had 
a DSSR of 13.8 with a very smooth wealth path with only three monthly losses, 
and no quarterly or yearly losses, over the twenty year period 1969-1988. Thorp 
had similar good results in other funds run later. Renaissance Medallion, run by 
James Simon, arguably the current top hedge fund in the world, had, as reported 
by Ziemba and Ziemba (2007), an even higher 26.4 DSSR with very few monthly 
or quarterly losses. no yearly losses, and very high mean returns during the period 
January 1993 to April 2005. 
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Thorp (2006) presents a comprehensive survey of his considerable use over almost 
fifty years of Kelly strategies in blackjack, sports betting and the stock market 
along with theoretical results he has found useful in practice. He traces the early 
first serious application of Kelly methods to blackjack bet sizing, to various sports 
betting applications and his great success as a hedge fund trader with the Princeton- 
Newport and later hedge funds. 
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EFFICIENCY OF THE MARKET FOR RACETRACK 
BETTING* 


DONALD B. HAUSCH, t WILLIAM T. ZIEMBAf AND MARK RUBINSTEIN§ 


Many racetrack bettors have systems. Since the track is a market similar in many ways to 
the stock market one would expect that the basic strategies would be either fundamental or 
technical in nature. Fundamental strategies utilize past data available from racing forms, 
special sources, etc. to "handicap" races. The investor then wagers on one or more horses 
whose probability of winning exceeds that determined by the odds by an amount sufficient to 
overcome the track take. Technical systems require less information and only utilize current 
betting data. They attempt to find inefficiencies in the "market" and bet on such “overlays” 
when they have positive expected value. Previous studies and our data confirm that for win 
bets these inefficiencies, which exist for underbet favorites and overbet longshots, are not 
sufficiently great to result in positive profits. This paper describes a technical system for place 
and show betting for which it appears to be possible to make substantial positive profits and 
thus to demonstrate market inefficiency in a weak form sense. Estimated theoretical probabili- 
ties of all possible finishes are compared with the actual amounts bet to determine profitable 
betting situations. Since the amount bet influences the odds and theory suggests that to 
maximize long run growth a logarithmic utility function is appropriate the resulting model is a 
nonlinear program. Side calculations generally reduce the number of possible bets in any one 
race to three or less hence the actual optimization is quite simple. The system was tested on 
data from Santa Anita and Exhibition Park using exact and approximate solutions (that make 
the system operational at tke track given the limited time available for placing bets) and found 
to produce substantial positive profits. A model is developed to demonstrate that the profits 
are not due to chance but rather to proper identification of market inefficiencies. 
(FINANCE-PORTFOLIO; GAMES-GAMBLING) 


1. The Racetrack Market 


For the most part, academic research on security markets has bypassed an 
interesting and accessible market—the racetrack for thoroughbred horses—with its 
highly standardized form of security—the tote ticket. The racetrack shares many of the 
characteristics of the archtypical securities market in listed common stocks. Moreover 
the racetrack gains further interest from its significant differences, and more impor- 
tantly, because it is inherently a more elementary market context, lacking many of the 
dynamic features which complicate analysis of the stock market. 

The “market” at the track in North America convenes for about 20 minutes, during 
which participants place bets on any number of the six to twelve horses in the ` 
following race. In a typical race, participants can bet on each horse, either to win, 
place or show.^ The horses that finish the race first, second or third are said to finish 
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! For surveys, see Copeland and Weston [6], Fama [9], [10] and Rubinstein [22]. 

2 Other bets such as the daily double (pick the winners in the first and second race), the quinella (pick the 
first two finishers regardless of order in a given race) and the exacta (pick the first two finishers in exact 
order in a given race) as well as various combinations are possible as well. Such bets are utilized by the 
public to construct low probability high payoff bets. For discussion of some of the implications of such bets 
see Rosett [21]. 
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*in-the-money." All participants who have bet a horse to win, realize a positive return 
on that bet only if the horse is first, while a place bet realizes a positive return if the 
horse is first or second, and a show bet realizes a positive return if the horse is first, 
second or third. Regardless of the outcome, all bets have limited liability. Unlike most 
casino games such as roulette, but like the stock market, security prices (Le. the 
"odds") are jointly determined by all the participants and a rule governing transaction 
costs (i.e. the track “take”). To take the simplest case, for win, all bets across all horses 
to win are aggregated to form the win pool. If W, represents the total amount bet by all 
participants on horse i to win, then W = $,W, is the win pool and WQ/ W, is the 
payoff per dollar bet on horse / to win if and only if horse i wins, where 1 — Q is the 
percentage transaction costs.’ If horse i does not win, the payoff per dollar bet is zero. 

The track is "competitive" in the sense that every dollar bet on horse / to win, 
regardless of the identification of the bettor, has the same payoff. The “state contin- 
gent" price of a dollar received if and only if horse i wins is p, = (W;/ WQ) and the 
one plus riskless return is 1/5 ;p; = Q, a number less than one. The interest rate at the 
track is thus negative and solely determined by the level of transactions costs. Thus, 
apart from an exogeneously set riskless rate, the participants at the track jointly 
determine the security prices. For example, by betting more on one horse than 
another, the state-contingent price of the first horse increases relative to the second, or 
alternatively the payoff per dollar bet decreases on the first relative to the second. 

The rules for division of the place and show pools are slightly more complex than 
the win pool. Let P, be the amount bet on horse j to place and P = EP, be the place 
pool. Similarly S, is the amount bet on horse k to show and the show pool is 
S = $, S}. The payoff per dollar bet on horse j to place is 


. i is first and / is second or 
1 +| PQ- P,- P,|J/(2P,) if 
[PQ - P, 4 ) i [ j is first and i is second 


0 otherwise. 


Thus if horses i and j are first and second each bettor on j (and also i) to place first 
receives the amount of his bet back. The remaining amount in the place pool, after the 
track take, is then split evenly between the place bettors on i and j. The payoff to horse 
j to place is independent of whether j finishes first or second, but it is dependent on 
which horse finishes with it. A bettor on horse j to place hopes that a longshot, not a 
favorite, will finish with it. 

The payoff per dollar bet on show is analogous 


: k is first, second or third 
I*[59- $-$- S]/OS,) if | and finishes with i and j 
0 otherwise. 


In many ways the racetrack is like the stock market. A technical strategy based on 
discrepancies between the amounts bet on the same horses to win, place and show, is 
examined in this paper. Since a short position has a perfectly negative correlated 
outcome to the result of normal bet a given horse can be "shorted" by buying tickets 
on all the other horses in a race. 

The racetrack also differs from the stock market in important ways. In the stock 
market, an investor's profit depends not only on the initial price he pays for a security, 


3The actual transactions cost is more complicated and is described below. 
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but also on what some other investor is willing to pay him for it when he decides to 
sell. Thus his profit depends not only on how well the underlying firm does in terms of 
earnings over the time he holds its stock (i.e. supply uncertainty), but also on how 
other investors value that stock in the future (i.e. demand uncertainty). Given the 
initial price, both the nature and behavior of other market participants determine his 
profit. Thus current stock prices might depend not only on "fundamental" factors but 
also on market “‘psychology”—the tastes, beliefs, and endowments of other investors, 
etc. In contrast once all bets are placed at the track prior to a given race (i.e. initial 
security prices are given), the result of the race and the corresponding payoffs depend 
only on nature. There is no demand uncertainty at the track. 


2. Previous Work on Racetrack Efficiency 


A market is efficient if current security prices fully reflect all available relevant 
information. If this is the case, experts should not be able to achieve higher than 
average returns with regularity. A number of investigators have demonstrated that the 
New York Stock Exchange and other major security markets are efficient and 
so-called experts in fact achieve returns when adjusted for risk that are no higher than 
those that would be received from random investments (see Copeland and Weston [6], 
Fama [9], [10] and Rubinstein [22] for discussion, terminology, and relevant refer- 
ences). For an exception see Downes and Dyckman [7]. 

Snyder [24] provided an investigation of the efficiency of the market for racetrack 
bets to win. The question Snyder poses is whether or not bets at different odds levels 
yield the same average return. The rate of return for odds group i is 


N? (0, + 1) - N, 
N T 


where N, and Nf are the number of horses, and the number who won, respectively, at 
odds O; — (WQ/ W, — 1). A weakly-efficient market in Snyder’s sense would set 
R; = Q for all i, where 1 — Q is the percentage track take. His results as well as those 
of Fabricant [8], Griffith [12], [13], McClothlin [18], Seligman [23] and Weitzman [27] 
suggest that there are "strong and stable biases but these are not large enough to make 
it possible to earn a positive profit" [24; 1110]. In particular, extreme favorites tend to 
be under bet and longshots overbet. The combined results of several studies comparing 
over 30,000 races are summarized in Table 1. 


R= 


TABLE 1 (Snyder [24]) 
Rates of Return on Bets to Win by Grouped Odds, Take Added Back 


Midpoint of grouped odds 


Study 0.75 125 2.5 5.0 75 10.0 15.0 33.0 
Fabricant ILIi* 9,08 4.6* - 14 -33 -3.7 -8.1 ~ 39.5" 
Griffith 8.0 4.9 3.1 -3.1 — 34.6* — 34.18 — 10.5 ~ 65,53 
McGlothlin 8.0? 8.0* 8.07 - 0.8 — 46 -70^ -9.7 — 11.0 
Seligman 14.0 4.0 - 10 1.0 -2.0 -40 -7.8 — 242 
Snyder 5.5 5.5 4.0 -12 34 29 > 2.4 — 15.8 
Weitzman 90 3.2 6.84 -13 — 42 - 5.1 — 82^ — 18.07 
Combined 9.18 6.48 6.1? -1.2 -5.2 -52* - 10.28 - 23.78 


*Significantly different from zero at 1% level or better. 
>Significantly different from zero at 5% level or better. 
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Since the track take averages about 18%, the net rate of return for any strategy 
which consistently bets within a single odds category is — 996 or less. For horses with 
odds averaging 33 the net rate of return is about — 42%. 

Conventional financial theory does not explain these biases because it is usual to 
assume that as nondiversifiable risk (e.g. variance) rises expected return rises as well. 
In the win pool expected return declines as risk increases. An explanation consistent 
with the expected utility hypothesis is that investors (as a composite) are risk lovers 
and behave as if the betting opportunities are limited to a single race. Weitzman [27] 
and Ali [1] have estimated such utility functions. Ali's estimated utility function over 
wealth w is the convex function 


u(x) = 191w^"9^ — (R?-.9981), 


which has increasing absolute risk aversion. Thus by the Arrow-Pratt [3], [19] theory 
investors will take more risk as their wealth declines. This explains the common 
phenomenon that bettors, when losing, tend to bet more and more on longer odds 
horses in a desperate attempt to recoup earlier losses. Moreover, since uw is nearly linear 
for large w investors are nearly risk neutral at such wealth levels. 

A second explanation is that gamblers simply prefer low probability high prize 
combinations (i.e. longshots) to high probability low prize combinations. Besides the 
possible gains involved, gamblers have egos associated with analyzing racing forms 
and pitting one's predictions against others. Luck and entertainment as well are largely 
absent in betting favorites. The thrill is to successfully detect a moderate or long odds 
winner and thus confirm one's ability to outperform the other bettors. Such a scenario 
is consistent with the data and leads to the biases. Rosett [21] provides an analysis of 
ways to construct low probability high prize bets through parlays and other combina- 
tions that can be used to avoid the longshot tail bias problem and to take advantage of 
the favorite bias. In an effort to capitalize on this market many tracks feature such bets 
in the form of the daily double, the exacta and the quinella. However, none of these 
schemes appear to yield bets with positive net returns.* 

Other studies of racetrack efficiency have been conducted by Ali [2], Figlewski [11] 
and Snyder [24]. Ali shows that the win market is efficient in the sense that indepen- 
dently derived bets with identical probabilities of winning do in fact have the same 
odds in a statistical sense. His analysis utilizes daily double bets and the corresponding 
parlays, i.e. bet the proceeds if the chosen horse wins the first race on the chosen horse 
in the second race, for 1089 races. Figlewski, using a multinomial logit probability 
model to measure the information content of the forecasts of professional handicap- 
pers and data from 189 races at Belmont in 1977, found that these forecasts do contain 
considerable information but the track odds generated by bettors discount almost all 
of it. Snyder provided strong form efficiency tests of the form: are there persons with 
special information that would allow them to outperform the general public? He found 
using data on 846 races at Arlington Park in Chicago that forecasts from three leading 
newspapers, the daily racing form and the official track handicapper did not lead to 
bets that outperformed the general public. 


‘For an entertaining account of an “expert” who was able to achieve positive net returns over a full racing 
season, see Beyer [5]. See also Vergin [26]. 
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3. Proposed Test 


The studies described in 82 examine racetrack efficiency with respect to the win pool 
only. In this paper our concern is with the efficiency of the place and show pools 
relative to the win pool and with the development of procedures to best capitalize on 
potential inefficiencies between these three "markets." We utilize the following defini- 
tion of weak-form efficiency: the market is weakly-efficient if no individual can earn 
positive profits using trading rules based on historical price information. In Baumol’s 
[4; 46] words ... “all opportunities for profit by systematic betting are eliminated". 
Our analysis utilizes the following two data sets: 1) Data Set 1: all dollar bets to win, 
place and show for the 627 races over 75 days involving 5895 horses running in the 
1973/74 winter season at Santa Anita Racetrack in Arcadia, California, collected by 
Mark Rubinstein; and 2) Data Set 2: all dollar bets to win, place and show for the 
1065 races over 110 days involving 9037 horses running in the 1978 summer season at 
Exhibition Park, Vancouver, British Columbia, collected by Donald B. Hausch and 
William T. Ziemba. 

In the analysis of the efficiency of the win pool one may compare the actual 
frequency of winning with the theoretical probability of winning as reflected through 
the odds. Similar analyses are possible for the place and show pools once an estimate 
of the theoretical probabilities of placing and showing for all horses is available. There 
is no unique way to obtain these estimates. However, very reasonable estimates obtain 
from the natural generalization of the following simple procedure. Suppose three 
horses have probabilities to win of 0.5, 0.3, and 0.2, respectively. Now if horse 2 wins, a 
Bayesian would naturally expect that the probabilities that horses 1 and 3 place (i.e. 


win second place) are 0.5/0.7 and 0.2/0.7, respectively. In general if q; (i — 1, ...., n) 
is the probability horse i wins, then the probability that i is first and j is second is 
(4,4) / (1 — q) (1) 
and the probability that i is first, j is second and k is third is 
9:9; I/ (Y - 1 - qi — 4) Q) 


Harville [14] gives an analysis of these formulas. Despite their apparent reasonableness 
they suffer from at least two flaws: 

1) no account is made of the possibility of the “Silky Sullivan" problem; that is, 
some horses generally either win or finish out-of-the-money—for example, see footnote 
5 in [15]; for these horses the formulas greatly over-estimate the true probability of 
finishing second or third; and 

2) the formulas are not derivable from first principles involving individual horses 
running times; even independence of these random variables T,,..., T, is neither 
necessary nor sufficient to imply the formulas. 

In addition to assuming (1) and (2) we assume that 


n 
q- W,/ > W. 3) 

ist 
That is, the win pool is efficient. The discussion above, of course, indicates that this 
assumption is suspect in the tails and this is discussed below. Table 2a-c compares the 
actual versus theoretical probability of winning, placing and showing for data set 2. 
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Similar tables for data set | appear in King [17]; see also Harville [14]. The usual tail 
biases appear in Table 2a (although they are not significant at the 5% confidence 
level). One has reverse tail biases in the finishing second and third probabilities, see 
Tables 2b,c in [15]. This occurs because if the probability to win is overestimated 
(underestimated) and probabilities sum to one it is likely that the probabilities of 
finishing second and third would be underestimated (overestimated). Tables 2b,c, 
indicate how these biases tend to cancel when they are aggregated to form the 
theoretical probabilities and frequencies of placing and showing.? 


TABLE 2 
Actual vs. Theoretical Probability of Winning, Placing and Showing: Exhibition Park 1978 
(a) Theoretical Number Average Actual Estimated 
Probability of Theoretical Frequency Standard 
of Winning Horses Probability of Winning Error 
0.000 -0.025 540 0.019 0.016 0.005 
0.026 -0.050 1498 0.037 0.036 0.005 
0.051 -0.100 2658 0.073 0.079 0.005 
0.101  -0.150 1772 0.123 0.126 0.008 
0.151  -0.200 1199 0.172 0.156 0.010 
0.201 -0.250 646 0.223 0.227 0.016 
0.251 -0.300 341 0.272 0.263 0.024 
0.301 -0.350 199 0.323 0.306 0.033 
0.351 -0.400 101 0.373 0.415: 0.049 
0.401 + _ 83 0.450 ^.469 0.055 
9037 
(b) Theoretical Number Average Actual Estimated 
Probability of Theoretical Frequency Standard 
of Placing Horses Probability of Placing Error 
0.000 ~0.025 21 0.022 0.000* 0.000 
0.026 -0.050 391 0.040 0.030 0.009 
0.051 -0.100 1394 0.075 0.080 0.007 
0.101 -0.150 1335 0.124 0.152* 0.010 
0.151 -0.200 1295 0.174 0.174 0.011 
0.201 -0.250 1057 0.223 0.243 0.013 
0.251 ~0.300 871 0.274 0.304 0.016 
0.301 -0.350 772 0.323 0.314 0.017 
0.351 -0.400 580 0.373 0.313* 0.019 
0.401  —0.450 420 0.424 0.395 0.024 
0.451 -0.500 321 0.472 0.457 0.028 
0.501 -0.550 202 0.523 0.415* 0.035 
0.551 -0.600 149 0.573 0.483* 0.041 
0.601 ~0.650 114 0.623 0.570 0.046 
0.651 -0.700 51 0.672 0.627 0.068 
0.7001 ~0.750 4l 0.721 0.731 0.069 
0.751 + _ 23 0.792 0.782 0.086 
9037 


5Since it is the accuracy of these probabilities rather than the q;'s that is of crucial importance in the 
calculations and model below this canceling provides some justification for omitting the tail biases in (3). In 
practice, bets are only made on horses with expected returns considerably above I, e.g., 1.16 at Santa Anita. 
Modification of the q; to include these biases might change the 1.16 to 1.14, for example. See also the 
discussion below and in §§4 and 5. 
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TABLE 2 (continued) 
Actual vs. Theoretical Probability of Winning, Placing and Showing: Exhibition Park 1978 


(c) Theoretical Number Average Actual Estimated 
Probability of Theoretical Frequency Standard 
of Showing Horses Probability of Showing Error 

0.000 -0.025 0 — — — 

0.026 -0.050 55 0.043 0.036 0.025 
0.051 -0.100 592 0.078 0.081 0.011 
0.101 -0.150 895 0.125 0.165* 0.012 
0.151  -0.200 909 0.175 0.195 0.013 
0.201 -0.250 799 0.224 0.292* 0.016 
0.25] -0.300 885 0.275 0.289 0.015 
0.301 -0.350 794 0.324 0.346 0.017 
0.351 -0.400 703 0.374 0.398 0.018 
0.401 -0.450 655 0.425 0.433 0.019 
0.451 -0.500 617 0.475 0.452 0.020 
0.501 -0.550 542 0.524 0.477* 0.021 
0.551 -0.600 396 0.573 0477* 0.025 
0.601 -0.650 375 0.624 0.599 0.025 
0.651 ~0.700 264 0.672 0.609* 0.030 
0.701 -0.750 206 0.722 0.582* 0.034 
0.751 -0.800 154 0.773 0.655* 0.038 
0.801 -0.850 113 0.821 0.752 0.041 
0.851 -0.900 53 0.873 0.830 0.052 
0.901 + .30 0.925 0.833 0.068 

9037 


“Categories when the theoretical probability and the actual frequency are different at the 5% 
significance level are denoted by *'s. The estimated standard error is (2/ N)i where the actual 
frequency sample variance s? = N(E(X?) — (EX)*)/(N — 1). Since the X; are either 0 or 1, E(X?) 
= EX and s$? = N(EX — (EXY)/(N - 1). 


Formulas (1)-(3) can be used to develop procedures that yield net rates of return for 
place and show betting that are higher than expected (i.e.-18%) and indeed make 
positive profits. As a first step towards development of a "system" we present the 
results on the two data sets of $1 bets when the theoretical expected return is a for 
varying a. The expected return from a $1 bet to place on horse / is$? 


+ pnrf (LCDa 1 } x20} 


z qf; 
P = 
EX; x6) 2 | * P, 
jel 
2 did: d Q(P * 1) - (1 * P; * Pj) ( l ) x 
PAE -) e gnr (SOO ) rag) x20} | 
ivl 


S'The expressions (4) and (5) give the marginal expected return for an additional $1 bet to place or show 
on horse /. To obtain the average expected rates of return one simply replaces (1 + pj) and (1 + sj) in these 
expressions by p, and s;, respectively. From a practical point of view with usual track data these quantities 
are virtually identical. 

7A further complication, not reflected in (4) and (5) below, is that a $2 winning bet must return at least 
$2.10. Hence in these "minus pools" involving an extreme favorite the track's take is less than 1 — Q. 
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where Q is | minus the track take, P = SP, is the place pool, P, is bet on horse i to 
place and INT(Y) means the largest integer not exceeding Y. In (4) the quantities P 
and P, are the amounts bet before an additional $1 is bet; similarly with S and S, in 
(5), below. The expressions involving INT take into account the fact that $2 bets 
return payoffs rounded down to the nearest $0.10. The two expressions represent the 
expected payoffs if / is first or second, respectively. Similarly the expected payoff from 
a $1 bet to show on horse / is 


n g 19% 
EX; = M 
' D 2 (17 aX! —- 4 - 4) 
j*etkslj 
Iaf 28*0-70* 5S * S* S) i 
x EL > (145) «20 
"x ANAK 
+ ——————————— 
2 p ]— qx 7 di ^ qi) 
inl kei) 
| Q(S*1-(1I* S SS). | 
x|l-4 gm [( eee rts) x20 
"2 UG 
+ Mr 
p> P (17 a7 4 - 4) 
il pei 
"n Q(S*1)-(1*SS*S)W | 
x it gnr ( 4 ETAT s) , (5) 


where S = YS, is the show pool, S; is bet on horse į to show and the three expressions 
represent the expected payoffs if / is first, second and third, respectively. 

Naturally one would expect that positive profits would not be obtained, given the 
inherent inaccuracies in assumptions (1)-(3), unless the theoretical expected return a 
was significantly greater than 1. However we might hope that the actual rate of return 
would at least increase with a and be somewhat near a. Table 3 indicates this is true 
for both data sets. The perverse behavior for high a in the place pool is presumably a 
small sample phenomenon. Additional calculations along these lines appear in Harville 
[14]. 


4. A Betting Model 


The results in Table 3 give a strong indication that there are significant inefficiencies 
in the place and show pools and that it is possible not only to achieve above average 
returns but to make substantial profits. In this section we develop a model indicating 
not only which horses should be bet but how much should be bet taking into account 
investor preferences and wealth levels and the effect of bet size on the odds. 

We consider an investor having initial wealth wy contemplating a series of bets. It is 
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TABLE 3 
Results of Betting $1 to Place or Show on Horses with a Theoretical Expected Return of at Least a 
Exhibition Park 


Place Show 
Number of Total Net Net Rate Number of Total Net Net Rate 
a Bets Profit ($) of Return (95) Bets Profit ($) of Return (%) 
1.04 225 5.10 23 612 33.20 5.4 
1.08 126 — 10.10 -8.0 386 53.50 13.9 
1.12 69 11.10 16.1 223 40.80 18.3 
1.16 40 5.10 12.8 143 26.30 18.4 
1.20 18 5.30 29.4 95 2170 22.8 
1.25 It - 2.70 - 24.5 44 11.20 25.5 
1.30 3 -3 - 100.0 27 10.80 40.0 
1.50 0 0 — 3 6 200.0 
Santa Anita 
Place Show 
Number of Total Net Net Rate Number of Total Net Net Rate 
a Bets Profit ($) of Return (96) Bets Profit ($) of Return (96) 

1.04 103 12.30 11.9 307 — 18.00 - 5.9 
1.08 52 12.80 24.6 162 6.90 43 
1.12 22 9.20 41.8 89 3.00 3.4 
1.16 7 2.30 32.9 46 12.40 27.0 
1.20 3 - 1.30 -43.3 27 6.20 23.0 
1.25 0 0 — 9 6.00 66.7 
1.30 0 0 — 5 5.10 102.0 
1.50 0 0 — 0 0 — 


natural to suppose that the investor would wish to maximize the long run rate of asset 
growth and thus employ the so-called Bernoulli capital growth model; see Ziemba and 
Vickson [28] for references and discussion of various assumptions and results. We use 
the following result: if in each time period ¢=1,2,... there are / investment 
opportunities with returns per unit invested denoted by the random variables 
Xie Xp where the x, have finitely many distinct values and for distinct ¢ the 
families are independent, then maximizing E log $AÀ,x,, s.t. DA; < wW, A, > 0 maxi- 
mizes the asymptotic rate of asset growth. The assumptions are quite reasonable in a 
horseracing context because there are a finite number of return possibilities and the 
race by race returns are likely to be nearly independent since different horses will be 
running (although the jockeys and trainers may not be). 

The second key feature of the model is that it considers an investor's ability to 
influence the odds by the size of his bets.* This yields the following model to calculate 


* The first model to include this feature seems to be Isaacs (16]. Only win bets are considered with linear 
utility, and he is able to determine the. exact solution in closed form. His model may be useful in situations 
where the perfect market assumption (3) is violated or where special expertise leads one to believe their 
estimates of the q, are better than those of the other bettors. 
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the optimal amounts to bet for place and show. 
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where Q = | — the track take, W, P, and S, are the total dollar amounts bet to win, 
place and show on the indicated horses by the crowd, respectively, W= dw, 
PZP, and S = $55, are the win, place and show pools, respectively, qg; = W,/ W is 
the theoretical probability that horse i wins, wọ is initial wealth and p, and s, are the 
investor's bets to place and show on horse /, respectively. 

The formulation (6) maximizes the expected logarithm of final wealth considering 
the probabilities and payoffs from all possible horserace finishes. [t is exact except for 
the minor adjustment made that the rounding down to the nearest $0.10 for a two 
dollar bet, see (4) and (5), is omitted.” For the values of a > 1.16 it was observed that 
in a given race at most three p, and three s, were nonzero. When (6) is then simplified it 
can be solved in less than 1 second of CPU time.'? A discussion of the generalized 
concavity properties of (6) will appear in a forthcoming paper by Kallberg and 
Ziemba. 

The results are illustrated by function 1 in Figures 1 and 2 for the two data sets 
using an initial wealth of $10,000. In both cases the bets produced from (6) lead to well 
above average returns and to positive profits.!! These results may be contrasted with 
random betting; function 2 in Figures | and 2. Intuition suggests that Santa Anita with 
its larger betting pools would have more accurate estimates of the q; than would be 
obtained at Exhibition Park. Hence positive profits would result from lower values of 
a. The results bear this out and only horses with a > 1.20 for Exhibition Park were 
considered for possible bets. Generally speaking the bets are usually favorites and 
almost always on those horses with maximum (W,/W)/(P,/P) and (W,/ W)/(S,/ S) 


?]t is possible to include this feature in (6) but it greatly complicates the solution procedure (e.g. 
differentiability is lost) with little added gain in accuracy. 

10 All calculations were made on UBC's AMDAHL 470V6 Model II computer using a code for the 
generalized reduced gradient algorithm. 

"'The procedure was to calculate the optimal bets to place and show in each race using (6) with the 
present wealth level. The results of the race and the actual payoffs that reflect the track's take and breakage 
are known. The payoff for our investor’s bets were calculated using all the bets of the crowd plus the bets of 
our investor taking into account the track's take and breakage, i.e. the payoffs are thus those that would 
have occurred had our investor actually made his bets. The wealth level was then adjusted to reflect the 
race's gain or loss and the procedure continued for all races. 
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! Results from expected log betting to place and show when expected returns are 1.16 or better with initial 
wealth $10,000. 

? Approximate wealth level history for random horse betting. Total dollars bet is as in system 1 ($116,074). 
Track payback is 82.5%, therefore final wealth level is $10,000 — 0.175($116.074) = —$10,313 (Note: 
breakage is not taken into consideration) 

? Results from using the Exhibition Park approximate regression scheme (with initial wealth $2,500) at 
Santa Anita. 


Ficure l. Wealth Level Histories for Alternative Betting Schemes; Santa Anita: 1973/74 Season. 


ratios. These ratios of the theoretical probability of winning to the track take unad- 
justed odds to place or show form a type of cost-benefit ratio that provides a first 
approximation to a. This is discussed further in $5, below. Most of the bets are to show 
and one tends to bet only about once per day. The numbers of bets and their size 
distribution are presented in Table 4. As expected, the influence on the odds made by 
our investor's bets is much greater at Exhibition Park hence the bets there tend to be 
much smaller than at Santa Anita. However, even there about 10% of the bets exceed 
$1000. 

The log formulation has absolute risk aversion |/w, which for wealth around 
$10,000 is virtually zero. Zero absolute risk aversion is, of course, achieved by linear 
utility. One then will bet on the horse (or horses) with the highest a until the influence 
on the odds drops this horse (or horses) below another horse's a, etc., continuing until 
there are no favorable bets pr the betting wealth has been fully utilized. The results of 
such linear utility betting with wọ = $10,000 are: at Exhibition Park with a > 1.20 final 
wealth is $14,818; at Santa Anita with a » 1.16 final wealth is $10,910. Such a strategy 
is a very risky one and leads to some very large bets. The log function has the distinct 
advantage that it implies negative infinite utility at zero wealth hence bets having any 
significant probability of yielding final wealth near zero are avoided. 
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DAY 


‘Results from expected log betting to place and show when expected returns are 1.20 or better with initial 


wealth $10,000. 
? Approximate wealth level history for random horse betting. Total dollars bet is as in system 2 ($8461). 


Track payback is 81.9% therefore final wealth level is $10,000 — 0.181($38461) = 33, 035. (Note: Breakage is 
not taken into consideration.) 
FicunE 2. Wealth Level Histories for Alternative Betting Schemes; Exhibition Park: 1978 Season. 


TABLE 4 
Size Distribution of Bets, wg = $10,000, Log Utility 
Santa Anita Exhibition Park 
Place Show Place Show 


% of % of % of % of % of % of % of % of 
Size Bets $Bet Bets $Bet Bets $Bet Bets $Bet 


0-50 T. 0.3 2.6 0.1 29.4 3.6 17.0 1.0 
51-100 0 0 1.3 0.1 23.5 6.7 13.8 2.8 
101—200 0 0 3.9 0.4 5.9 2.9 22.3 9.3 
201-300 214 6.1 5.2 1.0 5.9 44 14.9 10.3 
301-500 7.] 32 9.1 2.6 23.5 30.6 8.5 9.1 
501-700 14.3 10.3 13.0 5.7 0 0 6.4 10.5 
701-1000 — 143 14.4 7.8 4.8 0 0 74 17.5 
> 1001 35.8 65.7 57.1 85.3 11.8 51.8 9.7 39.5 


n*|4 $11,932 n=77 $104,142 n=17 $4,954 n=94* $33,507 


Total Place Total Show Total Place Total Show 
Bets Bets Bets Bets 


Total Santa _ Total Exhibition _ 
Anita Bettings $116,074 Park Betting $38, 461 
“Two of these bets had EX/ > 1.20 and sf = 0. 
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5. Making the System Operational 


The calculations reported in Figures 1 and 2 were made under the assumption that 
the investor is free to bet once all other bettors have placed their bets. In practice one 
can only attempt to be one of the last bettors." There is a natural tradeoff between 
placing a bet too early and increasing the inaccuracies and running the possibility of 
arriving too late at the betting window to place a bet.? The time just prior to the 
beginning of a race is crucial since many bets are typically made then including the so 
called "smart money" bets made very close to the beginning of the race so their impact 
on other bettors is minimized. It is thus extremely important that the investor be able 
to perform all calculations necessary to place the bet(s) very quickly. Typically, since 
tracks have neither public phones nor electricity, calculations can at most utilize 
battery operated calculators or possibly a battery operated special purpose computer. 
Even if computing times were negligible the very act of punching in the data needed 
for an exact calculation is too time-consuming since it takes more than one minute. 
Therefore, in practice, approximations that utilize a limited number of input data 
elements are required. Several types of approximations are possible such as the tabular 
rules of thumb developed by King [17] or the regression procedures suggested here. 
Our procedure indicates whether or not a bet to de or "t is warranted and at 
what level using the following eight data inputs: wy, Wix, Piz, Ove, Si W, P and S, 
where i* is an i for which (W,/W)/(P,/P) is maximized ur isa j for which 
(W;/ W)/(S;/ S) is maximized. It is easy to determine i* and j*, Sarticulily since i* 
often equals j*, by inspection of the tote board. The approximation supposes that the 
only possible bets are i* to place and j* to show. The regressions, as given below, must 
be calibrated to a given track and initial wealth level. As a prelude to actual betting 
the regressions were calibrated at Exhibition Park for wọ = $2500. For calculated i* the 
expected return on a $1 bet to place is approximated by 

W/W 
EXP, = 0.39445 + 0.51338 ——— CPC R? = 0,776. (7) 
Pix 


If EXP, > 1.20 then the optimal bet to place is approximated by 
Pix = — 459.32 + 1715.64, — 0.042518q,, P — 7440.12, 


+ 1379143, -- 0.10247 P,, + 49.572 In wo, R? = 0.954. (8) 


Similarly for j* the expected return on a $1 bet to show is approximated by 


EX, = 0.64514 + 0.32806 RUM R? = 0.650 9 
je n x EUR m , (9) 


"The model as developed in this paper utilizes an inefficiency in the place and show pools to yield 
positive profits. An investor's bets are determined by his wealth level as well as the profitability of one or 
more such bets. There may or may not exist "enough" inefficiency to provide positive profits for additional 
investors using a system of this nature. In the context of the Isaac's model, for win bets with linear utility, 
Thrall [25] showed that if there were positive profits to be made and each new investor was aware of all 
previous investor's bets then the profits for these various bettors are shared and in the limit become zero. It 
is likely that a similar result obtains for the model discussed here. 

13 At some tracks, such as Santa Anita, betting ends precisely at post time when the electric totalizator 
machines are shut off. At other tracks including Exhibition Park, betting ends when the horses enter the 
starting gate which may be 2 or even 3 minutes past the post time. 
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If EX; 2 1.20 then the optimal bet to show is approximated by 
Sj. = — 660.97 — 867.694, 0.259339, + 3715247, 


-0.195725, € 77.014 lnwọ R? = 0.970. (10) 


All the coefficients in (7)-(10) are highly significant at levels not exceeding 0.05. 
Utilizing the equations (7)-(10) results in a scheme in which the data input and 
execution time on a modern hand held programmable calculator is about 35 seconds. 

The results of utilizing this method on the Exhibition Park data with initial wealth of 
$2500 are shown in Figure 3, functions | and 2. Using the exact calculations yields a 
final wealth of $5197 while the approximation scheme has an even higher final wealth 
of $7698. The approximation scheme leads to 63 more bets (174 versus 111) than the 
exact calculation. Since these bets had a positive net return the tota] profit of the 
approximation scheme exceeds that of the exact calculation. Thus, it is clear that one 
would maximize profits with a cutoff rule below the conservative level of 1.20. The size 
distribution of bets from the exact and approximate solutions are remarkably similar; 
see Table 6 in [15]. 

For place there were 17 bets where EX? exceeded or equalled 1.20 of which 7 (41%) 
were in the money. Only 2 of these 17 bets were not chosen by the regression. However 
14 horses were chosen for betting by the regressions even though their "true" EX? was 
less than 1.20. Most of these had "true" EX? values close to 1.20 and were favorites. 
Four of these horses (29%) were in the money. For show there were 94 bets where EX" 
exceeded or equalled 1.20 of which 52 (55%) were in the money. Only 8 of these were 
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! Results from expected log betting to place and show when expected returns are 1.20 or better with initial 
wealth $2,500. 

? Results from using the approximate regression scheme with initial wealth $2,500. 

FiGURE 3. Wealth Level Histories for Exact and Approximate Regression Betting Schemes; Exhibition 
Park: 1978 Season. 
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overlooked by the regressions while 59 bets with "true" EX" values less than 1.20 were 
chosen by the regressions. Of these 39 (66%) were in the money. 

The regression method is a simple procedure that seems to work well. For example 
using it on the Santa Anita data (without reestimating the coefficients) indicates that 
initial wealth of $2500 would yield a final wealth of $8104; see function 3 in Figure 1. 
Conceivably, there are many possible refinements using fundamental information that 
could be added. Many of these refinements as well as discussion of the results of actual 
betting will appear in a forthcoming book by Ziemba and Hausch. One such refine- 
ment is the supposition that the win market is more efficient under normal fast track 
conditions and less efficient when the track is slow, muddy, heavy, wet, sloppy, etc. 
Bets were made on 87 of the 110 days at Exhibition Park; of these 57 days had a fast 
track. Using the regressions yields “fast track” bets of $32,501 returning $38,364 for a 
net profit of $5863. On the nonfast days the regressions suggest bets of $17,180 
returning $16,515 for a loss of $665. Hence this refinement decreases the time spent at 
the track and increases the net profit from $5198 to $5863. 


6. Implementation and Reliability of the System 


The model presented in this paper assumed that one can utilize the betting data that 
prevails at the end of the betting period, say t, to calculate optimal bets. In practice, 
however, even with the approximations given by (7)-(10), one requires about 1-1.5 
minutes to physically calculate the optimal bets and place them. Thus one can only 
utilize betting information from r (=~ t — 1.5). Hence bets that were optimal at r may 
not be as profitable at t. Some evidence by Ritter [20] seems to indicate that the odds 
on the expost favorite at t often decrease from 7 to t. In which case an optimal bet at r 
may be a poor bet at £. Ritter investigated the systems: bet on i* if (P.,/ P/(W,,/ W) 
€ 0.7 and on j* if (S,/S)/(W,,/ W) < 0.7. (For comparison equations (7) and (9) 
indicate the more restrictive constraints 0.6373 and 0.5912, respectively, instead of 0.7. 
The less restrictive Ritter constraints yield about three times as many bets as (7) and 
(9) indicate.) Using a sample of 229 harness races at Sportsman's Park and Hawthorne 
Park in Chicago he found that with $2 bets these systems gained 24% and 16%, 
respectively. But a 15% advantage shrunk to — 1% if one uses the 7 bets with a random 
shock over the r to ¢ period (for the show bets for the 95 races at Hawthorne with a 
0.65 cutoff). There are some difficulties with the design of Ritter's experiment, such as 
the small sample, the inclusion of only $2 bets and the use of a random shock rather 
than an estimate of usual trends from r to f, etc., however, his results point to the 
difficulty of actually making substantial profits in a racetrack setting. 

An attempt was made during the summer 1980 racing season at Exhibition Park to 
implement the proposed system and observe the "end of betting problem." Nine racing 
days were attended during which 90 races were run. At t — 2 minutes the win, place, 
and show data were recorded on any horse that, through equations (7) and (9), 
warranted a bet. Equations (8) and (10) were then used on this data to determine the 
regression estimates of the optimal bets. Updated toteboard data on these horses were 
recorded until the end of betting to determine if the horse remained a system bet. 
Results of this experiment are shown in Table 5 where: 1) an initial wealth of $2500 is 
assumed; 2) size of bet calculations are done on data at t — 2 minutes; and 3) returns 
are based on final data. Twenty-two bets were made that yielded final wealth of $3716 
for a profit of $1216. Actual betting was not done but by making the calculations two 
minutes before the end of betting allows 1.5 minutes to place the bet, more than 
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TABLE 5 
Results from Summer 1980 Exhibition Park Betting 
Regression Regression Regression Net 
estimate estimate estimate of return 
of expected of expected optimal based on 
return per return per bet final data 
dollar, 2 dollar, at 2 minutes with 
minutes the end before consideration 
before end of end of of our bets Final 
Date Race of betting betting betting Finish affecting odds wealth 
$2500 
July 2 9 120 122 $19,SHOW ON 4 5-6-7 -$19 2481 
" 10 120 123 72,SHOW ON 8 8-1-2 72 2553 
July 9 7 121 110 292,SHOW ON I 2-7-1 131 2684 
n 10 135 122 248,PLACE ON 1 1-6-2 260 2944 
July 16 6 131 122 487,PLACE ON 9 536 
139 n? 292,SHOW ON 9 | ane 146 | 6823 -3626 
M 7 125 127 7,SHOW ON I 5-2-8 -7 3619 
July 23 3 149 149 30,SHOW ON 2 2-10-7 92 3711 
5 4 139 134 573, SHOW ON 10 6-10-4 201 3912 
July 30 8 121 Hl 215,PLACE ON 4 4-1-5 129 4041 
e 9 123 125 591, SHOW ON 6 8-1-5 — 591 3450 
Aug 6 6 128 112 39,SHOW ON 4 4-3-1 59 3509 
2 9 124 103 51,SHOW ON 2 4-1-3 = 51 3458 
Aug 8 l 121 132 875 SHOW ON | 1-10-4 139 3597 
i 3 127 111 635,SHOW ON 3 3-4-7 127 3724 
M 4 126 113 126,SHOW ON 2 2-7-1 82 3806 
Aug l1 8 121 112 94,SHOW ON 8 8-6-2 113 3919 
" 9 131 130 688,SHOW ON 5 5-3-4 138 4057 
Aug 13 3 128 106 33,SHOW ON 2 1-6-7 -33 4024 
n 6 131 122 205, SHOW ON 5 5-8-4 144 4168 
M 7 134 133 5I L, SHOW ON 6 8-5-9 -51i 3657 
T 10 123 109 108,SHOW ON 5 3-5-1 59 3716 


enough time. Note in Table 5 that many systems bets at ! — 2 were not system bets at 
t, however all had regression expected returns of more than one. 

An important question concerns the reliability of the results: are the results true 
exploitations of market inefficiencies or could they be obtained simply by chance? 
This question is investigated utilizing a simple model which was suggested to us by an 
anonymous referee. The first application is concerned with an estimate of the probabil- 
ity that the system's theory is vacuous and indeed the observations conform to specific 
favorable samples from a random betting population. The second application esti- 
mates the probability of not making a positive profit. The calculations utilize the 1980 
Exhibition Park data; see Table 5. 

Let v be the probability of winning a bet in each trial and 


X= l | w if the bet is won, 
: 0 otherwise, 


be the return from a $1 bet in trial i. In n trials, the probability of winning at least 
100% of the total bet is 
(11) 


i=] 


a| i(Ex)-v» 
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Assume that the trials are independent. Since the X, are binomially distributed (11) can 
be approximated by a normal probability distribution as 


dn )— w 
m n y — (V w)mn * 01) (12) 


(1+ w)ry(l— T)/T 


where ® is the cumulative distribution function of a standard N(0,1) vanable. The 
Observed probability of winning a bet, weighted by size of bet made, yields 0.771 as an 
estimate of ~. If the systems theory was vacuous and random betting was actually 
being made then (1 + w) = 0.83 since the track's payback is approximately 83%. The 
22 bets made totalled $5304 and resulted in a profit of $1216 for a rate of return of 
22.9%. Using equation (12) with n = 22, gives 3 x 107 as the probability of making 
22.9% through random betting. 

Suppose that the 1980 Exhibition Park results represent typical system behavior so 
a — 0.771 and (1  w)s = 1.229. In n trials the probability of making a non-positive 
net return is 


(13) 


n 


| 4( Sx) -1<0 


which can be approximated as 
vn (1 — (1 + w)m) 


mp} SN 7y 
(1 — w)ry(1— v)/« 


For n — 22 this probability is only 0.054 and for n — 50 and 100 this probability is 
0.008 and 0.0003, respectively. Thus it is reasonable to suppose that the results from 
the 1980 Exhibition Park data (as well as the 1978 Exhibition Park and 1973/1974 
Santa Anita data with their larger samples) represent true exploitation of a market 
inefficiency." 


= ( — 0.342 n ). (14) 


'" Without implicating them we would like to thank Michael Alhadeff of Longacres Racetrack in Seattle, 
the American Totalizator Corporation and the staff of the Jockey Club at Exhibition Park in Vancouver for 
helping us obtain the data used in this study, and M. J. Brennan and E. U. Choo for helpful discussions. 
Thanks are also due to V. S. Bawa, J. R. Ritter, and two anonymous referees for helpful comments on an 
earlier draft of this paper. This paper is a condensation of [15]. 
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In a previous paper (Management Science, December 1981) Hausch, Ziemba and Ru- 
binstein (HZR) developed a system that demonstrated the existence of a weak market 
inefficiency in racetrack place and show betting pools. The system appeared to make possible 
substantial positive profits. To make the system operational, given the limited time available 
for placing bets, an approximate regression scheme was developed for the Exhibition Park 
Racetrack in Vancouver for initial betting wealth between $2500 and $7500 and a track take 
of 17.1%. This paper: (1) extends this scheme to virtually any track and initial wealth level; (2) 
develops a modified system for multiple horse entries; (3) allows for multiple bets; (4) analyzes 
the effects of the track take and breakage on profits; (5) presents recent results using this 
system; and (6) considers the extent of the inefficiency, i.e., how much can be bet before the 
market becomes efficient? 

(FINANCE-PORTFOLIO; GAMES--GAMBLING) 


1. The Racetrack Market 


The “market” at the track in North America convenes for about 20 minutes, during 
which participants make bets on any number of the six to twelve horses in the 
following race. In a typical race, participants can bet on each horse, either to win, 
place or show. All participants who have bet a horse to win realize a positive return on 
that bet only if the horse is first, while a place bet realizes a positive return if the horse 
is first or second, and a show bet realizes a positive return if the horse 1s first, second 
or third. Regardless of the outcome, all bets have limited liability. Unlike casino games 
such as roulette, but like the stock market, security prices (i.e. the "odds") are jointly 
determined by all the participants and the rules governing transactions costs (i.e. the 
track “take” and “breakage’’). To take the simplest case, all bets across all horses to 
win are aggregated to form the win pool. If W, represents the total amount bet by all 
participants on horse i to win, then W = $,W, is the win pool and WQ/ W, is the 
payoff per dollar bet on horse í to win if and only if horse i wins, where Q is the track 
payback proportion. 

The rules for division of the place and show pools are as follows. Let P, be the 
amount bet on horse j to place and P = XP, be the place pool. The payoff per dollar 
bet on horse j to place is 


i is first and j is second or 
jis first and į is second, (1) 
0 otherwise, 


1+[PQ-P,-P]/2P,) itf 


* Accepted by Donald G. Morrison as Special Departmenta! Editor; received December 13, 1983. This 
paper has been with the authors 2 months for 2 revisions. 


Efficiency of Racetrack Betting Markets 391 


682 D. B. Hausch and W. T. Ziemba 
392 D. B. HAUSCH AND W. T. ZIEMBA 


Thus if horses į and j are first and second each bettor on j (and also i) to place first 
receives the amount of his bet back. The remaining amount in the place pool, after the 
track take, is then split evenly between the place bettors and j and j. The payoff to 
horse j to place is independent of whether j finishes first or second, but it is dependent 
on which horse i finishes with it. A bettor on horse j to place hopes that a longshot 
with a smal) P; not a favorite will finish with it. 

The payoff per dollar bet on horse & to show is analogous 


. k is first, second or third 
1+[SQ- $- 5- S/B) i and finishes with i and j, (2) 


0 otherwise, 


where S, is the amount bet on horse k to show and the show pool is $ = X, S. 

Equations (1) and (2) are not quite correct as they do not account for "breakage". 
Breakage is discussed in 86; it is an extra commission resulting from payoffs being 
rounded down to the nearest 10€ or 20€ on a $2 bet. 


2. Racetrack Efficiency 


A market is efficient, see Fama (1970), if current prices fully reflect al! available 
relevant information. In this case experts should not be able to achieve higher than 
average returns with regularity. 

To investigate the efficiency of the racetrack's win market Snyder (1978) tested 
whether or not bets at different odds levels yielded the same average return. A 
weakly-efficient market in Snyder's sense would have the average rate of return for 
each odds level equal to Q, the track payback ratio which currently varies in North 
America from 0.852 in Ontario to 0.779 in Saskatchewan. His results suggest there are 
"strong and stable biases but these are not large enough to make it possible to earn a 
positive profit" (Snyder 1978, p. 1101). In particular, favorites tend to be underbet and 
longshots overbet. See Ziemba and Hausch (1984), hereafter referred. to as ZH, for a 
survey of the literature on this bias. 

To test the efficiency of the racetrack's place and show market HZR assumed: 

(1) If q; (i = L, ..., n horses) is the probability that j wins, then the probability that 
i is first and j is second is 


jid; ` 

ETE (3) 
and the probability that i is first, j is second and k is third ts 

4i4j4« (4) 


(1-417474). 


Harville (1973) developed and analyzed these formulas. 

(2) If W, is. the total amount bet on horse i to win and W = $75.,W, then 
qg; = W,/ W, i.e., the win market is efficient. While this assumption ignores the bias for 
favorites and longshots mentioned above there are reverse tail biases in the probabili- 
ties of finishing second and third. They occur because if the probability to win is 
overestimated (underestimated) and probabilities sum to one it is likely that the 
probabilities of finishing second and third are underestimated (overestimated). Tables 
2b,c in HZR indicate how these biases tend to cancel when they are aggregated to 
form the theoretical probabilities of placing and showing which are used in the HZR 


model.! 


'See HZR, 884-6. for a more in-depth discussion of the model. 
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Using equations (1)-(4), the Bernoulli capital growth model (see Ziemba and 
Vickson 1975), and assuming initial wealth is wọ, the HZR model to calculate optimal 
amounts to bet for place (p,,/=1,2,..., n) and show (s, /— 1.2, ...,n)is 


l Q(P + Ep- (ptp + Pj) 


2 
x Pi 5 
pit, B+P, 
`a g S Sas) - (5 sit st Sip 
Maximize Y, Y EFT EE LE , LEP tytn Sp) |. (s 
niim P gm c (l-X1- qj) 3 
jrit ij 5 j 5, 
x| —— + — + — 
GHS HS sts 
tw- 2 s- 2 p 
"E "P 
bei pk fwiy 
sL M (pts) < wo. p20. $20. I-V....n. 
fet 


The formulation (5) maximizes the expected logarithm of final wealth considering the 
probabilities and payoffs from the possible horserace finishes. For notational simplic- 
ity P,z P, + Pj and Sj, = S; + Sj Sy. 

The generalized concavity properties of (5) are discussed in Kallberg and Ziemba 
(1981). 

Using equations (1) and (3), the expected return on an additional $1 bet to place on 
horse 1 is: 


e [ 44 VN Q(P * 1) - (1o Pi £j) 
P= — —— Im 
EX => EYE I+ 5 "zu UR + x 20 


(6) 
INT[Y] means the largest integer not exceeding Y. The INT and multiplying and 
dividing by 20 accounts for breakage, i.e. the payoffs on a $2 bet being rounded down 
to the nearest 10€ in this instance. The 9,9,/(1 — q,) term is the probability that | is 
first and j is second while the q,4,/(1 — qj) term is the probability that j is first and 1 is 
second. A similar equation is available for show bets (see HZR). 

One might hope that when EX? or EX| equals say, a, the actual average rate of 
return would be near a or at least increasing in a. Despite the inherent inaccuracies in 
assumptions (1) and (2) this is indeed the case as is borne out in Table 3 in HZR and 
Table 5.1 in ZH. In fact for cases of a above about 1.02, positive profits seem 
realizable. These profits are maximized when a is about l. 16. 

An ideal model is then: 

(i) for each / check EX? and EX; to decide whether or not to bet on horse i, and 
then 

(ii) solve (5) to determine the optimal bet size after setting p, and s, 5, to zero for horses 
you definitely do not want to bet on. 

This analysis presumes one can bet after all other bettors have placed their bets. In 
practice one can only attempt to be one of the last bettors. Thus it is extremely 
important that one be able to perform all calculations necessary to determine the bet(s) 
quickly so that the bets can be placed as close to the end of the betting period as 
possible. 

The approximations developed in HZR are regression schemes with the minimal 
data inputs: wo, Wie, Pj, Wi, S., W, P and S, where i* = argmax,((W,/ W)/(P,/ P)) 
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and j* = argmax,(W;/ W)/(S,/ S). The ratios (W;/ W)/(P,/P) and (W,/W; 
/(S;/ S) may be thought of as simple measures of the inefficiency to place on i and 
show on j respectively. The regressions were calibrated for an initial wealth wo betweer. 
$2500 and $7500 and a track about the size of Exhibition Park in Vancouver, B.C 
(daily handle about $1.2 million) with Q = 0.829. Using this regression scheme on the 
Exhibition Part data with an initial wealth of $2500 and updating wealth over time 
resulted in a final wealth of $7698 at the end of the 110-day season (see Figure 3 in 
HZR). 

We now extend equations (7)-(10) in HZR to account for different wealth levels. 
different size tracks, different Q's, and coupled entries. 


3. Track Size and Wealth Level 


Track size and wealth level do not affect the expected return per dollar bet to place 
or show. However both are important factors in determining the optimal amounts to 
bet to place and show because the larger the betting pools at the track the less our bet 


TABLE ! 
The Optimal Place Bet for Various Betting Wealth Levels and Place Pools Sizes 


W, = $500 


Ws = $10,000 


Place Pool 261g + 2569? + 180g? 426g + 8029? 487g + 901g? 
ous 19997, \ 4599P; ue 521gP, 
TUE qRy- 0.10P, [P2] qP — 0.60P, [P5] ( qP — 0.60P, ) l 
Place Pool 399 + 52g? 375g + 5254? 1,3074 + 1.2802? 2,4979 + 1.8064? 
$10,000 _ 250% rei LN 9024? 2,0734? 
n "(ez )' ns AL l NEM qd 
993gP, 2,199 gP, 
-| gP- 0307; } 9 | -A gP 0697, ] | 
qP — 0.10P, qP — 0.60P, 


Place Pool 


= $150,000 


505g + 52147 2,3869 + 2,6689? 7,0729 + 10,4704 
q q q q q q 


3864P, 1.877qP, 5,2739P, 
-({ —— pa | -[———P-[———c |u 
qP — 0.60P, qP — 0.60P, qP — 0.70P, 


TABLE 2 
The Optimal Show Bet for Various Betting Wealth Levels and Show Pool Sizes 


W, = $50 Wo = $500 Wo = $2,500 


Show Pool 9 + 9949? — 464g? 13 + 1.5494? — 9012? 
$1,200 - - POL \ is _ 30395; _\ tss] 
qS - 0.805, ( qS — 0.605; 


Show Pool | 10+ 183g? - 1354? 86 + 1,5169? 53 4.52194? 58 + 7,406? 
115, 
- = psum ND = 3 = 3 is 3 
$6,000 ( "m ss | [S] 968g 2,5139 421g 
90.7 S, 93445, 1,35945, 
-| —— e J 1531 -| -a l6] -|§ — Ith 
gS — 0.855; gS — 0.708; gS — 0.655, 


1,682 + 28,200g? 
— 16,8804? 


12695, Y. 
B ( g5- oes | 5 


Show Pool 
= $100,000 


13] + 2,1507? 533 + 9.8624? 
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influences the odds and as our wealth increases we tend to wager more. Therefore new 
regressions were calculated for most reasonable track sizes and wealth levels. The data 
for these regressions were the true optimal bets from the NLP model (5) over a broad 
range of wealth levels and track sizes. The ideal data to represent a broad range of 
track sizes would be a season's data from many different tracks. A more practical 
alternative was to multiply the Exhibition Park data by varying constants to simulate 
data from smaller and larger tracks. Nineteen different regressions which appear in 
Tables 1 and 2 were determined for place and show depending upon wealth and track 
size. These regressions labelled P!-P10 and S1-S9 give the optimal place or show bet 
for specific values of initial wealth and size of pool. For intermediate values of these 
variables one may determine accurate betting amounts by taking convex combinations 
of these basic regressions. For example, for a place bet with initial wealth $1000 and 
place pool of $5000 the optimal bet is a, (equation [P,]) + a, (equation [P3]) + as 
(equation [P,]) + a, (equation [P,]), where 


= PEE = = 3 = = = = 
a-li-ü. asijs aalgag. atlas 


32° 32° 


4, Track Payback 


Both the expected return per dollar bet and the optimal bet size are increasing 
functions of Q, the track’s payback. Equations (7) to (10) in HZR were calculated with 
Q = 0.829. Modification of these equations for use at tracks with Q # 0.829 are now 
developed. 


4|. Adjustment of EX? and EX; for Q 
The regression equivalent for EX? when Q = 0.829 is: 


W,/W 
PP 


EX? = 0.319 + 0.559 (7) 


How can this equation be adjusted for a track payback different from 0.829? The 
“true” expected return on a one dollar place bet on horse i is 


a 9:9) qig I OP - (P; * 2 

EX? = + ] + ——————— |. (8) 
zu l- qj 2P; 
ivf 


Equation (7) is linear in Q with 


Using 124 Exhibition Park races with EX? in the range 1.10 and greater, the true 
9EX^/9Q was regressed against q; to give QEX//8Q «2.22 — 1.294, (R? = 0.86, 
SE — 0.055, both coefficients highly significant). Therefore when the track payback is 
Q, the expected return on a $1 place bet can be approximated by adjusting (7) to 


EX? = EX? + (222 - 1.299,)( Q — 0.829) 


W/W 


| 4 (222 - iae[ T )r Q-0829. (9) 


? Note that the coefficients of EX! here are different from those of equation (7) in HZR. In HZR only 
cases of expected return greater than 1.16 were considered. Here we may wish to bet on horses with expected 
returns as low as 1.10 to reflect a high quality track and high quality horses. Thus the EX? had to be 
recalculated to be accurate in the range 1.10 to 1.16. A similar change will be noted for EX;. 
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A similar analysis for show yields 
EX! = EX; + (3.60 — 2.139,)( Q — 0.829) 
(R ? = 0.565, SE = 0.198, both coefficients highly significant) 


= 0.543 + 0.369( d + (3.0 - 2i " ) Q — 0.829). (10) 


43. Adjustment of p* and 3* for Q 


Equations (8) and (10) in HZR were calibrated for Q = 0.829. Since the true p* and 
s* are nondecreasing in Q which varies from track to track we must adjust p* and $* at 
tracks with Q # 0.829. 

The exact NLP model was used on a number of Exhibition Park examples to 
compute the optimal place or show bets at different initial wealths, track sizes and 
different Q's (from 0.809 to 0.859). The results indicated that Ap*/AQ and As*/AQ 
are independent of Q in this range. Therefore with a AQ of 0.01, Ap*/AQ and 
As* /AQ were regressed on p*, wo, P,, P and s*, wo, S,, S, respectively. The analysis 
showed for Ap*/AQ that p* and wọ were very significant independent variables but 
neither P, and P were significant; similar results for As* /AQ were observed. Then the 
(Ap* /AQ, p*, wg) and (As* /AQ,s*, wg) were aggregated (due to a. small number of 
place data points) to give: 


Ap*/AQ] _ p 
aao | - 96] P 


k 
(R 2 = 0.948, SE = 2.23, n = 56, both coefficients highly significant). 


+ 0.000351 wg 


Therefore p* and $* (i.e. p* and 5* adjusted for Q) are: 
pt =pt+(Q- 0.829)(3.16p* -*0.0351w;) and (11) 
$* = $* + ( Q — 0.829)(3.165* + 0.0351 wọ). (12) 


4.3. Example Involving Q # 0.829 


May 7, 1983-Kentucky Derby at Churchill Downs, Louisville, Kentucky. The final 
win and show pools and the win and show bets on #8, Sunny’s Halo, were: 


W = $3,143,669, W, = $745,524, 
S = $1,099,990, Sg = $179,758. 


Using just the EXi portion of equation (10) gives EX; = 1.08, i.e. not enough to 
consider a bet if one is using a typical cutoff of 1.10 as recommended in ZH for a race 
like the Kentucky Derby. But Kentucky has Q = 0.85 and therefore it is more accurate 
to use equation (10) resulting in EX} = 1.14. Hence a show bet should be made. With 
an initial wealth of $1000, Table 2 gives the optimal show bet as 5* = $48. The 
correction for Q =.85 using equation (12) yields $* = $52. Sunny's Halo won the 
Derby and paid $4.00 per $2.00 bet to show so the $52 bet returned $104 for a $52 
profit. Full details on this race appear in ZH. 


5. Coupled Entries 


Occasionally two or more horses are run as a single “coupled entry” or simply 
“entry” because (1) an owner or a trainer has two or more horses in the same race, or 
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(2) there are more horses than the toteboard can accommodate (commonly called a 
field). The entry wins, places or shows if just one of the horses wins, places or shows. If 
any two of the horses in the entry come first and second all the place pool goes to the 
place tickets on the entry. If two of three of the 'in-the-money' horses are the entry 
then typically two thirds of the show pool goes to the draw tickets on the entry (rather 
than the usual third). 

Suppose the coupled entry has number | and let q, = W,/ W. Then q, estimates the 
probability that one of the horses in the entry will win the race. Suppose further that 
qia and qig (with qj, + gig = 4j) are the correct winning probability estimates of the 
two horses in the entry. Using q, (Le. thinking of the entry as a single horse) and 
equation (3) to calculate the probability of the entry placing gives 


41di 
l- qi ` 


Using 4,4, and qa (ie. thinking of the entry as two horses) and equation (3) to 
calculate the probability of the entry placing gives 


Pr(entry 1 is Ist or 2nd) = q, + >> (13) 
i22 


Pr(entry 1 is Ist and/or 2nd) = Pr(1A is lst and any horse but 1B is 2nd) 
+ Pr(1B is Ist and any horse but 14 is 2nd) 
+ Pr(14 is 2nd and any horse but | B is Ist) 
+ Pr(18 is 2nd and any horse but I is Ist) 
+ Pr(14 and |B are Ist and 2nd in either order) 


VAT . Y LED. 


= » Vai 9184 44: 
Caloga Ab 1- te 221-4 


+5 Fie Nadia + VIVEIT: 


+ , 
21-4 1-44 1-48 
which equals 
n (Qia + d:2)4i 
qutqe+ ETT 


which is equation (13). Hence considering the entry as two horses does not affect our 
estimate of the entry’s probability of placing. It does, hawever, affect our estimate of 
the expected return on a dollar bet to place since the possibility of a 14-18 or 1 B-14 
finish exists and for those finishes the place payoff will be high since the whole place 
pool (net of the track take and breakage) gaes to the holders of place tickets on the 
entry 1. Thus equation (9) will underestimate EX?. For the same reason p*, from Table 
l, will also be underestimated. We now consider the use of Tables 1 and 2 and 
equations (9), (11), (10) and (12) on EX», p*, EX' and $*, respectively, to account for 
an entry. 


5.1. Adjustments of EX( and EX; for Coupled Entries 


Equation (6) gives the expected return on a dollar bet to place on a horse, 
considering the entry | as a single horse. Ignoring breakage, this expected return is 


(00, 9 \(), 2PH P 
exe = x (125 + A f+ 2P, 
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More properly considering entry | as two horses, 1A and 1B, the expected return on a 
one dollar bet to place is 


EXLaa = D [ pot + Bat ie E poes 1A and 


1-41 1-4 2P, i:place 
+ S (hee 9189: + det | D QP - (P, + P;) { horses 1 and 
El- ge 1-4] 2P, i place 
+{ didis | didis 1+ QP — P, (roe 1A and 
l— qia | — qig P, 1B place. 


Let A? = EX?, ,, — EX. It is generally the case that the two horses in the coupled 
entry are not of equal ability. We assume that g,, — 2g, and qa = 1q,. 
Using the Exhibition Park data,’ A? was regressed on W ,/ W and P,/P, giving 
P, 
dp = 0.867 77) — 0.857 22 (14) 


(A? = 0.996, SE = 0.00267, and both coefficients highly significant). 


Then using equations (7) and (18) the regression approximation for expected return on 
a one dollar bet on the coupled entry | is 


W/W 
P,/P 


The same procedure for the expected return on a one dollar show bet on the coupled 
entry | yields 


- s. Ww p 
EX?, ig = EX? + A? = 0.319 + oss + 0.867 W — 0.857 T . (15) 


W,/W 
S/S 


W, Sı 
+ 0.842 = — 0.810. (16) 


EX; ig = EX; + A? = 0.543 + E d 


52. Adjustments of p* and s* for Coupled Entries 


When the possible bet is on an entry, equations (7) or (9) underestimate the expected 
return on an additional dollar bet to place. Therefore the optimal bet from the NLP (5) 
will underestimate the true optimal coupled entry place bet. To understand this 
phenomenon many Exhibition Park examples were solved using the exact NLP (5) 
assuming the entry was one horse. Then the same examples were solved supposing the 
entry was two horses (the formulation of the NLP was adjusted to consider the 
possibility of the two horses finishing first and second and then receiving a high place 
payoff) but the win bet on the entry was lowered, using an iterative scheme, until the 
optimal bet was the same as the optimal bet assuming the entry was one horse. This 
procedure gave pairs of GP and q,, where GP was the probability of entry i winning in 
part a (thinking of the entry as one horse); and g, was the adjusted probability that 
gave the same optimal bet when thinking of the entry as two horses as was observed 
when treating it as one horse. 

Since the regression formula (13) gives the approximate optimal place bet when the 
horse's probability of winning is g,, then using g? in that formula gives the approxi- 
mate optimal place bet when the coupled entry's probablity of winning is q;. 


`The data used were cases where the expected return (equalion (7)) was > 1.16. ie.. the cases of interest, 
For instances with a low expected return the correction factor is meaningless. 
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The regression relating g? and q, is 
d = 0.9919, + 0.13742 + 3.47 x 1077W, (17) 


(R? = 0.9998, SE = 0.00161, all coefficients highly significant). 


The examples from which the data were derived spanned many wealth levels and pool 
sizes. While the wealth level was a very significant independent variable the pool size 
was found to be statistically insignificant. Therefore to compute the optimal place bet 
on coupled entry į the procedure is: 

(0) determine that EX? is large enough to consider a place bet, 

(1) set di = Wi/ W, 

(2) determine 4^, and 

(3) substitute 9”, wo, P, P, in Table 1. 

The same procedure was carried out for the optimal show bet on coupled entry i: 

(0) determine that EX” is large enough to consider a show bet, 

(1) set q = W;/ W, 

(2) determine 


G7 = 1.079; + 4.13 x 1077W, — 0.00663, (18) 


and (R? = 0.999, SE = 0.00298, all coefficients highly significant), and 

(3) substitute q/, wọ, S, and S;, in Table 2. 

Three additional questions to be considered are: (1) Will the results be different with 
a weighting other than 1/3 and 2/3 on the two horses in the entry? (2) Should there be 
larger adjusunerits for entries of three or more horses? and (3) Should there be 
adjustments of the equations for a single horse running against a coupled entry? The 
answer to all three questions is yes, but how important is it to account for these 
possibilities? (1) The coupled entry adjustment appears to be fairly robust to the 
weighting. Also each race would require handicapping to determine its more accurate 
weighting. Since that goes beyond the scope of this research (in fact the system 
described in ZH requires absolutely no handicapping), we choose to opt for no 
adjustments for different weightings. In the extreme case where g,, ez qu and qg œO it 
may be better to treat the entry as a single horse and ignore the entry adjustments. (2) 
The additional benefits of three or more horses in an entry are small beyond 
accounting for the entry as two horses. Thus we suggest the simplification of no further 
adjustment. (3) Generally the expected return equations on a single horse running 
against an entry will overestimate the true expected return. In most cases though the 
bias is small and again we suggest no further adjustment. 


6. Multiple Bets 


Occasionally there is more than one system bet in a given race. Since the optimal bet 
equations in Tables 1 and 2 were calibrated assuming only one place bet or one show 
bet in a race it is not correct, in a multiple betting situation, to calculate each bet 
individually from the tables and then wager those amounts. Often that would result in 
overbetting but there are also times where, for diversification reasons, that would 
actually result in underbetting. i 

We have attempted to deal with the most common multiple betting situation—a 
place and show bet on the same horse. Ninety-eight cases of place and show system 
bets on the same horse were analyzed covering a wide range of track handles, q;'s, and 
wgo's. Using the optimization model (5) resulted in the quadruples ( pf, sz, p7,s7). The 
pi is the optimal place bet supposing it is the only good bet in the race, sf is the 
optimal show bet supposing it is the only good bet in the race, and (p7,5£) is the 
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Ficure ]. Betting Wealth Level Histories for System Bets at the 1981-82 Aqueduct Winter Meeting 
Using an Expected Value Cutoff of 1.14 for Track Takes of 14%, 15% and 17%. 


optimal pair of place and show bets when they are considered together. The p$ and s$ 
are the values we should calculate from Tables | and 2. Then p? was regressed on pł, 
så, Wọ P, P and q,. The only statistically significant independent variables were p? 
and s1 leading to the regression equation 


pt = 1.59p3 — 0.63954 (19) 
(R? = 0.967, SE = 73.7, both coefficients highly significant). 


A similar procedure for s# yields 
$$ = 0.90757 — 0.134p7 (20) 


(R? = 0.992, SE = 72.6, both coefficients highly significant). 


7. Effect of the Track Take 


The track take is a commission of 14-22% on every dollar wagered. As mentioned in 
$4 a change in Q, the track payback proportion, can have a substantial effect on EX? 
and EX}. A change in Q can also have a surprisingly dramatic effect on long run 
profit. This is illustrated in Figure 1 using data from the 1981-82 Winter Season at 
Aqueduct, New York* and supposing three different track takes: 14%, 15% and 17%. 
In 1981-82 the track take was 15%, earlier it had been 14% and just recently it has 
increased to 17%. Assuming an initial wealth of $2,500 the final wealths are $7,090, 
$6,292 and $5,058 at track’s takes of 14%, 15% and 17% respectively. The track take 
increasing from 14% to 15% dropped profits by $798 (17.4%). The track take increasing 
from 15% to 17% dropped profits by $1,234 (32.5%). 


8. Effect of Breakage 


In addition to the track take, bettors must also pay an additional commission called 
breakage. This commission refers to the funds not returned to the betting public 
because ‘the payoffs are rounded down to the nearest 10€ or 20€ on a $2 bet. For 
example, a payoff net of the track take of $6.39 would pay $6.30 or $6.20, respectively. 


*The data consisted of the final win, place and show mutuels for the 43-day period December 27, 
1981—March 27, 1982. During this period 3,470 horses ran in 380 races. Thanks go to Dr. Richard Van 
Slyke for collecting these data for us. 
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Ficure 2. Wealth Level Histories at Exhibition Park (1978) with Alternative Breakage Schemes. 


Breakage occurs in the win, place and show as well as other pools. We refer to 
rounding down to the nearest 10€ on a $2 bet as "5€ breakage” that is 5€ per dollar 
and rounding down to the nearest 20€ on a $2 bet as "10€ breakage”. Initially most 
tracks utilized a 5€ breakage procedure. However, in recent years more and more 
tracks have switched to 10€ breakage. The 10€ breakage is never less than 5€ breakage 
and usually is considerably more. As a percentage of the payoff breakage usually 
increases as the payoff becomes smaller unless the payoff is close to the breakage 
roundoff amount. An exception is the “minus pool" when $2.10° must be paid on a 
winning $2 ticket even if the payoff before breakage is only $2.08, or even $1.73. 

On average bettors lose about 1.79% of the total payoff to 5€ breakage and 3.14% to 
10€ breakage on bets using the system in ZH. Adding these amounts to the track take 
gives the total commission. For example, at Churchill Downs in Louisville, Kentucky 
the 15% track take becomes about 18.1% with their 10€ breakage, and at Exhibition 
Park in Vancouver, British Columbia the 15.8% track take becomes about 17.6% with 
their 5€ breakage. Thus to determine the full commission at a given racetrack one must 
take into account the breakage as well as the track take. 

The full extent of the effect of breakage is shown in its effect on profits. Using the 
1978 system bets for Exhibition Park with an initial wealth of $2500 one would have 
$8319 at the end of the year without breakage. With 5€ breakage one would have 
$7521 and $6918 with 10€ breakage. The effect of breakage throughout the 1978 
season is shown in Figure 2. In addition to taking money away from total wealth the 
breakage has the effect of lowering the bet size (because of this lower wealth) thus 
resulting in lower future profits. These calculations indicate that 5¢ breakage averages 
13.7% of profits and 10€ breakage averages 24.1% of profits on system bets. 

These calculations indicate that breakage (especially the very common 10¢ variety) 
is a very substantial cost. The costs are highest when one is placing bets on short odds 
horses. This is, unfortunately, an unavoidable aspect of the system presented in ZH. 


9. Results Using the Betting System 


In ZH many examples are presented showing precisely how to use the system. In 
Table 3 we present summary statistics on system bets made at several tracks over 
different seasons. The data sets for Aqueduct, Santa Anita and Exhibition Park 1978 
were collected after their seasons finished. The Exhibition Park 1980 and Kentucky 
Derby Days data were collected race by race at the track. In all cases initial betting 
wealth is assumed to be $2,500. The different expected cutoff levels reflect the quality 
of the horses at the different tracks. 

The most common system bet is to show on a favorite. Show and place bets occur 
about 85% and 15% of the time, respectively. The percent of bets won is about 59% 


5In Kentucky it is $2.20. 
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while the percent of bets won weighted by the size of the bet is 71%. This difference is 
because the bets are on shorter odds horses which finish in-the-money more often. Ata 
track such as Santa Anita with large betting pools, our bets do not affect the odds very 
much and the average bet is about 7% of the betting pool. Over all these thousands of 
races and hundreds of system bets the total amount wagered was $162,477. The track 
take was $26,035. Our profit was $18,505, for an 11.4% rate of return on dollars 
wagered. The higher rates of return were on the races where we were at the track, so 
we were able to skip rainy days and reject certain horses on the basis of very simple 
handicapping rules. The lower rates of return, as expected, were at the tracks where we 
had no information other than the win, place and show mutuel pools. A simple 
correction which has a surprisingly large effect is removing from the Exhibition Park 
1978 data the days when the track was not a fast track, i.e. rainy days when the track 
was slow, muddy, heavy, sloppy, etc. Doing so decreases the total money wagered 
from $49,99] to $32,811 but increases the profit from $5,198 to $5,863. Thus the rate 
of return on the “fast track" days is 17.9%, up from 10.4% over all days. It also 
increases the rate of return over al! the racetracks from 11.4% to 13.296. 

Since the bets usually have a high EX^ or EX; we might expect a rate of return 
around 1695-2095. Remember that EX? and EX; are on the first dollar bet. These 
values drop as we bet large amounts due to our bets affecting the odds. 

These profits do not consider several minor "entertainment type" costs that one 
must incur by actual attendance at the track to apply the system. Parking, gasoline, 
racing program, racing form, track admission and food amount to $3-10 or more. For 
example, at $10 per day Aqueduct's profits of $3792 over 43 days become $3362. 

Finally the average payout per $2 bet ranged from $2.97 to $3.33 at the various 
tracks, with an average of $3.12. This value is actually a high show return when one 
considers the heavy favorites the system often picks. 


10. Will the Market Become Efficient? 


As more and more individuals use this system the markets for place and show 
betting will tend to become efficient. Two important questions are: (1) How many 
people can play this system and still have it provide a return of 10-20%? and (2) How 
many people can play this system before the market becomes efficient enough that 
expected profits are zero? 

To consider the first question we can determine how much additional money can be 
wagered on a particular horse to place or show before the expected value per dollar bet 
drops to the suggested cutoff for good betting opportunities. As an illustration Figure 3 
indicates this amount to show for a cutoff of 1.14. These figures are based on a track 
take of 17.1% so for lower track takes more can be bet and for higher track takes less 
can be bet. 
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FiGuRE 3. How Much Can Be Bel. B. by System Betiors Relative to the Crowd's Show Bet, S,, on Horse 
i to Lower the Expected Value to Show on the Horse i from Z to t.14, When the Track Take is 17.1%. 
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An example that more directly answers questions | and 2 is provided below and is 
based on the data given in $4.3 on Sunny's Halo, the 1983 Kentucky Derby winner. 


Additional examples appear in ZH. 


Sunny's Halo 


Betting Wealth wo 


Optimal System 

Bets (Using the 
Actual Data One 
Minute Before Post 
Time) Assuming a 
Betting Wealth of wo 


Total Amount Number of System Bettors Needed to 
That Can be Bet Drop the Expected Return per Dollar Bet 
Before the Expected to a Assuming a Betting Wealth of 


Return per Dollar 


Bet Drops to a $200 $500 $1000 $2000 
1,757 623 372 201 
3,743 1,328 792 429 
6.219 2,207 1,316 713 


Our results show that a = 1.02 is a breakeven cutoff and a = 1.06 yields a rate of 
return of about 5-695. 
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Abstract 


The betting strategy proposed in Hausch, Ziemba and Rubinstein (1981) and Ziemba and Hausch 
(1984, 1987) has had considerable some success in North American place and show pools. The place pool 
is England is very different. This paper applies a similar strategy with appropriate modifications for 
places bet at British racetracks. The system or minor modifications also applies in a number of other 
countries such as Singapore with similar betting rules. The system appears to provide positive expectation 
wagers. However, with the higher track take it is not known how often profitable wagers will exist or 
what the long run performance might be. 


I. Introduction 


At North American racetracks, the parimutuel system of betting utilizing electric totalizator boards 
in the dominant method of betting. Las Vegas and the other legal sports books may set odds on particular 
betting situations, but these fixed odds are not available at racetracks. In England and in other 
Commonwealth countries, such as Italy and France, odds betting against bookies is the dominant betting 
scheme. This fixed-odds system is introduced in Hausch, Lo and Ziemba (1994). They also indicate a 
tendency of higher (lower) returns for lower (higher) odds ranges. Thus, the favorite-longshot bias 
appears to exist in England (see e.g. Ali (1977), Busche and Hall (1988)). 


This paper applies the betting system proposed by Hausch, Ziemba and Rubinstein (1981) and 
Hausch and Ziemba (1985) in England. This strategy is also called the Dr.Z system in the trade books 
Ziemba and Hausch (1984,1987) who discuss it more fully. The strategy utilizes the Kelly criterlon (Kelly 
(1956)) which maximizes the expected logarithm of wealth. The Kelly criterion has several advantages. 
First, it maximizes the capital growth asymptotically. Second, it prevents bankruptcy. Third, the expected 
time to reach a specified goal is minimum when the goal increases. Fourth, it is superior to any different 
strategy in the long min. These properties were proved in Breiman (1961). See McLean, Ziemba and 
Blazenko (1992) for discussion of these properties. 


II. The System 


Instead of the North American parimutuel system of win, place, and show, the bets in England 
are to win and place. By "place" the British mean "finish in the money." This is what North Americans 
call show except for one important difference. The number of horses that can place in a particular race 
is dependent on the number of starters. 


! Modified from an Appendix in Ziemba and Hausch (1987). 
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Table 1. Relationship between number of horses that place and number of starters 


Number of Horses that Place 
one: the winner 
two: winner and second 
three; winner, second, and third 


"UT 


| 


four: winner, 


The place pools are not shown on the tote board, but the current payoffs for place bets for each 
horse are flashed on the screen. Bookies, on the other hand, simply pay a percentage of the win odds, 
as shown in Table 2. 


There are many types of exotic bets as well. The tote jackpot corresponds to what North 
Americans call the pick six or sweep six. The tote placepot bet has no analogue in North America. The 
average rates of return on various bets on and off course against a bookmaker or the tote are listed in 
Table 3. The track take is 5% larger in the place poo) than in the win pool. The tote take is larger than 
what the bookies make on average, and on-course betting takes are much less than off-course takes. 


The races in England are on the turf for distances of generally at least a mile, except for some 
shorter races for two-year-olds. The season in southern England is unique in that races are run for about 
three days at each race course. The jockeys, trainers, and so forth then move on to a new course. After 
a month or so they return to the same course. Handicapping is very sophisticated in England. It has to 
be, with little information easily accessible (they have no analogue of the Daily Racing Form, although 
some past performances are available in newspapers) and all that moving from course to course. 


The method of computing the place payoffs in England differs from that used in North America. 
In both locales, the net pool is the total amount wagered minus the track take. In North America, the cost 
of the winning in-the-money tickets is first subtracted to form the profit. This profit is then shared equally 
among the in-the-money horses. Holders of winning tickets receive a payoff consisting of the original 
stake plus their proportionate share of the horse's profits. This means that the amount of money wagered 
on the other horses in the money greatly affects the payoff. In England, the total net pool is divided 
equally among the horses that finish in the money. This means that the payoff on a particular horse 
depends upon how much is bet on this horse to place but not on how much is bet on the other horses. 
Since the minimum payoff is £1 per £1 wager, management is able to keep a control on betting for 
particular favorites. Once this minimum level is reached, it does not pay to wager on a given horse. This 
occurs whenever the percentage of the place pool that is bet on a given horse becomes as large as Q, 
which is the track take for place, divided by m, which is the number of in-the-money horses. In a race 
with 8-15 starters, if Q, is about 0.735, and m=3, the just-get-your-money-back point is reached when 
the bet on a particular horse to place becomes 24.5% of the total place pool: 0.735/3 = 24.5%. Hence 
in England you will often see horses whose place payoffs are £1 or just slightly higher. This method of 
` Sharing the place pool tends to favor longer-priced horses at the expense of the favorites. 
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Table 2. Bookmakers payoff for place bets 


Fracdon of Win 
Number of Odds Paid on Hoss 
Runners Type of Race Place Element Regarded 
Two to five No place bening 
Six or seven Any 1 First and 
second 
Eight or more Any except 1 First, 
handicaps second, 
involving and third 
twelve or 
more 
runners 
Twelve to fifteen Handicaps 1 First 
second, 
and third 
Sixteen to twenty-one Handicaps H First, 
second, 
third, and 
fourth 
Twenty-two or more Handicaps I First, 
second, 
third, and 
fourth 


Source: Rorhschiic (1978). 


Table 3, Rates of return on different types of bets in England on thoroughbred and greyhound racing 


Rare of Rerum 


Type of Bet (%) 
On-course bookmaker 90 
Off-course bookmaker 81 
Single bet to win with off-course bookmaker B5 
Double bet to win with of-course bookmaker 78 
Treble bet to win with off-course bookmaker 72 
ITV Seven bet to win with off-course bookmaker 70-75 
Computer straight forecast with of-course bookmaker 65 
Greyhound forecast with off-course bookmaker 7 
Greyhound forecast double with off-course bookmaker 58 
Place element of cach-way with off-course bookmaker 80 
Ante-post betting with off-course bookmaker 96 
Horse race tote win pool (on course) 80 
Horse race tote win pool (off course) 77 
Horse race tote place pool (on course) 7 
Horse race tote place pool (off course) 72 
Horse race tote daily double pool (on course) 74 
Horse race tote daily double pool (off course) 7 
Horse race tote daily teble pool (on course) 70 
Horse race tote daily treble pool (off course) 67 
Horse race tote daily forecast pool (on course) 7i 
Horse race tote daily forecast pool (off course) 67 
Horse race tote jackpot pool (on course) 70 
Horse race tote jackpot pool (off course) 67 
Horse race tote placepot pool (on course) ` 70 
Horse race tote placepot pool (off course) 67 
Greyhound tote pool betting, average 83.5 


Searre: Rothschild (1978). 
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The current track take to win is about 20.6% and to place is 26.5%, and the breakage is of 10¢ 
variety, or more properly 10p, for pence?. These track takes are much higher than those in North 
America. Since the track paybacks to win and place are different, we call the former, O.=0.794, and 
the latter, Q,=0.735. 


It is easy to apply the Dr.Z system in Great Britain, although with its much higher track takes, 
there may not be many Dr.Z system bets, see Mordin (1992) for a discussion of this. We utilize the 
substitution that q,— Q,/O,, where O, are the odds to win on the horse under consideration. 


The expected value per pound bet to place on horse í is 


EXPlace - (probability of placing) (place odds) - (Prob) (PO). (1) 


In (D, PO, refers to the odds to place on horse i. Prob, the probability of placing is determined as 
follows.? * 


7We can calculate these track takes as follows: The payoff on horse i if it wins is Q,W/W, where Q, 
is the track payback to win, and W, and W are the bet amount of horse i and the total bet amount, 
respectively. So let q = W/W, the efficient-market assumption. Let B be the average breakage, namely, 
4.5p. Since breakage can be 0,1,2,...,,9 pence, its average is 4.5p. Then the payoff on i is Q./q,B, 
which equals the odds O,, since the odds are based on total return (not return plus original stake as in 
North America). So q, = Q,/(B+0,). Summing over all n horses gives 


A, 1 
Ya -1- Q3 Gio 


i-i del 
since some horse must win. Hence 


Q- i 


3:100) 
i-1 


For place, there are one, two, three, or four horses that are in the money, depending upon the number 
of starters. So 


m 


Q,- ——, 


L (8«O) 


i=] 


where m=1,2,3 or 4. 
With the above formulas, Q, = 0.794 and Q, = 0.745. 


*These equations were developed using the 1981-1982 Aqueduct data to relate probability of in-the- 
money finishes to g, the probability of winning and z, the number of horses. Equations (2), (3) and (4) 
had R? of 0.991, 0.993 and 0.998, respectively. These equations are valid when q ranges from 0 to 0.6 
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With n=5 to 7 horses, the first 2 horses place and 


Prob - 0.0667 +2.37q - 1.619? - 0.0097n. (2) 


With n=8 to 15 horses, the first 3 horses place and 


Prob - 0.0665 + 3.44q - 3.47q* - 0.0049n. Q) 


With n—16 or more horses, the first 4 horses place and 


Prob - 0.0371 «4.47q - 6.294? - 0.00164n. (4) 


Figures 1,2, and 3 determine Prob directly using only O,, the win odds on the horse in question. 
Figure 1 applies when there are five, six or seven horses. Figure 2 corresponds to equation (3) and 
applies when there are eight to fifteen horses. Finally, Figure 3 corresponds to equation (4) and applies 
when there are sixteen or more horses. 


The optimal Kelly criterion bet is to wager (Prob PO,-1)/(PO,-1) percent of your betting wealth’. 
We can determine the optimal fraction of your wealth to bet indicated by equation (5) using Figure 4. 


for (2), from 0 to 0.45 for (3), and 0 to 0.3 for (4), which should be the case in most instances. 
However, Figures 1,2 and 3 are valid for any q. 


‘In a race with 1—2,3, or 4 horses, only one horse places, the winner. Such races are rare. Also, it 
is unlikely that the win and place pools would then become so unbalanced as to yield a Dr.Z system bet. 
However, one would occur when PO,/O, was at least 1.44, for a track payback of 0.794 and an expected- 
value cutoff of 1.14, since 1.14/0.794 is 1.44. Tn such a case, one would have a good bet. 


5We have assumed that your bets will be small and hence will not affect the odds very much. Thus 
to determine the optimal bet b for betting wealth w, you maximize Prob log(w, + (PO;-1)b] + (1- 
Prob)log(w;4), whose solution is equation (5). 
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Odds to Win on Horse / 


Figure 1. Probabilities of placing for different odds horses when the race has five to seven starters 


Place Probability for Horse / 


1 2 3 t 5 6 ? 8 9 0 
Odds to Win Horse / 


Figure 3. Probabilities of placing for different odds horses when the race has eight to fifteen starters 
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1 2 3 4 5 6 7 8 9 Y 
Odds to Win on Horse ! 


Figure 3, Probabilities of placing for different odds horses when the race has sixteen or more starters 
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rure 4. Optimal bets when the probability of placing is Prob and the place odds of the horse in question 
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Small Stocks* 


I. Introduction 


In two earlier papers (Grauer and Hakansson 
1982; Grauer and Hakansson 1984), we applied 
the multiperiod portfolio theory of Mossin 
(1968), Hakansson (1971, 1974), Leland (1972), 
Ross (1974), and Huberman and Ross (1983) to 
the construction and rebalancing of portfolios 
composed of U.S. stocks, corporate bonds, gov- 
ernment bonds, and a risk-free asset. Borrowing 
was ruled out in the first article, while margin 
purchases were permitted in the second. The 
probability distributions used were naively es- 
timated from past realized returns in Ibbotson 
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This paper applies 
multiperiod portfolio 
theory to the construc- 
tion and rebalancing of 
portfolios composed of 
U.S. stocks, corporate 
bonds, government 
bonds, and a risk-free 
asset, with small stocks 
included as a separate 
investment vehicle. 
Probability assessments 
are based on the past, 
joint empirical distribu- 
tion. Our principal 
findings are (1) small 
stocks, while totally ig- 
nored at times, entered 
even the most risk- 
averse portfolios most 
of the time and (2) 
small stocks, when 
chosen, tended to re- 
place common stocks, 
except in the 1970s and 
early 1980s, when they 
were primarily held in 
lieu of the risk-free as- 
set. 
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and Sinquefield (1982) and in their Center for Research in Security 
Prices (CRSP) 1926-83 data base, and both annual and quarterly hold- 
ing periods were employed from the mid-1930s forward. The results of 
both papers revealed that the gains from active diversification among 
the major asset categories were substantial, especially for the highly 
risk averse strategies. In addition, we found evidence of substantial use 
of, and gains from, margin purchases for the more risk-tolerant strate- 
gies from the mid-1930s to the mid-1960s. 

In the present paper, small stocks are included as a separate invest- 
ment vehicle. Thus the opportunity set is expanded to five categories: 
common stocks, government bonds, corporate bonds, small stocks, 
and a risk-free asset. The resulting sequential portfolio problem is 
solved both with and without leverage. When used, all borrowing is 
assumed to bear interest at the call money rate + 1% and to be limited 
in size by applicable initial margin requirements. Our principal findings 
are (1) small stocks, while totally ignored at times, entered even the 
most risk-averse portfolios most of the time, (2) small stocks, when 
chosen, tended to replace common stocks, except in the 1970s and 
early 1980s, when they were primarily held in lieu of the risk-free asset, 
and (3) small stocks had a salutary effect on the geometric means of 
realized returns, especially from the mid-1930s to the early 1950s in the 
presence of margin purchases. In the 1934—47 subperiod with leverage, 
for example, small stocks raised the geometric mean return for the 
power .5 strategy from 9.09% to 25.95% while reducing variability. 


H. Theory 


Despite explosive development over three decades and extensive ap- 
plication to the construction of equity portfolios, modern portfolio 
theory has found relatively little use in the larger portfolio context, 
namely, the choice of the proportions to be held in the major categories 
of common stocks and in different types of bonds, money market in- 
struments, real estate, and foreign securities. There are several reasons 
for this. First, extant portfolio theory, being principally based on the 
mean-variance model, is fundamentally single-period in nature, where- 
as the larger problem accents the multiperiod, sequential nature of 
investment decisions. On top of this, since the universe of interest 
extends well beyond common stocks, extant betas are too narrowly 
defined to be useful, and the appropriate betas are not easily estimated 
because of data problems concerning the market weights of bonds, for 
example. At the other extreme, continuous-time portfolio theory is 
somewhat intractable in a world of nontrivial transaction costs. Fi- 
nally, extant models rely heavily on narrow classes of theoretical (sta- 
tionary) return distributions, with limited ability to capture the richness 
of joint, real-world stochastic processes. 

There is, however, a middle category of investment models, usually 
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classified under the heading of discrete-time multiperiod portfolio 
theory, which has been largely ignored in portfolio selection applica- 
tions. This is so despite the fact that these models have a strong foun- 
dation in theory and lend themselves naturally to the problem of re- 
balancing portfolios over many periods (200 quarters in the present 
study). An additional virtue of these models is that they can handle 
return distributions of every conceivable shape and are unfazed by 
such things as nonstationary returns or distributions that are just plain 
unknown beyond the period just ahead. 

To review, consider the simplest reinvestment problem, in which the 
market is perfect and returns are independent over time but otherwise 
arbitrary and not necessarily stationary. The investor has a preference 
function Ug (with Uy > 0, Up < 0) defined on wealth wg at some 
terminal point (time 0). Let w, denote the investor's wealth with n 
periods to go, rj, the return on asset i in period n, z;, the amount 
invested in asset i in period n (with i = 1 being the safe asset), and 
U,,(w,,) the relevant (unknown) utility of wealth with n periods to go. At 
the end of period n (time n — 1), the investor's wealth is 


M 
Wy (Zn) = ` (Fin — NndZin + wall + rin), 
i=2 


where z = (z24,.. . , Zan) and M is the number of securities. 
Consider the portfolio problem with one period to go. The investor, 

with w, to invest, must now solve 

max E[Uo(wo(zi))] = Ui(w). 

M 
Clearly, U;(w,) represents the highest attainable expected utility level 
from capital level w; at time 1 and thus the "derived" utility of wy. 
Employing the induced utility function U,(w,), the portfolio problem 
with two periods to go becomes 

U2(w2) = max E[U,(w{(z2))]. 


Zaw 


Thus with n periods to go we obtain (the recursive equation) 


U,(w,) = max ELU, Q9 ZA], n = l, 2, ee’ 


Ta wa 


Examining the above system, it is evident that the induced utility of 
current wealth, U,(w,), generally depends on "'everything," namely, 
the terminal utility function Uo, the joint distribution functions of fu- 
ture returns, and future interest rates. There is, however, a special case 
in which U,(w,) depends only on Up: this occurs (Mossin 1968) if and 
only if Uo(wo) is isoelastic; that is, if and only if 

1 


Uo(wo) = E wy, yl, 
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(Note that for y = 0, Uo[wo] = In wo.) The variable U,(w,) is now a 
positive linear transformation of Ug(w,); that is, we can write 
1 


U,(w,) = — wà. (1) 
Yy 


For these preferences, the optimal investment policy Zh, is proportional 
to wealth; that is, 


zw. = xh wa all i, (2) 


where the Xin, are constants. It is also completely myopic since it only 
depends on Uo and the current period’s return structure and not on 
returns beyond the current period. Both of these properties hold only 
for family (1), which is also the only class of preferences exhibiting 
constant relative risk aversion.' Finally, (2) also implies that the utility 
of wealth relatives, V,(1 + r,), is of the same form only for this family; 
that is, 


U,(w,) = T wi <=> Val +r) = mae + ry). 


While the above properties are interesting, they are clearly rather 
special. However, the isoelastic family’s influence extends far beyond 
its numbers. As shown by Leland (1972), Hakansson (1974), Ross 
(1974), and Huberman and Ross (1983), there is a very broad class of 
terminal utility functions Up(wo) for which the induced utility functions 
U,, converge to an isoelastic function, that is, for which 


U,(w,) > T w} forsome y< l. (3) 


Hakansson (1974) has also shown that (3) is usually accompanied by 
convergence in policy; that is, 


zh > Xh Wn 


Thus the objectives given by (1) are quite robust and encompass a 
broad variety of different goal formulations for investors with inter- 
mediate- to long-term investment horizons.? In particular, class (1) 
spans a continuum of risk attitudes all the way from risk neutrality 
(y = 1) to infinite risk aversion (y = —9).? 

Having selected our model, we turn next to what we need to operate 
it. The major input to the model is an estimate of next period's joint 
return distribution for the various asset categories.^ As in our previous 


1. This measure is defined as — wU(wYU',(w) and equals 1 — y for the class (1). 

2. The simple reinvestment formulation does ignore consumption of the course. 

3. A plot of the functions (1/y)(1 + r)” for several values of y was given in Grauer and 
Hakansson (1982, p. 42). 

4. For a comprehensive overview of the issues and problems associated with the 
estimation of return distributions, see Bawa, Brown, and Klein (1979). 
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studies, we based this estimate on the so-called simple probability 
assessment approach. In this approach, the realized returns of the most 
recent n periods are recorded; each of the n joint realizations is then 
assumed to have probability I/n of occurring in the coming period. 
Thus estimates are obtained on a moving basis and used in raw form 
without adjustment of any kind. Since the whole joint distribution is 
specified and used, there is no information loss; all moments— 
marginal and conditional, for example—and every last shred of corre- 
lation are implicitly taken into account. It may be noted that the empir- 
ical distribution of the past n periods is optimal if the investor has no 
information about the form and parameters of the true distribution but 
believes that this distribution went into effect periods ago.? Conse- 
quently, we have, as a starting point, resisted all temptations to param- 
eterize the input distributions. 


III. Calculations 

The model used can be summarized as follows. At the beginning of 

each period t, the investor chooses a portfolio, x,, on the basis of some 

member, y, of the family of utility functions for returns r given by 
Vil +r) = +a op. (4) 


This is equivalent to solving the following problem in each period t: 


1 
max E+ (1 Exara + Xero + xorb] 
i 


“ (Sa) 
1 
= max Xm, — (1 + Lilies + Xut + Xgi 
X, S$ y i 
subject to 
Xj 20, x,,20, xg, = 0, all i, t, (6) 
DXi + Xz, + Xp, = l, all t, (7) 
i 
Em;xa = l, all t, (8) 
i 
pr(l + Xx + Xli + Xpil i 20) = 1, (9) 
F 
where 
y € 1 = a parameter that remains fixed over time; 
X= (Xin -> <, Xnb XLo XB); 
Xj = the amount invested in risky asset category i in period ¢ as 


a fraction of own capital; 


5. See Bawa et al. 1979, p. 100. 
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Xi, = the amount invested in the risk-free asset in period t as a 
fraction of own capital; 

Xg, = the amount borrowed in period t as a fraction of own 
capital; 

ri = the anticipated return on asset category i in period t; 

rp, = the return on the risk-free asset of period t; 

rg, = the borrowing rate at the time of the decision at the begin- 
ning of period t; 

mij, = the initial margin requirement for asset category i in period 
t expressed as a fraction; and 

T,, = the probability of event s in period t, in which case the 
random return r; will assume the value rj;,. 


As noted, when y equals zero, (4) reduces to the logarithmic utility 
function: 
V = In(1 +r). 


The equivalent of (5a) is then 


max E[In(1 t Danan + Xp rr, + Xp,rd)]. (5b) 


X, i 


Constraint (6) rules out short positions, and (7) is the budget con- 
straint. Constraint (8) serves to limit borrowing (when desired) to the 
maximum permissible under the margin requirements that apply to the 
various asset categories. Finally, since 


d 
Luin *oXpfüp + XBif Bi 
t 


is the (ex ante) return distribution for portfolio x,, the expressions in 
parentheses in (Sa) and (Sb) represent the (ex ante) wealth relative in 
period t. 

On the basis of the probability estimation method described earlier, 
the (sequential) solution of the portfolio problem may be pictured as 
follows. Suppose quarterly revision is used. Then, at the beginning of 
quarter ¢, the portfolio problem (5a) or (Sb) for that quarter uses the 
following inputs—the (observable) risk-free return for quarter t; 
the (observable) call money rate + 1% at the beginning of quarter t; 
and the (observable) realized returns for common stocks, government 
bonds, corporate bonds, and small stocks for the previous n quarters. 
Each joint realization in quarters t — n through t — 1 is given probabil- 
ity I/n of occurring in quarter f. 

With these inputs in place, the portfolio weights for the various asset 
categories and the proportion of assets borrowed are calculated by 
solving system (5a)-(9) (or [Sb]-[9) via nonlinear programming 
methods. At the end of quarter t, the realized returns on stocks, gov- 


6. The nonlinear programming algorithm employed is described in Best (1975). 
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ernment bonds, corporate bonds, and small stocks are observed, along 
with the realized borrowing rate r5, (which may differ from the decision 
borrowing rate r%,).’ Then, using the weights selected at the beginning 
of the quarter, the realized return on the portfolio chosen for quarter t 
is recorded. The cycle is then repeated in all subsequent quarters.® 

All reported returns are gross of transaction costs and taxes and 
assume that the investor in question had no influence on prices. There 
are several reasons for this approach. First, since this is one of the first 
studies in this area, we wish to keep the complications to a minimum. 
Second, the Ibbotson and Sinquefield series used as inputs and for 
comparisons also exclude transaction costs (for reinvestment of inter- 
est and dividends) and taxes. Third, niany investors are tax-exempt, 
and various techniques are available for keeping transaction costs low. 
Finally, since the proper treatment of these items is nontrivial, they are 
better left to a later study. 


IV. Data 


The data used to estimate the probabilities of next period's returns and 
to calculate each period's realized returns on risky assets are the total 
monthly and annual returns on common stocks, long-term government 
bonds, long-term corporate bonds, and small stocks for 1926-83 in the 
Ibbotson and Sinquefield CRSP data file; both dividends and capital 
appreciation are taken into account. The risk-free asset used for quar- 
terly revision was assumed to be 90-day U.S. Treasury bills maturing 
at the end of the quarter; we used the Survey of Current Business and 
the Wall Street Journal as sources. In the annual portfolio revision 
case, the risk-free return was obtained from the yield, as of the begin- 
ning of the year, on that U.S. government obligation (note or bond) 
that matured on the date closest to the end of the year in question; we 
obtained the 1926-76 data privately from Roger Ibbotson and the re- 
mainder from the Wall Street Journal. 

Margin requirements for stocks were obtained from the Federal Re- 
serve Bulletin. Initial margins were set at 10% for government bonds 
and at 35% for corporate bonds. These levels are on the conservative 
side and are designed to compensate for the absence of maintenance 
requirements.? 


7. The realized borrowing rate rg, was calculated as a monthly average. 

8. Note that, if n — 32 under quarterly revision, then the first quarter for which a 
portfolio can be selected is the first quarter of 1934 since the period 1926-33 is required to 
develop the estimated return distributions used for that quarter's portfolio choice. 

9. There was no practical way to take maintenance margins into account in our pro- 
grams. In any case, it is evident from the results that they would come into play only for 
the more risk tolerant strategies, and even for them only occasionally, and that the net 
effect would be relatively neutral. 
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TABLE 1 Composition of Fixed Weight Portfolios 


Proportion in: 


Government Corporate Small Risk-free 
Stocks Bonds Bonds Stocks Asset 

Portfolio A eel .40 .40 eel .20 
Portfolio B .20 30 30 ee .20 
Portfolio C 45 .20 25 eel .10 
Portfolio D 55 AS .20 ee .10 
Portfolio E .65 .10 .15 ee .10 
Portfolio F 75 ee A5 eel 10 
Portfolio G .90 .05 .05 ee eve 
Portfolio H .70 .05 .05 .20 

Portfolio I 50 .05 .05 .40 

Portfolio J .30 .05 .05 .60 


As noted, the borrowing rate was assumed to be the call money rate 
+1%; for decision purposes (but not for rate of return calculations), 
the applicable beginning of period rate, /5,, was viewed as persisting 
throughout the period and thus as risk free.'!° For 1934—76, the call 
money rates were obtained from the Survey of Current Business; for 
later periods, the Wall Street Journal was the source. 


V. Results 


Because of space limitations, only a portion of the results can be re- 
ported here. However, tables 2-6 below and figures 1 and 2 provide a 
fairly representative sample of our findings. 

For comparison, we have calculated and included the returns for 10 
fixed-weight portfolios. The compositions of these fixed-weight port- 
folios are shown in table 1. 


Portfolio Returns: The No-Leverage Case 


Table 2 compares the geometric means of the annual returns, along 
with the standard deviations of In(1 + rj), with and without small 
stocks for the subperiods 1936-47, 1948—65, and 1966-83 and for the 
full 1936-83 period, under quarterly portfolio revision (with a 40- 
quarter estimating period) in the absence of leverage. Note that the 
realized returns without small stocks differed substantially between the 
three subperiods. For example, the highest geometric mean was 7.7596 
for 1936-47, 15.69% for 1948-65, 7.38% for 1966-83, and 10.11% for 


10. It may be noted that the minimum differential between the borrowing and the 
lending rate was 1.4% (1935), while the maximum was 5.0% (1981). The differential was 
. 2% or less through 1952 and again in 1964-65. For most of the 1950s, 1960s, and 1970s, 
and again in 1983, the differential was between 2% and 3%; the exceptions were 1974 and 
1980, when it was 4.5%, and, as already mentioned, 1981. 
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the whole 1936-83 period. In contrast, the realized returns were much 
more uniform with small stocks included: the highest geometric means 
were then 15.79% for 1936—47, 15.27% for 1948—65, 14.97% for 1966- 
83, and 14.54% for the full period. 

For the 1936-47 subperiod, table 2 shows that the presence of the 
small stock category increased both the standard deviation and the 
geometric mean for each of the 16 strategies. These increases are rela- 
tively small for powers —75 through — 10 but quite substantial for the 
others, especially powers ~1 and up. The situation for 1966—83 is 
similar but less dramatic. In the 1948-65 subperiod, on the other hand, 
the presence of the small stock category again led to uniformly higher 
standard deviations but also to uniformly lower geometric means. The 
differences are quite small, however. Standard deviations also in- 
creased across the board for the full period, as did the geometric 
means. 


Portfolio Returns: The Leverage Case 


Table 3 shows the geometric means of the annual returns, and the 
standard deviations of In(1 + rj), with and without small stocks for the 
leverage case. This table is based on quarterly portfolio revision and a 
32-quarter estimating period. Again it is evident that the presence of 
small stocks tended to increase both the geometric means and the 
standard deviations of returns. This was uniformly so in the 1966-83 
subperiod as well as for the full 1934—83 period. This pattern was 
broken in only two cases: in the 1934-47 subperiod, when powers 0, 
.25, and .50 achieved lower standard deviations in the presence of the 
small stock category, and in the 1948-65 subperiod, when powers —3 
through 1 attained higher geometric means in the absence of small 
stocks. 

Table 3 also shows that small stocks had a rather uneven effect 
across the three subperiods. In the 1934—47 subperiod, they provided 
an enormous lift to most geometric means with very little effect on 
standard deviations. On the other hand, their effect in the 1948—65 
subperiod was minimal. The effect in the last subperiod was fairly 
modest, while the effect of small stocks on returns over the full 50 
years was rather substantial. 


Portfolio Compositions 


An examination of the strategies used reveals some interesting pat- 
terns. In the no-borrowing case, small stocks were ignored by all the 
active strategies, including the risk-neutral one, from the third quarter 
of 1955 through the first quarter of 1962, in the third quarter of 1963, 
and again in the first quarter of 1979. The rest of the time, however, 
small stocks tended to be an important investment outlet, mostly at the 
expense of common stocks. Even the ultraconservative power —75 
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TABLE 2 Comparison of Geometric Means and Standard Deviations of Annual 
Returns with and without Small Stocks, 1936-83: No-Leverage Case 
(Quarterly Portfolio Revision, 40-Quarter Estimating Period) 


A. 1936-47 
Without Small Stocks With Small Stocks 
Geometric Standard Geometric Standard 
Portfolio Mean Deviation* Mean Deviation* 
Common stocks 6.71 22.28 6.71 22.28 
Government bonds 3.46 3.62 3.46 3.62 
Corporate bonds 3.25 2.26 3.25 2.26 
Small stocks kan abe 14.65 41.68 
Risk free 24 .18 .24 .18 
Power 1 4.78 23.74 14.65 41.68 
Power .75 5.19 20.98 12.64 41.72 
Power .50 6.95 18.21 14.29 35.77 
Power .25 7.15 14.92 15.79 30.19 
Power 0 7.72 12.92 15.48 26.11 
Power — 1 6.18 8.31 9.88 14.29 
Power -2 5.27 6.04 7.67 9.87 
Power —3 4.79 4.96 6.54 7.71 
Power -5 4.31 3.96 5.43 5.63 
Power —7 4.03 3.39 4.94 4.66 
Power — 10 3.73 2.99 4.48 3.81 
Power -15 3.53 2.68 4.03 3.24 
Power —20 3.11 2.49 3.50 2.92 
Power — 30 2.52 2.13 2.78 2.51 
Power — 50 2.09 1.94 2.28 2.27 
Power ~75 1.74 1.65 2.07 2.14 
Portfolio A 2.74 2.19 2.74 2.19 
Portfolio B 3.78 5.45 3.78 5.45 
Portfolio C 5.13 10.58 5.13 10.58 
Portfolio D 5.47 12.63 5.47 12.63 
Portfolio E 5.75 14.71 5.75 14.71 
Portfolio F 5.97 16.76 5.97 16.76 
Portfolio G 6.59 20.13 6.59 20.13 
Portfolio H ae bate 8.75 23.70 
Portfolio I in zu 10.68 27.45 


Portfolio J n iud 12.36 31.32 
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TABLE 2 (Continued) 


B. 1948-65 


Without Small Stocks 


Geometric Standard Geometric 
Portfolio Mean Deviation* Mean 
Common stocks 15.67 14.54 15.67 
Government bonds 1.94 4.96 1.94 
Corporate bonds 2.52 4.11 2.52 
Small stocks e. eel 15.61 
Risk free 2.29 .94 2.29 
Power | 15.67 14.54 14.03 
Power .75 15.67 14.54 14.64 
Power .50 15.67 14.54 14.70 
Power .25 15.67 14.54 14.83 
Power 0 15.67 14.54 14.90 
Power -1 15.69 14.54 15.27 
Power -2 15.52 14.48 15.25 
Power —3 15.18 14.42 14.93 
Power —5 13.65 14.45 13.24 
Power —7 11.85 13.58 11.59 
Power — 10 9.48 10.91 9.35 
Power — 15 7.27 7.60 7.15 
Power —20 6.12 5.68 6.06 
Power — 30 4.98 3.73 4.95 
Power —50 3.92 2.19 3.91 
Power —75 3.39 1.46 3.38 
Portfolio A 2.26 3.56 2.26 
Portfolio B 5.00 3.02 5.00 
Portfolio C 8.41 5.99 8.41 
Portfolio D 9.75 7.46 9.75 
Portfolio E 11.09 9.01 11.09 
Portfolio F 12.44 10.63 12.44 
Portfolio G 14.37 12:90 14.37 
Portfolio H 14.44 
Portfolio I 14.47 
Portfolio J 14.46 
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With Small Stocks 


Standard 
Deviation* 


14.54 
4.96 
4.11 

19.36 

.94 

17.12 

17.14 

17.16 

17.04 

16.98 

16.21 

15.54 

14.72 

14.60 

13.79 

11.09 
7.81 
5.86 
3.87 
2.31 
1.56 
3.56 
3.02 
5.99 
7.46 
9.01 

10.63 

12.90 

13.55 

14.41 

15.46 
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TABLE 2 (Continued) 
C. 1966-83 
Without Small Stocks With Small Stocks 
Geometric Standard Geometric Standard 

Portfolio Mean Deviation* Mean Deviation* 
Common stocks 7.62 17.03 7.62 17.03 
Government bonds 4.13 9.9] 4.13 9.9] 
Corporate bonds 4.90 10.82 4.90 10.82 
Small stocks ee. ee 16.11 27.78 
Risk free 7.37 2.86 7.37 2.86 
Power I 7.38 12.03 14.97 27.29 
Power .75 6.65 11.33 13.06 25.72 
Power .50 6.40 11.09 12.85 24.50 
Power .25 6.34 10.85 11.86 24.30 
Power 0 6.35 10.39 11.48 23.98 
Power -I 6.84 9.07 10.24 22.37 
Power -2 7.08 7.79 10.01 19.65 
Power —3 7.18 6.22 9.89 15.98 
Power -5 7.28 4.58 9.23 10.88 
Power -7 7.31 , 3.86 8.84 8.35 
Power - 10 7.33 3,37 8.49 6.33 
Power —15 7.35 3.05 8.17 4.74 
Power ~ 20 7.36 2.94 7.99 4.00 
Power — 30 7.36 2.86 7.80 3.36 
Power —50 7.37 2.84 7.63 2.99 
Power —75 7.37 2.83 7.55 2.88 
Portfolio A 5.20 8.27 5.20 8.27 
Portfolio B 6.03 7.96 6.03 7.96 
Portfolio C 6.61 10.08 6.61 10.08 
Portfolio D 6.92 10.99 6.92 10.99 
Portfolio E 7.20 12.06 7.20 12.06 
Portfolio F 7.48 13.38 7.48 13.38 
Portfolio G —. — 7.44 15.62 7.44 15.62 
Portfolio H ee eel 9.41 16.86 
Portfolio I e ee 11.25 18.73 


Portfolio J ee ees 12.95 21.06 
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TABLE 2 (Continued) 
D. 1936-83 
Without Small Stocks With Small Stocks 

Geometric Standard Geometric Standard 
Portfolio Mean Deviation* Mean Deviation* 
Common stocks 10.34 17.64 10.34 17.64 
Government bonds 3.14 6.96 3.14 6.96 
Corporate bonds 3.59 7.12 3.59 7.12 
Small stocks ele. ee 15.56 28.66 
Risk free 3.64 3.40 3.64 3.40 
Power 1 9.74 16.69 14.54 27.97 
Power .75 9.57 15.63 13.54 27.45 
Power .50 9.93 14.65 13.90 24.97 
Power .25 10.11 13.65 13.94 23.11 
Power 0 10.11 13.03 13.75 21.78 
Power —1 9.91 11.77 12.01 18.14 
Power —2 9.70 11.10 11.35 16.06 
Power —3 9.50 10.56 10.89 13.93 
Power -5 8.86 9.97 9.74 11.63 
Power -7 8.15 9.12 8.87 10.24 
Power —10 7.21 7.33 7.79 8.11 
Power — 15 6.35 5.33 6.74 5.92 
Power — 20 5.82 4.33 6.13 4.79 
Power —30 5.24 3.52 5.46 3.82 
Power — 50 4.73 3.13 4.88 3.30 
Power —75 4.44 3.06 4.59 3.16 
Portfolio A 3.48 5.67 3.48 5.67 
Portfolio B 5.08 5.82 5.08 5.82 
Portfolio C 6.91 8.80 6.91 8.80 
Portfolio D 7.60 10.19 7.60 10.19 
Portfolio E 8.27 11.70 8.27 11.70 
Portfolio F 8.93 13.34 8.93 13.34 
Portfolio G 9.77 15.93 9.77 15.93 
Portfolio H 11.10 17.50 
Portfolio I 12.30 19.51 
Portfolio J 13.37 21.84 


* Standard deviation is for the variables in(1 + r,). 
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TABLE 3 Comparison of Geometric Means and Standard Deviations of Annual 
Returns with and without Small Stocks, 1934—83: Leverage Case 
(Quarterly Portfolio Revision, 32-Quarter Estimating Period) 


A. 1934-47 
Without Small Stocks With Small Stocks 
Geometric Standard Geometric Standard 
Portfolio Mean Deviation* Mean Deviation* 
Common stocks 8.59 22.42 8.59 22.42 
Government bonds 4.03 3.72 4.03 3.72 
Corporate bonds 4.42 3.61 4.42 3.61 
Small stocks ee e. 16.98 38.76 
Risk free 24 7 24 AT 
Power | — 5.86 56.91 11.57 88.59 
Power .75 5.30 40.76 19.95 48.99 
Power .50 9.09 38.63 25.95 35.10 
Power .25 9.19 34.75 23.43 31.01 
Power 0 10.67 31.30 22.58 28.25 
Power -1 10.19 22.95 17.47 24.83 
Power —2 9.32 17.86 13.94 20.89 
Power —3 8.83 14.91 12.33 17.41 
Power - 5 7.17 10.36 9.44 11.87 
Power -7 6.24 7.67 7.85 8.67 
Power —10 5.53 5.59 6.58 6.36 
Power -15 4.85 4.43 5.69 5.06 
Power -20 3.92 3.60 4.55 4.18 
Power — 30 2.99 2.64 3.39 3.02 
Power — 50 2.42 2.09 2.63 2.31 
Power —75 1.86 1.64 2.29 1.98 
Portfolio A 3.44 2.72 3.44 2.72 
Portfolio B 4.68 5.60 4.68 5.60 
Portfolio C 6.40 10.74 6.40 10.74 
Portfolio D 6.83 12.80 6.83 12.80 
Portfolio E 7.22 14.88 7.22 14.88 
Portfolio F 7.57 16.96 7.57 16.96 
Portfolio G 8.38 20.28 8.38 20.28 
Portfolio H e. e 10.63 23.01 
Portfolio I ee ee 12.64 26.09 


Portfolio J ee ee 14.40 29.40 
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TABLE 3 (Continued) 
B. 1948-65 
Without Small Stocks With Small Stocks 
Geometric Standard Geometric Standard 

Portfolio Mean Deviation* Mean Deviation* 
Common stocks 15.67 14.54 15.67 14.54 
Government bonds 1.94 4.96 1.94 4.96 
Corporate bonds 2.52 4.1 2.52 4.11 
Small stocks ee eel 15.61 19.36 
Risk free 2.29 .94 2.29 .94 
Power 1 24.70 24.79 23.40 28.61 
Power .75 24.70 24.79 22.92 28.01 
Power .50 24.70 24.79 23.34 27.60 
Power .25 24.70 24.79 23.54 27.49 
Power 0 24.70 24.79 23.75 27.41 
Power -1 22.79 25.18 21.93 27.25 
Power -2 20.17 23.67 19.49 24.30 
Power -3 17.97 22.31 17.79 22.82 
Power -5 15.04 19.35 15.08 19.73 
Power -7 12.38 16.07 12.49 16.28 
Power — 10 9.96 12.82 10.09 13.00 
Power — 15 8.00 9.45 8.17 9.62 
Power — 20 6.73 7.16 6.8] 7.37 
Power —30 5.36 4.71 5.39 4.88 
Power —50 4.17 2.73 4.19 2.87 
Power —75 3.56 1.77 3.57 1.90 
Portfolio A 2.26 3.56 2.26 3.56 
Portfolio B 5.00 3.02 5.00 3.02 
Portfolio C 8.41 5.99 8.41 5.99 
Portfolio D 9.75 7.46 9.75 7.46 
Portfolio E 11.09 9.01 11.09 9.01 
Portfolio F 12.44 10.63 12.44 10.63 
Portfolio G 14.37 12.90 14.37 12.90 
Portfolio H 14.44 13.55 
Portfolio 1 14.47 14.41 
Portfolio J 14.46 15.46 
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TABLE 3 (Continued) 
C. 1966-83 
Without Small Stocks With Smal! Stocks 

Geometric Standard Geometric Standard 
Portfolio Mean Deviation* Mean Deviation* 
Common stocks 7.62 17.03 7.62 17.03 
Government bonds 4.13 9.91 4.13 9.91 
Corporate bonds 4.90 10.82 4.90 10.82 
Small stocks ee ee 16.11 27.78 
Risk free 7.37 2.86 7.37 2.86 
Power | 9.22 15.82 13.26 33.56 
Power .75 10.16 14.98 11.80 34.55 
Power .50 9.91 13.58 10.11 32.42 
Power .25 9,74 11.93 11.54 31.12 
Power 0 9.25 11.44 12.17 30.16 
Power —1 9.32 10.42 10.87 25.07 
Power ~—2 8.96 8.68 10.51 19.69 
Power -3 8.41 7.24 10.33 16.11 
Power —5 8.05 5.46 9.54 11.13 
Power —7 7.89 4.58 9.08 8.61 
Power — 10 7.15 3.92 8.66 6.58 
Power -15 7.64 3.45 8.29 4.97 
Power —20 7.58 3.25 8.08 4.21 
Power —30 7.51 3.07 7.86 3.53 
Power —50 7.46 2.96 7.67 3.11 
Power —75 7.43 2.92 7.58 2.97 
Portfolio A 5.20 8.27 5.20 8.27 
Portfolio B 6.03 7.96 6.03 7.96 
Portfolio C 6.61 10.08 6.61 10.08 
Portfolio D 6.92 10,99 6.92 10.99 
Portfolio E 7.20 12.06 7.20 12.06 
Portfolio F 7.48 13,38 7.48 13.38 
Portfolio G 7.44 15.62 7.44 15.62 
Portfolio H ee ee 9.41 16.86 
Portfolio I ee ee 11.25 18.73 


Portfolio J en ee. 12.95 21.06 
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TABLE 3 (Continued) 


D. 1934-83 


Geometric 

Portfolio Mean 
Common stocks 10.73 
Government bonds 3.31 
Corporate bonds 3.90 
Small stocks ees 
Risk free 3.51 
Power | 9.89 
Power .75 13.74 
Power .50 14.78 
Power .25 14.75 
Power 0 14.99 
Power -1 14.25 
Power -2 12.97 
Power -3 11.88 
Power —5 10.26 
Power -7 9.01 
Power — 10 7.91 
Power —15 6.98 
Power -20 6.24 
Power — 30 5.45 
Power -50 4.84 
Power —75 4.45 
Portfolio A 3.64 
Portfolio B 5.28 
Portfolio C 7.19 
Portfolio D 7.91 
Portfolio E 8.59 
Portfolio F 9.27 
Portfolio G 10.16 
Portfolio H 

Portfolio I 

Portfolio J 


Without Small Stocks 
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With Small Stocks 


Standard 
Deviation* 


17.84 
6.88 
7.14 
3.40 

35.87 

28.00 

26.70 

24.96 

23.59 

20.67 

18.09 

16.31 

13.39 

10.87 
8.57 
6.48 
5.19 
3.97 
3.27 
3.12 
5.62 
5.81 
8.89 

10.31 

11.85 

13.50 

16.12 


* Standard deviation is for the variables In(f + rj). 


Geometric Standard 
Mean Deviation* 
10.73 17.84 
3.31 6.88 
3.90 7.14 
16.17 28.22 
3.51 3.40 
16.32 52.70 
17.98 36.62 
19.09 31.49 
19.05 29.63 
19.13 28.44 
16.6] 25.62 
14.64 21.60 
13.53 18.95 
11.47 14.88 
9.95 11.87 
8.58 9.28 
7.51 6.97 
6.63 5.61 
5.71 4.23 
4.99 3.41 
4.63 3.16 
3.64 5.62 
5.28 5.81 
7.19 8.89 
7.91 10.31 
8.59 11.85 
9.27 13.50 
10.16 16.12 
11.54 17.51 
12.79 19.39 
13.90 21.60 
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strategy, for example, had an uninterrupted presence in small stocks 
from the beginning of 1942 through the first quarter of 1952, from the 
fourth quarter of 1963 through the second quarter of 1974, and again 
from the third quarter of 1980 on. 

Table 4 gives a comparison of the quarter-by-quarter portfolio com- 
positions and returns for powers 0 and — 15 with and without small 
stocks in the leverage case. For each power, the first five columns 
show the proportions invested in common stocks, government bonds, 
corporate bonds, small stocks, and the risk-free asset. The sixth col- 
umn reports the fraction in borrowing and the last column the port- 
folio's return for the quarter. This is then repeated with small stocks 
excluded. 

Turning first to the logarithmic investor, we note that he stayed away 
from small stocks in the first quarter of 1934, from the beginning of 
1953 through mid-1960, during the second quarter of 1961, and from the 
beginning of 1974 through mid-1977 with the exception of three quar- 
ters. During these periods, portfolio holdings and returns were thus 
unaffected by the opportunity to invest in small stocks. In the 1930s, 
small stocks tended to replace holdings in governments and, to a lesser 
extent, common stocks. In the 1940s and early 1950s, and again in the 
1960s and early 1970s, it was principally common stocks that were 
being crowded out by small stocks for the logarithmic investor. In the 
late 1970s through the third quarter of 1982, it was primarily the risk- 
free asset that gave way for small stocks, followed by common stocks 
one more time. 

The logarithmic investor's use of leverage was remarkably similar 
with and without the small stock category. Its presence did not always 
increase leverage; while they markedly did so in 1982 and 1983, small 
stocks sharply reduced the use of leverage in 1946 and 1947. 

Turning now to the rather conservative power — 15 strategy, we first 
Observe that this investor was a heavy and repeated user of the risk- 
free asset beginning with the third quarter of 1962. Prior to the second 
quarter of 1951, however, the risk-free asset was practically ignored. In 
fact, from the beginning of 1940 through the third quarter of 1946, the 
power — 15 strategy was a consistent user of leverage, with holdings of 
government bonds on margin. 

Small stocks first entered the power — 15 portfolio in the last quarter 
of 1939, growing gradually in importance to a 19% allocation in the first 
quarter of 1950, then declining and disappearing as of mid-1951, 47 
quarters later. As table 4 shows, these holdings were primarily at the 
expense of common stocks. After a nearly total hiatus during the rest of 
the 1950s, small stocks reappeared for 55 consecutive quarters begin- 
ning with the second quarter of 1960, reaching a maximum allocation of 
30% in the second quarter of 1962, in a streak that lasted through 1973. 
These positions were also essentially at the expense of common 
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stocks, except in the second half of 1973, when they replaced the risk- 
free asset. When the power — 15 investor returned to small stocks at 
the beginning of 1978, they first replaced principally investments in the 
risk-free asset and, beginning in the last quarter of 1982, holdings in 
common stocks. 

Many investigators have noted that small stocks have, over long 
periods, produced what is usually referred to as excess risk-adjusted 
returns (e.g., Banz 1981; and Reinganum 1981). This suggests that the 
empirical (past) distribution used in our model should, at a minimum, 
have found small stocks a consistently attractive outlet. While this was 
obviously the case most of the time, small stocks were, as noted, 
totally ignored in the middle and late 1950s and for intervals in the 
middle 1970s even by the more risk-tolerant strategies. 


Other Results 


We also examined the case in which portfolios were revised only once 
a year rather than quarterly, with 8- and 10-year estimating periods. As 
in the previous studies, the differences in returns and portfolio compo- 
sitions were fairly small. The use of 24-quarter and 40-quarter estimat- 
ing periods also led to only minor differences in the results. 


VI. Tests 


The returns under quarterly reinvestment with leverage in the presence 
of small stocks over the 1934—47 subperiod are shown as round dots in 
figure 1, while the returns over the full 1934—83 period are similarly 
depicted in figure 2. The same figures also plot the returns for the 
higher powers in the absence of small stocks (see diamonds) as well as 
for the fixed-weight portfolios in table 1 (see triangles). The geometric 
means of the annual returns are measured on the vertical axis and the 
standard deviations of In(1 + r,) on the horizontal. 

The attained returns reflect clearly the benefits from diversification 
among the five (or four) asset categories (represented in the figures by 
the square points RL [risk-free asset], GB [government bonds], CB 
[corporate bonds], CS [common stocks], and SS [small stocks]). In 
other words, the portfolios selected by the model have enabled inves- 
tors to travel in a distinctly "'northwesterly" direction, confirming the 
results of the previous studies. 

Recall that terminal wealth wo in terms of beginning wealth w, is 
given by 


wo = Wl + ral no)... o ry) 


= w, exp [> In(1 + rol. 


t=1 
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TABLE 4 Portfolio Composition and Realized Returns for Powers 0 
and —15 with and without Small Stocks, 1934-83 
(Quarterly Revision, with Leverage, 32-Quarter Estimating Period) 


Power 0 
With Small Stocks Without Small Stocks 
Inv. Fractions Inv. Fractions 

Period CS GB CB SS RL B n CS GB CB RL B r, 

1934:1 .29 4.74 134 ... ... -5.37 33.21 .29 474 134 ... —5.37 33.21 
1934:2 -32 5.23 1.16 .04 ... —5.75 15.46 .38 5.29 1.13 ... -—5.80 15.52 
1934:3 .04 5.78 1.10 .14 ... -6.07 -17.50 .25 5.98 1.01 ... -623 -17.65 
1934:4 ... 4.39 1.53 13 ... —5.05 18.89 .14 4.61 1.46 ... —5.21 18.56 
1935:1 ... 426 1.50 II ... -48 14.94 .03 4.48 1.54 ... -5.05 17.56 
1935:2 ... 4.82 1.42 .05 ... -5.29 795 ... 492 1.45 ... -§.37 7.62 
1935::3 ... 628 103 .03 ... -634 -5.75 ... 634 105 ... -6.39 -6.27 
1935:4 ... 5.06 1.31 08 ... -S.44 10.30 ... 5.23 1.36 ... -5.59 7.91 
1936:41 ... 2.73 1.85 .18 ... -3.76 15.90 .10 3.02 1.87 ... —3.98 10.90 
1936:2 ... 3.34 1.59 .24 ... —4.18 —.71 .M 3.9 1.64 ... -—4.53 4.03 
1936:(3 ... 3.09 175 .17 ... -402 7.84 .08 3.42 1.77 ... -4.28 5.98 
1936:4 ... 4.90 125 .16 ... -5.31 15.04 .04 5.29 1.30 ... -35.62 12.61 
1937:1 ... 5.27 1009 .17 ... -5.522 -18.69 ... 5.69 1.23 ... —5.92 -—24.08 
1937::2 ... 5.84 .80 .25 ... -5.88 -475 ... 632 105 ... -637 3.19 
1937:33 ... 498 LIS .18 ... -5.31 —2.24 ... $.33 1.33 ... —5.66 1.95 
1937:4 ... 5.60 103 .14 ... -5.77 6.40 ... $.88 1.18 ... -6.06 12.40 
19381. ... 403 L50 .18 ... -4.1 -5.63 ... 4.54 1.56 ... —5.10 dM 
1938:2. ... 4.47 1.53 .04 ... -5.05 15.33 ... 4.59 1.54 ... -544 13.11 
19383 ... 435 1.35 .24 4... -4.93 2.90 ... 5.07 1.44 ... -—5.8 2.75 
1938:4 ... 4.13 1.35 .28 ... -477 12.1). ... 4.98 1.43 ... -5.2 7.97 
1939:1 ... 446 122 .39 ... -477 -189 IAS 5.15 122 ... -5.52 9.92 
1939:2 ... 623 86 .19 ... -628 14.67 ... 680 .92 ... ~6.71 15.88 
1939:3 ... 632 .78 .24 ... -6.344 ~30.65 .04 7.02 80 ... —686 -48.79 
1939:4  ... 5.59 .47 .69 ... -5.75 32.70 .79 567 .33 ... -5.80 37.90 
1940: ... 728 ... .68 ... -—6.96 20.44 .67 732 ... es. =~ 6,99 9.62 
1940:2 ... 7.04 ... .74 ... -6.78 -2820 75 701 ... e. —6.76 -21.87 
1940:3 ... 684 ... .79 ... -6.63 15.40 1.03 5.89... e. — 5.92 16.87 
1940:4 ... 721 .03 .67 ... -691 20.70 .75 625 421 ... -6.22 16.92 
94:1 ... 684 ... .79 ... -6.60  -889 .88 648 ... ... -636 -1280 
194::2 ... 674 ... S82 ... -—6.55 12.38 1.00 6.00 ... e. 6.00 11.50 
1943 ... 7.88 ... 530... —7.41 6.58 .48 671 .39 ... -—6.58 1.65 
1941:4 ... 748 ... 664 ... -7.09 ~22.65 .75 7010 ... ... -6.6 -1781 
1942:1 ... 798 ... 50 1... -7.49 14.96 .45 821 ... es ~ 7.66 7.46 
1942:2 ... 811 ... 47 ...  —7.58 -.,10 41 813 .06 ... -7.61 2.65 
1942: ... 786 ... .84 ... -7.399 12.18 60 7.660 ... ... ~7.20 5.74 
194X4. 0... 7.56 ... 61 .. 717 542 .61 7.56 ... es 7747 6.86 
1943:1 ... 7.43 ... 64 ... ~7,07 41.32 .75 699 ... ee —6.75 14.04 
1943:2 ... 628 ... 93 ... -621 23.69 1.12 $53 ... es 25.65 12.80 
1943::3 ... 616 ... 96 ... -6]2 -7.91 1.00 598 ... es. —5.99 -2.25 
1943:4 ... 661 ... 85 ... -—6.46 -,82 .81 677 ... es -6.58 —3.29 
1944:1..... 693 ... .77 ... -669 1542 .69 725 ... .... 6.93 4.74 
1944:2 ... 704 ... T4 ... —6.78 11.72 68 727 ... e. -6.95 6.68 
1944:3 ... 6.45 ... 89 ... -—64 1.72 .78 688 ... e. —6.66 1.60 
1944:4 ... 666 ... 840... —649 12/71 N 712 ... e. — 6.84 5.99 
1945:1 ... 6.68 ... S83 ... -6.51 1637 .67 639 427 ... ~6.32 14.14 
1945:2 ... 7.08 ... .58 ... -6.66 40.05 .39 807 ... ... -745 32.01 
1945:3 ... 5.9 ... 56... —5.36 4 27 799 ... e. -226 -1.76 
1945:4 ... 476 ... .70 ... -—4.46 33.27 .38 7.16 ... ree — 6.54 30.68 
1946:1 ... 283 ... 96 ... —2.79 11.22 47 644 ... es. 75.91 3.77 
1946:2 ... 1314 ... .87 ... -LIS 5.38 .37 6.26 ... es 65.64 —6.59 
1946:3 ... 1.49 ... 85 ... -1.344 -26.30 .39 614 ... es —5.53 — 19.88 


1946:4  ... JS ... .93 ... 4-67 181 .50 $500 ... ... -450 7.38 
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TABLE 4 (Continued) 
Power — 15 
With Small Stocks Without Small Stocks 
Inv. Fractions Inv. Fractions 
CS GB CB SS RL B r, CS GB CB RL B r, 
-15 .83 .02 5.83 45 .83 .Q 5.83 
1S 85 3.52 AS  .85 3.52 
14 .86 — .04 14 86 —.04 
.08 .92 3.36 08 .92 3.36 
.05 95 3.95 .05  .95 3.95 
1.00 2.68 1.00 2.68 
1.00 .69 1.00 .69 
1.00 1.95 1.00 1.95 
1.00 2.20 1.00 2.20 
1.00 1.49 1.00 1.49 
1.00 1.46 1.00 1.46 
1.00 1.44 1.00 1.44 
... M00 —1.36 .. 100 — 1.36 
.05 .95 1.58 .05  .95 1.58 
1.00 47 1.00 47 
1.00 2.02 1.00 2.02 
1.00 -.39 1.00 —.39 
eA 1.00 2.45 s 1.00 2.45 
.01 .99 1.55 Ol  .99 1.55 
.03 97 2.38 .03  .97 2.38 
.. 1.00 1.08 ... 1.00 1.08 
ll .89 1.61 A .89 1.61 
AS B85 6. 0. -3.11 AWS 85 ... -3.11 
14 81.02 03 e 4.07 03 .95 02 ee 3.98 
2.64  .04 — 1.68 2.92 2.42 -1.42 2.17 
2.70 ..05 -1.75 ~184 ... 2.46 — 1.46 — .60 
2.79 ..06 - 1.85 2.83  .06 2.69 —1.74 2.83 
2.79  .06 —1.85 1.74 .06 2.72 — 1.78 1.59 
2.79 ..06 -1.85 -1.17 .07 2.67 -1.74 — 1.44 
2.78 .06 — 1.85 4.44  .08 2.63 -1.71 4.26 
2.79  .06 —1.85 4.0  .08 2.68 -1.76 3.44 
2.78 .07 -1.85 -3.92  .09 2.67 -].75 —3.46 
2.72 .06 -1.77 1.31 .05 2.43 —1.47 .45 
2.69 .06 —1.74 Ji 0 2.35 -1.39 .92 
2.58  .06 — 1.64 238  .04 2.27 —1.31 1.39 
230 .07 -1.77 1.40 ..06 2.34 — 1.40 1.49 
2.61 .07 ~ 1.68 5.53.07 2.25 — 1.33 2.43 
2.54 .08 ~ 1.62 4.52  .09 2.16 -—1.25 3.28 
2.51  .08 — 1.58 -21 .09 2.13 -1.22 21 
2.43 .07 -1.50 -26 07 2.09 — 1.16 —-.36 
2.26 07 -1.32 2.80 05 1.92 —.98 1.69 
2.21  .06 ~1.27 1.67 .05 1.85 —.90 VAL 
2.43 .07 -1.21 1.25  .06 1.78 -.83 1.10 
2.00 .07 —1.14 4.80  .05 1.71 —-.76 3.59 
2.41 .07 -1.17 2.75 %4 1.75 —.80 2.24 
e. 2.67 .06 -1.73 1.88 05 2.29 —1.33 73 
.2] 2.24  .08 —-1.53 30.07 2.20 —1.27 56 
24 222 0 — 1.56 6.73 10 ... 220 ~ 1.30 4.56 
J5. 425. 43 —1.13 3.82 .14 26 1.68 — 1.09 3.69 
33 2.4 /— .16 — 1.80 a 21 .05 225 ~tS5l —.29 
.2} 2.35  .16 -1.71 -8.51 20 ... 228 ~ 1.48 —7.35 
59 29  .12 441 .n 20 .78 —.09 1.53 
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TABLE 4 (Continued) 


Power 0 


With Small Stocks Without Small Stocks 


Inv. Fractions Inv. Fractions 


Period CS GB CB SS RL RB RP CS GB CB RL RB RP 


19471 ... 147 ... .885 ... = 1.32 —.03 .47 530 ... ... -47] -.89 
19452... ... ... 433 ... -.33 -14.08 .96 2.78 ... ... =2,74 30 
1947:3 ... ... ... L33 ... -.33 11.44 1.12. E19... s... -1,37 1.04 
1947:4 ... .97 ... 120 ... -118 -.60 ... 10.000 ... ... -9.00 -45.20 
1948]. .... ... ... 133 ... -.33 -0.61 1.33 ... e 5 7433 — .46 
19482. ... ... ... 133 ... -.33 20.06 1.33... es ss = BB 16.55 
19483. ... ... ... 133 ... -.33 —14.46 1.33 ... ee ss. —.33. - 8.54 
19484. ... ... ... 133 ... -.33  -634 1.33 ... es ss BS —.01 


1949) ... ... ... L33 ... -.33 3.81 133  ... ee ss. 7433 59 
949:2 ... ... ... 200 ... -1.00 - 20.04 200 ... es 5. -100 -9.04 
1949:3 ... ... ... 200 ... 1.00 28.94 2.00 ... es. -100 22.72 
1949:4  ... ... ... 200 ... -1.00 23.58 2.00 .... ess. -1.00 19.98 


1950:1  ... ... ... 200 ... -1000 13.00 2.00... es 6. = £00 8.80 
19502. ... ... ... 200 ... -100 -372200 ... es s. -].00 7.66 
19503 ... ... ... 200 ... -1.00 34.00 2.00 ... es 7 1,00 23.20 
1950:4 ... ... ... 200 ... -100 24.44 2.00 |... es 5. -1.00 15.14 


95:1. ... ... ... 200 ... -100 6.77 2.00... es e. 1.00 11.95 
1951r2.... ... .. 133 ... -.33 -7.01 133 ... es sss 7.33 -.77 
195:3. ... ... ss. 8, -.33 16.22 133  ... ee s 7.33 16.24 
195:4  ... ... ... 133... -.33  -375 133 ... ees es 7.33 5.27 


1 ... s. E33 -.33 0 1.33 ees —.33 4.94 
1952:2...19 ... ... LI ... -.33 -2.14 133 ... ee 0s 33 5.24 
1952:3 1.25 ... ... 009 ... -.33 -1.03 133 ... ee ss 7.33 = 1.03 
1952:4 1.28 ... ... 05 ... -.33 12.76 1.33 ... ee s 7.33 12.99 


19531 133 ... ... ... ee —-.33  -514 133 ... ee ss 0.33 - 5.44 
1953:2 2.00 ... ... ... ... -100 -691200 ... es. -100  -691 
1953:3 2.00 ... ... ... ... -V400 -524 200 ... es e. 100 -524 
1953:4 200 ... ... ... ... = 1.00 15.18 2.00... e. -1400 15.18 


1954: 2.00 ... ... ... ... - M00 18.93 2.00 .... es 5. 7 4.00 18.93 
1954:2 2.00 ... ... ... ... = 1.00 18.80 2.00... e 0 -L100 18.80 
1954:3 2.00 ... ... ... ... -100 22.48 2.00 .... e. 0. -4.00 22.48 
1954:4 2.00 ... ... ... ... -400 25.00 2.00 |... es s 14.00 25.00 
1955:1. 1.62... s. use ss -.67 3.77 1.67... es s 7467 3.77 
1955:2 1.67 ... ... ... -.67 21.53 1.67 ... es e. 67 21.53 
1955:3 1.43... ee s.s ss -.43 10.00 1.43... es s AB 10.00 
1955:4 1.43 ... ... 5. 4 — .43 7.45 143. ... ees 443 7.15 


1956:1 L43 ... ... s ss -.43 10.43 1.43... es s E 10.43 
1956:2 1.43 ... ... wee eee —.43  -3.566 143 ... es s.  —443 = 3.56 
1956:3 1.43 ... ... ... ... -.43  -432 1.43... e. 5e 43  -432 
1956:4 1.43 ... ... ... ee ~ 43 4.94 1.43... ee s 7443 4.94 


1957: 1.43 ... ... 5... 2 -.43 -7.06 143 ... e. s 7.43  -7.06 
1957:2 10.43 ... ... ss ss — .43 11.49 1.43. ... es se 7.43 11.49 
1957:3 1.43 ... ... ss ss -.43 -1431 443 ... - e. 0-43 -1431 
1957:4 1.43 ... s sss ss -.43 -7.34 143 .... e. s. 43  -731 


1958:1 1.43 ... ss sss ss -.43 8.54 1.43 e s. AB 8.54 
1958:2 22.00 ... ... ss. ss. 71.00 15.87 2.00 .... es 5 -100 15.87 
19583 2.00 ... ... ... ... 1.00 22.48 2.00 .... e s -1.00 22.18 
1958:4 1.43 ... sss ss -.43 15.59 1.43... ee e. 0-44 15.59 
1959:1 1 

1959:2 |. TL - e eee . 

W959:3 VAP ... 5 6 6 =.  -234 MM... ess cM — 2.34 
1959:4 | 

1960:1 1 

1960:2 I 
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TABLE 4 (Continued) 


Power — 15 


With Small Stocks Without Small Stocks 


Inv. Fractions Inv. Fractions 


CS GB CB SS RL B r, CS GB CB RL B n 


74 AS 1200... ee 40 12 38 SE e .55 
34 52 C ee —1.24 45 45 70 ... - 60 
.08 7 043 o ee -.7  d6 ... 84 ee -1.44 


. ee ee 1.51 A3. .30 58 ... ee 1.30 

87. B .. ee 1.58 13 11 76 ... ee 1.48 
1.79. 25. 40 35 ... e — 1.49 

125 21 29 50 ... ee 1.66 

l 


AB ..221 ... Y ee 18 
. ee ee -.31l 22. 19 .59 ... ee 30 
85 I5 ee 3.53 20 24 555 ... ee 3.59 
1.9. 21 .37 42 ... ee 2.29 
l 


03. 17 6t 9 ... ee 8. 341 52 Ts. ee 1.43 
320 /—.. 70 H ee ee 20. .37 04 .59 ... ee 1.58 
A9 0. 40. .M ee ee 4.61 36 40 053 ... ee 4.70 
325/45 68 07 ee 336 37 OS .58 ... ee 3.31 


48 / ... Spb o. ee -27 37 ... 63 / ... e 67 
38  .01 .03 0l .57 ee -.05 40 .03 .02 .56 e - 0l 
x .62 ee 4.93 38 ... ... .62 - 4.93 
40 .. ss ss .60 ee 1.91 A0 7. sss .60 ee 1.91 


4207.20 4 ss. .58 ee 1.88.42 ... ... .58 ee 1.88 
2.2 s sss .58 ee 1.97. 42 ... ... .58 e 1.97 
LS 59 ee 05. 5 ... 59 e .05 
d 0. ss sls .59 ee 433 41 ... ... 59 ee 4.33 
dl 2. us sse .59 es -118 .4l ... ... 59 e -1.18 
Ee 0.5. s .62 ee -208 38 ... ... -62 ee -.78 
BS ek 0s. ss. .65 ee -38 35 ... ... .65 ee -.38 
I .66 ee 2.96 34 ... ... .66 ee 2.96 


x ee .66 ee 361 .34 ... ... .66 ee 3.61 
35 0... 9  ... 57 ee 3.57 .35 ... .09 57 ee 3.57 
35... 25 .39 ee 4.48  .35 ... .25 .39 ee 4.48 
92 08 a’ a‘ ee 11.98 .92 ... 08 ... ee 11.98 


92 08 /... ee 2.40 .92 ... 08 ... ee 2.40 
93... 2. .07 ee 12.350 93 ... ... 07 ee 12.35 
i) us usn .06 ee 6.87 94 ... ... .06 ee 6.87 
92... 2. a .08 ee 499 .92 ... ... .08 ee 4.99 


92.2.0 0.5.2.4 .08 wee 709 92 ... ... .08 ee 7.09 
D 4... .06 ee -1.97 .94 ... ... .06 ee -1.97 
90... 2. s .10 ee -2.31 Aea .. 10 e -2.31 
100 ... ... s. ss ee 3.86 100 ... ... ... ee 3.86 
1.00... 0... sss ss ee -4.53 100 ... ... ... ee —4,53 
90 0... ee 10 ee 7302 90 ... ... 10 ee 7.72 
98. ... ... y ... ee -9.8 $00 ... ... ... ee -9.60 
65/2. us. .35 ee -2.75 65 ... ... 35 ee -2.75 


IJ .39 ee 413. .6b] ... ... .39 ee 4.13 
69 ... 326 /... .05 ee 629  .69 ... .26  .05 ee 6.29 
J0 7... 30 ... .. ee 647 70 ... 30 ... ee 6.47 
61 7.2. 0... 39 ee 718  .61 ... ... .39 ee 7.18 
60 0. 2. 4 39 ee 1.02. 6b... ... .39 ee 1.02 
59... s. ss 4) ee 4.02 59 ... ... E ee 4.02 
600.2. 5.0. .39 ee -.88 66 ... ... 39 ee —.88 
53.00 0.9.0. 47 e 3.74  .53 ... ... 47 ee 3.74 


327.50 6. .48 M -299 .52 ... ... .48 DE — 2.99 
43 7... ../— 04 52 vee 220 47 ... ... 53 e 2.20 
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TABLE 4 (Continued) 


Power 0 
With Small Stocks Without Small Stocks 
Inv. Fractions Inv. Fractions 


Period CS GB CB SS RL B n CS GB CB RL B n 


1960(13 .79 ... ... .32 ... -.]1 -5.66 11l ... es ss G -5.93 
1960:4 90 ... ... S ... ~ 43 9.92 1.43... e s 43 13.11 
196t:1 113 ... ... 30 ... -.43 20.71 1.43... es s 443 17.71 
1961:2 143 ... ... ... ... -.43 —.48 143 ... e s. 43 —.48 
1961:3 88 ... ... .55 ... -.43 1.92 1.43 ... es s —.43 5.09 
1961:4  .89 ... ... .54 ... -.43 11.86 1.43 ... e e 3 10.95 


1962:1 ... ... ... L43 ... -.43 491 L43 ... es e c3  -3.59 
1962:2 ... ... ... 143 ... —.43 -3428 143 .... ee e. 7.43 = 30.05 
1962:3 .... ... ... 143 ... -.43 4.34 1.43 ... TELE 4.67 
1962:4 es s. 200 ... -100 13.26 2.00... es 5. -1.00 25.18 


1963:1 164 ... ... 36 ... =1.00 12.98 200 ... e. s c1.00 11.30 
1963:2 1.76 ... ... 224 ... 1,00 896 200 ... ex 5e -100 8.64 
1963:3 [34 ... ... 466 ... — L00 6.32 2.00... ex 5e 1.00 6.82 
1963:4 ... ... ... 200 ... -1.00 20 2.00... es 5. -1.00 9.84 


196: ... ... ... L4 ... -.4 12.01 1.43 ... e s 83 8.07 
1964:2 ... ... ... 1443 ... -.43 5.38 143  ... e e. AB 5.41 
1964:3 ... ... ... 143 ... -.43 10.62 1.43 ... e se c3 4.81 
1964:4 .,... ... ... 143 ... —.43 .87 143. ... es s 443 1.67 


1965:1 ... s. ss L4 s. -.43 16.55 1.43 ... ess —43 2.75 
1965:2 ... ... ... L43 ... —.44 -7.92 143 ... e. 6  -43 2,94 
1965:33 6... ... s. 143 ... -.43 20.17 1.43... ex s 443 10.40 
1965:4 ... ... ... 10143... -.43 22.89 1.43 ... e e 43 4.60 


1966:0  ... ... wee 143 ... — 43 11.86 1.43... es e. 7-430 4,57 
1966:2 ... ... ... 143 ... —.43 -10.6 143 ... e e 43 6.77 
1966:3 6... ... ... 143 ...  -.433 -18.2 0143  ... es s  —-43 13.41 
1966:4 ... ... ... M3 — 43 5.61 L17 ... ex s TIT 6.68 


1967:1] ... ... s. E43 e. —.43 43.99 1.00... OC 13.21 
19652. ... ... ... M43 ... -.43 16.72 1.43 ... ese —3 1.13 
1967:3  ... ... ... 143 ... -.43 22.06 1.43 ... es s 443 10.02 
1967:4 ... ... u.s 0143. ... —.43 9.98 1.43 ... es e $ 3 12 


1968:1 ... ... ... L43 ... 43 -1031 143 ... es 5e 743  -892 
1968:2 ... ... ... 143 ... -.43 36.96 1.43 ... e s 43 15.26 
1968:3 ... ... ... M25 ... 25 7.14 125  ... es s 75 4.39 
1968:4  ... ... ... 125 ... -.25 10.34 125 .... ex DS 1.99 
1969:1 .... ... ... 125 ... —.25 -10.37 125  ... es 5e -.25  -238 
1969:2 ... ... ... 125 ... -.35 -8.76 100 ... TORUM ee —3.00 
1969:3 ... ... ... 1425 ... —.25 -8.92 1.00 ... eres -3.2 
1969:4 ... ... ... 125 ... -35 -89 J72 ... ee 28. ... 27 


71 ... ... ... 425 ... -25 -74A2 ... ‘a e. 000 .. 1.98 
1970:2 2... ... ... 125 ... ~.25 -41.44 .50 ... ee 50... —8.20 
1970:3 ... ... ... 105 ...  -.05 29.38 75 ... ee 325. ... 13.01 
1970:4 ... s. s. 4137... 7 66 100 ... RP 10.43 


7b oo... ss sss. M45 n —-.45 37.53 1.00 ... eren e 9.69 
197:2. ... ... s. 54 v.n. -.54 -11.34 1.54  ... ee 54 —.6l 
197:3. ... ... s... E44 v... —.44 -—421 001 ... es s OL —.61 
197:4. ... ss sss 435 s 7.35 1.27 1.00... eren ee 4.66 
1972:1 ... ... e... 47 ... —.47 18.4 L19  ... e. e.  -—.9 6.54 
1972:2 0 ... s. LSE v... -.5$1 -634 117 ... e 4e dT 53 
1972:33 ... 0 wae sss 3T —.37 -8.46 1.00 ... o ee 3.91 
1972:4 ... s. sss LR -.2 1.87 1.00 .... MORE ee 7.56 


1973:1 ... ... ... 109 ... .-.0 -1527 100 ... een ee — 4.89 
1973:2 ... ... ... 5 215 ...- -AL86 Rs. ee 38  ... — 2.98 
19733 ... ... ... .32 .68 ... 721 ...  .. e. L00 2.00 
19734 |... ... ... 48 SD, -820 ... ... e. 400 ... 1.79 
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TABLE 4 


CS 


.40 
-36 
37 
53 
47 
.47 


-41 
31 
10 
10 
.20 
22 
12 
.03 
.08 
.04 
.05 
.06 
.03 
.03 


03 


01 


With Small Stocks 


.07 
A3 
ae 
.16 
.09 
.04 


.03 
aT 
23 


.30 
23 
42 
.46 


(Continued) 


Inv. Fractions 


SS 
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Power -15 
Without Small Stocks 
Inv. Fractions 
RL [^ CS GB CB RL ri 
.25 -190 .52 e AS 33 -2.03 
.06 4.86 51 .30 19 5.58 
.01 9.29 52 .34 Ad 6.95 
e S8  .42 .02 
2.44  .60  .40 2.88 
5.05 62  .38 ee 4.75 
1.27 .63  .36 .01 -.07 
e -13.30 .60 39 ... E — 12.18 
70 1.49  .24 .07  .70 1.54 
.64 2.94 24 .13 64 3.76 
.66 2.21 23 JM 66 2.06 
.60 1.74 .24 .16 .60 1.7] 
69 1.43 .22 .10  .69 1.48 
.76 96  .20 04  .76 1.80 
.80 2.49  .20 ee .80 1.90 
.78 1.52 20 .03 78 1.54 
.68 2.48 22 JAM 67 1.54 
-53 1.06 .25 25 50 1.18 
46 324 25 .32 44 1.32 
53 -.$7 26 22 .52 13 
36 3.58 .24 39 437 2.20 
.26 373 M .4d4 25 .67 
21 3.35 29 71 .02 
22 -106 .27 33 -.29 
25 -245  .22 78 -1.02 
.82 1.90  .13 .87 1.96 
.83 6.43  .14 .86 2.84 
.79 3.5  .19 81 1.02 
79 4.19 17 83 2.18 
77 2.60  .19 .81 1.04 
.78 —.50 16 .84 .16 
78 6.74  .16 .84 2.89 
77 2.44 7 .83 1.79 
5 3.5 21 79 1.46 
76 -0]7 a7 .83 1.03 
79 —.13 12 .88 98 
82 .23  .08 .92 1.28 
.83 33  .05 95 1.66 
.87 1.08. ... 1.00 1.98 
.87 -2.8  .03 97 1.00 
90 4.45  .05 .95 2.35 
.88 1.43 11 .89 2.50 
.87 4.28 AS 85 2.44 
.86 —.13  .18 .82 .81 
.89 .94 — 42 .88 1.13 
.89 113. .13 .87 1.56 
88 2.34  .16 .84 1.62 
.88 37 (1S .85 89 
89 27 42 .88 1.36 
91 124 10 .90 1\82 
92 .05  .10 .90 E 
.95 72 .04 .96 1.28 
.98 2.33 1.00 2.00 
97 1.16 1.00 1.79 
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TABLE 4 (Continued) 


Power 0 


With Small Stocks Without Small Stocks 


Inv. Fractions Inv. Fractions 


Period CS GB CB SS RL RB RP CS GB CB RL RB RP 


1974:0 ... ... ... ... M00 ... 1.94 1.00 1.94 
1974:2 ... ... ... ... «00 ... 206 ... ... e. M00 .. 2.06 
1974:3 ... ... ... ... M00... 194 .... ... e. LOO .. 1.94 
197444. ... ... ... ... K00 ... 1.81 1.00 1.81 
1975:10  ... s. sss s. M00. n, 1.62 1.00 1.62 
19752. ... ... ... .02 .98 ... 72 ... 04, e. E00 ... 1.42 
1975:3 ... ... ... .08 92 ... 68 03 ee -9 1.17 
1975:4 ... s. ss ss. M00. a 132. .... ... e. 00 ... 1.52 


19761 ... ... .62 ... .38 ... 3.07 ... ... 62 38 ... 3.07 
1976:2 .23 ... (35 23 .19 ... 27 49 35 l6 ... 1.52 
1976:3 |.27 ... ... 5. 33 ... 1.8 27. ... e 33 0. 1.48 
1976:4 |.14 ... 64 ... 22 ... $.5 .14 ... 64 R ... 5.55 


1971... s. 00 s. ss TP -231] ... ... 100 ... ... -2.31 
197752 ... ... DOL ... ... -.01 2.4 ... ... LOL ...- -.0l 2.94 
19753. ... ... 90 .0 ... ee 99 ... ... 0400 ... ... 1.09 
19754. ... ... 85 45 ... e SE . 0400 ... ee -.82 


1978:01. ... ... 6 3 ... e 375 ... ... 100 ... ... .03 
19782. ... ... .36 .64 ... ee 890 ... ... 100 .. .. — 1.08 
19783. ... ... ... 1328 ...  -.28 20.47 .36 ... 64 0... us 5.10 
19784 0... ... ... 1400 ... ee -1737 M ... e 86 1.05 


1979:1 ... a 3 2) ... 1692 ... ... e. 0100 —... 2.43 
1979:2 ... ... ... 0M 29... 749 .... e. E00 ... 2.37 
197:3 ... ... ... L00 ... ee 5.04 ... ... e. 0100 ... 2.15 
1979:4  ... ... ... LOO ... ee 1.70 ... e. 0100 ... 2.52 


1981. ... ... ... .8| .I9 ... -1024 ... ... e. 600 .. 2.96 
1980:2 ... ... ... .05 .95 ... 4.588 ... 4. e. L00 3.68 
1980:3..... ... ... 100°... ee 25.40 ...  ... e E00 ... 1.97 
1980:4 ... ... ... M00 ... ee 749 ... ... e LOO 2.85 


1981:1. ... ... ... .85 d$... 13.13... ... e. E00 ... 3.73 
1981:2 ... ... ... 108 ... — .08 8.72 ... ... e. 0100 ... 3.24 
1983 ... ... ... L06 ... .-.06 -1898 ... ... e. 40400 ... 3.73 
19814 ... .. ... M00 ... ee 11.22... ... e. 00 3.77 
1982:14. ... ... wee 124 ... —.74 -1226 ... ... o. 0100 .. 2.86 
1982:2 .... ... wee LIH... -.MH —.86 ... ... e 0500 ... 3.46 
1982::3 2... ... ... E81... -.5l 13.78 1... 00 e. 0100 ... 3.33 
1982:4 ... ... ... 2.00 ... -100 44.04 100 ... .... «e 5 18.14 
1983:1 ... ... ... 200 ... —1,00 36.76 1.00 |... TD 10.05 
19832. ... ... ... 200 ... -1.00 39.57 125  ... es s A) 13.25 
1983:3./.... ... ... 2090 ... -1.00 -590 100 ... "RC —.14 
1983:4 ... ... ... 200 ... -100 -7.32 178 ... es s.s 748 ASD 


NorE.—CS = common stocks, GB = government bonds, CB = corporate bonds, SS = small 
stocks, RL = risk-free asset, B = the fraction in borrowing, and r, = return for the quarter. 
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TABLE 4 (Continued) 
Power - 1$ 
With Small Stocks Without Small Stocks 
Inv. Fractions Inv. Fractions 
CS GB CB SS RL RB RP CS GB CB RL RB RP 
1.00 1.94 1.00 1.94 
1.00 2.06 1.00 2.06 
1.00 1.94 1.00 1.94 
1.00 1.81 1.00 1.81 
1.00 1,62 1.00 1.62 
1.00 1.44 1.00 1.42 
1.00 1.49 1.00 1.52 
eel 1.00 1.52 ... 1,00 1.52 
ee. 04 .96 1.35 ... .04 .96 1.35 
.02 .02  .01 .95 1.17 .03 .02 95 1.24 
.02 ees .98 1,33 .02 ee. .98 1.33 
.01 .04 95 1.50 Ol ... .04 .95 1.50 
.07 .20 73 — 05 07 .20 73 — 05 
24 .76 1.56 24 .76 1.56 
22 .78 1.21 22 .78 1.21 
A8 /... .82 1.05 .18 .82 1.05 
3] 0l .78 1.29 23 77 1.18 
.06 .04 .91 193 ... 13 87 1.26 
.09 .91 315 02 .09  .89 1.99 
.08 .92 44  .01 99 1.93 
.04 .96 3.31 1.00 2.43 
.04 .96 2.66 1.00 2.37 
.07 93 2.38 1.00 2.15 
.06 .94 2.47 1.00 2.52 
.05 95 2.15 1.00 2.96 
.. 100 3.74 1,00 3.68 
.07 .93 3.64 1.00 1.97 
.07 .93 3.18 1.00 2.85 
.05 .95 4.31 1.00 3.73 
.09 .91 3.73 1.00 3.24 
„li .89 1.36 1.00 3.73 
.07 .93 4.30 1.00 3.77 
14 .86 1.70 1.00 2.86 
10 .90 3.08 1.00 3.46 
2 .88 4.9 ... 1.00 3.33 
21 .79 6.56 A7 .83 4.70 
.23 77 6.16 .18 .82 3.44 
23 37 6.45 .14 .86 3.44 
.22 .78 1.42 12 .88 1.98 
.24 76 1.16 .18 .82 1.90 
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Fic. 1.—Geometric means and standard deviations of annual returns with 
and without small stocks, 1934-47 (quarterly revision, with leverage, 32- 


quarter estimating period). 


Since the returns themselves are not additive but compound multiplica- 
tively, we employed the paired r-test for dependent observations to the 
quarterly (and additive) variables In(1 + rj) to test whether the pres- 


ence of small stocks improved returns significantly. '! 
the return series rl, .. . 
different strategies, we calculate the statistic 


t= 4 _ 
o(d) Vn. 


Il. This test was also employed by Fama and MacBeth (1974). 


, rl with the return series rj, . . 


Thus to compare 
. , Ż for two 
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Fic. 2.—Geometric means and standard deviations of annual returns with 
and without small stocks, 1934-83 (quarterly revision, with leverage, 32- 
quarter estimating period). 


where 


i- » In(1 +r!) — In + 72) 
t=1 


n 


and o(d) is the standard deviation of the differences between In(1 + rJ) 
and In(1 + r2). In each case, the null hypothesis is that 


ElIn(1 + 7b] = Ellan + 72)] 
and the alternative hypothesis is that 
Elni + rb] > Elin + 72)]. 
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TABLE 5 Paired t-Tests, 1934—47 Subperiod, with Leverage 
Comparison d o(d) t 
Power — | vs. common stocks .0196 .1267 ` 1.16 
Power 0 vs. portfolio I .0378 .1438 1.97* 
Power .5 with small stocks vs. power .5 

without small stocks .0360 .1813 1.48 
Power .25 with small stocks vs. power .25 

without small stocks .0306 1333 1.72* 
Power .25 with small stocks vs. power 0 

without small stocks .0273 1157 1.76* 
Power 0 with small stocks vs. power 0 

without small stocks .0256 .1068 1.79* 
Power — | with small stocks vs. power — 1 

without small stocks .0160 .0591 2.03** 
Power —2 with small stocks vs. power —2 

without small stocks .0103 .0389 1.99** 
Power —3 with small stocks vs. power —3 

without small stocks .0079 .0311 1.90* 
Power —5 with small stocks vs. power —5 

without small stocks .0052 0211 1.85* 
Power —7 with small stocks vs. power —7 

without small stocks .0038 .0156 1.81* 
Power — 10 with small stocks vs. power — 10 

without small stocks .0025 .0104 1.79* 
Power ~ 15 with small stocks vs. power — 15 

without small stocks .0020 .0073 2.06** 


* Significant at the 10% level. 
** Significant at the 5% level. 


The results for selected pairs with comparable standard deviations in 
the 1934-47 period (56 quarters) are given in table 5, while table 6 gives 
the results for the full 200-quarter 1934-83 period. 

Table 5 suggests that the presence of small stocks was especially 
significant in improving returns for the ‘‘middle’’ strategies (powers 
— 15-.25) from the early 1930s to the late 1940s. Statistically, the im- 
provements were less impressive for the full period (fig. 2 and table 6). 
The active strategies performed extremely well when compared to 
bonds and bills, attaining highly significant improvements, through 
diversification, in the geometric means without increasing variability. 
Figures | and 2 and tables 5 and 6 also reveal that the active strategies 
performed surprisingly well when compared to the. 10 fixed-weight 
portfolio strategies. 


VII. Concluding Remarks 


By way of summary, several conclusions emerge. First, the present 
study confirms that the simple probability assessment approach, which 
uses only the past to (naively) forecast the future, is apparently not 
without merit when combined with multiperiod investment theory. Not 
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TABLE 6 Paired t-Tests, 1934-83 Period, with Leverage 
Comparison d o(d) t 
Power —3 vs. common stocks .0062 .0758 1.16 
Power — 15 vs. corporate bonds .0085 .0452 2.67*** 
Power — 15 vs. government bonds .0100 ,0458 3.08*** 
Power — 50 vs. risk-free asset .0036 .0102 4.94*** 
Power — 10 vs. portfolio C .0032 .0419 1.09 
Power —20 vs. portfolio B .0032 .0311 1.45 
Power —20 vs. portfolio A .0071 0356 2.82*** 
Power 0 with small stocks vs. power 

.75 without small stocks .0116 .1073 1.52 


*** Significant at the 1% level. 


only did it produce respectable results, but the logarithmic policy also 
always came out at or near the top in the geometric mean dimension, 
suggesting an absence of clear biases in the estimating method; the 
exception occurs in the last subperiod, when the approach appears to 
have been somewhat too conservative. '? Second, small stocks entered 
even the most risk-averse portfolios most of the time. On the other 
hand, they were ignored by all the strategies some of the time (espe- 
cially in the mid- and late 1950s and mid-1970s). Third, small stocks, 
when chosen, tended to replace common stocks, except in the 1970s 
and early 1980s, when they were primarily held in lieu of the risk-free 
asset. This was true even when (additional) margin purchases would 
have been possible. Consequently, the presence of small stocks had 
relatively little effect on the use of leverage; in fact, the effect was not 
always in the positive direction. Thus the use of, and gains from, 
margin purchases occurred, as in our previous paper (Grauer and 
Hakansson 1982), primarily for the more risk-tolerant strategies from 
the mid-1930s to the mid-1960s. 

Fourth, small stocks had a notable positive effect on returns, not 
only for the risk-tolerant strategies but also for conservative investors. 
This effect was especially significant from the mid-1930s to the early 
1950s in the presence of margin purchases. Fifth, the strong gains from 
diversification among the major asset categories reported in the previ- 
ous papers are, if anything, strengthened when small stocks are in- 
cluded. 

Sixth, the performances of the active strategies, when measured 
against those of the fixed-weight strategies, were surprisingly strong. 
Could it be that the naive, simple-minded empirical distribution con- 
tains the kind of information money managers usually pay good money 
for? 


12. Under perfectly valid distributions, power y = 1/T, where T is the number of 
periods, has the highest asymptotic probability of attaining the largest geometric mean as 
T increases. 
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Finally, one cannot escape the conclusion that U.S. financial mar- 
kets offered a generous environment during the half-century we 
studied. As a point of reference, consider the following. The maximum 
employee-portion of the social security contribution for an individual 
in 1984 was $2,532.60. The purchasing power of this amount (in Janu- 
ary 1, 1984, dollars) at the beginning of 1934 was $328.78. Under a 
naive logarithmic policy in an environment with access to small stocks 
and leverage, this initial contribution alone would have grown to 
$2,078,925 as of the end of 1983. 

Of course, the reader should also be reminded of the limitations of 
the study. The model used is focused on sequential reinvestments only, 
without concern for intermediate consumption; even though its birth 
occurred in the mid-1970s, it was applied as far back as 1934. The latter 
statement also applies at least partially to the data base used. The joint 
probability estimates were based on periods ranging from the most 
recent 5-10 years only. All investors were assumed to be strict 
price takers. Transactions costs and taxes were ignored (as in the 
underlying returns series). Finally, maintenance margins were ignored 
whenever leverage was used. 
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Abstract 


The growth optimal investment strategy has been shown to be highly effective 
for structured decision problems such as blackjack, sports betting, and high fre- 
quency trading. For securities markets, these strategies are more difficult to apply 
due to a variety of practical issues: structural changes in market behavior due 
to varying risk premium and related factors, transaction costs, operational con- 
straints, and path dependent risk measures for many investors, including surplus 
risks for a defined-benefit pension plan. In addition, the standard three step 
approach for institutional money management does not allow for rapid changes 
in asset allocation — especially needed during highly turbulent periods. We 
modify the growth models to address downside protection, along with applying 
a portfolio of investment strategies — to improve diversification of the portfo- 
lio. Empirical results show the benefits of the concepts during normal and crash 
(2008) periods. 


1 Introduction 


The growth optimal model and its siblings have been applied successfully in a 
wide variety of decision problems. As an early example, Thorp (1969) imple- 
mented card counting approximations in the area of gambling situations such as 
blackjack, as well as extensions in the investment domain including option pricing 
models. Grauer and Hakkasson (1986) evaluated asset allocation models with the 
iso-elastic expected utility model (including log-utility); these authors applied this 
framework for many years in a series of related papers. More recently, Stutzer 
(2003, 2010), Ziemba (2005), and MacLean et al. (2004, 2009) extended the growth 
models to address downside protection. See Bell and Cover (1980), Kelly (1956), 
and Samuelson (1971, 1979) and the remainder of this book for details of growth 
optimal strategies. 

There are a number of practical issues to address when implementing growth 
models. For example, individual investors rarely are able to monitor their affairs 
on a short term basis and take proper corrective actions. In fact, individuals 
are often accused of rendering poor judgment during market turning points — 
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increasing risk assets at market tops, and vice versa. Individuals can be unaware 
of their asset allocation or surplus capital, except during occasional visits to a 
financial planner. 

In contrast, institutional investors often apply well established risk management 
tools via a three step process. First, a set of generic asset categories is defined, such 
as large-cap U.S. equities, small-cap value equities, long duration government bonds, 
real estate, and so on. Standard asset categories along with tracking benchmarks 
are provided by firms including, Russell, Standard and Poor's, MSCI, and Dow 
Jones. In most cases, these asset categories can be passively managed by means of 
index funds or exchange traded funds (ETFs); they serve as benchmarks for active 
portfolio managers. Second, on a semi-regular basis, the investor conducts an asset 
allocation or asset-liability management study to ascertain their asset proportions 
to best meet their goals, liabilities, and risk tolerances. For example, many uni- 
versity endowment or pension plan administrators carry out this task every 2 to 3 
years. As a third step, the investor selects active managers to beat their respective 
benchmarks, or to invest in a low-cost passive index (which can be simply defined 
such as the usual cap-weighted index or a variant such as fundamental weighted 
index). The three step process has evolved to provide an implementable process for 
large institutions; otherwise, there can be great difficulty achieving diversification 
benefits and simply managing a large pool of capital. 

The three step process has worked reasonably well over the past twenty plus 
years. It is durable and has certain benefits in terms of ease of implementation 
and adding judgment by an advisory firm or investment management organization. 
Unfortunately, several issues have arisen over the past few years that have slowly 
undermined the approach. First, the correlation among many of the traditional 
style segmentation has increased over time, and reached almost unity in 2008. See 
Figure 1 for the average correlation among six segments along two dimensions: 
market capitalization (large, medium, and small), and fundamental valuation (value, 
growth) over the past 15 years. This situation became critical in 2008 when the 
contagion increased even further. The great majority of stocks performed quite 
poorly in 2008 — and diversification strategies failed in many cases. 

Second, many institutional investors have turned to alternative private markets 
to improve their performance. This shift has been championed by David Swensen 
at Yale University endowment (with his book Pioneering Portfolio Management 
(2000) and superb performance up until early 2008). Private markets are much 
harder to track since the securities are rarely traded. Also, private markets possess 
substantial spreads between the return of the top managers (say top decile) and 
the average manager in these segments. Thus, it is difficult to apply a passive 
index. And importantly, the return patterns can be hard to model since there are 
few detailed data available for analysis and issues such as leverage are not always 
evident to outside researchers. Conducting an asset allocation study with a majority 
of alternative investments is a treacherous assignment. Importantly, a portfolio of 
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Figure 1 Rolling correlation of traditional equity style classification 


private market assets is quite difficult to modify as conditions warrant due to their 
illiquidity. 

Third, the emergence of ETFs and passive mutual funds in particular segments, 
along with more fundamental causes, has changed the relationships of securities to 
each other. For example, institutional investors will trade a basket of commodities 
(e.g., Goldman’s GSCI, or Deutsche Bank’s commodity index) in large volumes — 
again altering the diversification benefits. Thus, in late 2008, we saw very high 
correlation in returns for a very wide variety of securities — equities, corporate 
bonds of all types, commodities, mortgage-backed securities, and so on. Even many 
alternative investors saw substantial losses — 20, to 30 to 40% in 2008. There 
was a very rapid and substantial increase in risk premium — with commensurate 
increases in volatility and correlation. 

The correlation has increased to close to one for the classical segments: 
Value/growth and Large/mid/small (ST6-A = style segmentation with six cate- 
gories, ST9-A = style segmentation with nine categories). Industry segments (ICB 
= Industry Classification Benchmark) and an optimal classification scheme (OPT) 
give better diversification. The Traditional segmentations are close to a randomly 
selected classification (RND). 

Given our knowledge about these turbulent events, investor could have benefited 
by applying capital growth models. A critical issue involves ensuring a consistent 
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relationship between the investor's wealth and the anticipatory risks in the market. 
The investor must avoid taking risks (bets) above the log-optimal solution, in order 
to optimize the long-term consequence of their decisions. Critically, as the investor's 
wealth decreases, they must lower their risk threshold — by reducing committed 
capital. 'This rule is quite well known; however, it has been difficult or almost 
impossible for many investors to take the appropriate action (due to a number of 
factors including those mentioned above). 

In the next section, we propose an alternative to the usual three-step approach 
to address the abovementioned issues. In this regard, we link traditional growth 
models with strong downside protection. We emphasize capital and risk allocation 
across investment strategies — rather than individual assets (or asset categories). 


2 Dynamic Portfolio Tactics 


This section describes the fundamental tenets of dynamic portfolio tactics (DPT). 
Two of the primary concepts are: protect the investor's capital (or goal capital) 
as a first priority — prevent large losses at all costs; and construct and revise 
a portfolio of investment strategies — rather than a portfolio of assets — based 
on dynamic risk analysis. To achieve the first concept, we employ highly liquid 
investments such as futures markets and high-volume exchange traded funds. In 
both cases, the transaction costs are quite low for most investors (except the largest 
institutions such as CALPERs and the Canadian Pension System). For investors 
possessing illiquid assets, we advocate a variation of the DP'T approach for tactical 
asset allocation based on replication strategies (see Mulvey and Ling, 2009). In this 
paper, we focus on single-period, asset only investments; see Mulvey et al. (2003a, 
2008, 2009), Zenios and Ziemba (2006, 2007), and Ziemba and Mulvey (1998) and it 
references for applications of asset-liability and multi-stage models. The developed 
approach can be extended in the natural manner. 

To start, we define the traditional growth model as a sequence of nonlinear 
stochastic optimization models. First, we are interested in a relatively long sequence 
of decisions occurring at equal time steps (for simplicity), t = (1,2,..., T) where 
the horizon equals time T'. For our purposes in the empirical section, we will render 
decisions every five trading days (a balance between high frequency and mid-term 
models) in order to capture relative rapid changes in the investor's wealth. Roughly, 
the investor will makes fifty asset allocation decisions per year, or 500 per decade. 

Next, at time point t, we define uncertainties via a set of stochastic scenarios 
{S}, in which the returns for the next five days will depend upon a number of factors 
known at the time of the decision — including recent volatility, recent momentum 
returns, current risk premium, and so on. Importantly, we do not assume a sta- 
tionary process; instead the risk premium and volatility/correlation will encounter 
bursts of high turbulence (as we have seen displayed in 2008 and in several previous 
episodes). 
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Figure2 Rolling correlation of equity returns and government long bond returns 


We give a straightforward example of a changing relationship in market behav- 
ior (Figure 2). Here we display the rolling correlation between the return of large 
U.S. cap stocks (S&P 500) and the return on long-duration U.S. government bonds. 
Many portfolio models assume that this correlation remains constant. However, 
during sharp downturns, the correlation generally becomes negative — due to fun- 
damental factors — mostly due to a flight to quality during à recession. It is clear 
that the scenario generator must take these changing regimes into account (more 
on multiple regimes later). 

As compared with traditional asset allocation models, we will focus on a set 
of investment strategies {J}, rather than subsets of securities. Several example 
investment strategies will be described in the next section. We distinguish two types 
of investment strategies: core strategies taking capital {J1}, and overlay strategies 
{J2} that employ futures markets and swaps, taking risk capital. We separate 
these strategies in order to emphasize the linkage of overlay strategies with the core 
strategies, and when the core strategies cannot be easily traded, such as private 
equity or venture capital. We are interested in the optimal combination of core and 
overlay strategies that best suits the investor at a given time. 

'The traditional growth models employ the following stochastic optimization 
model [GO]. 


[GO] Max EU (w2) 
Subject to w = 5 Tj 2; 20, and rex 
je 
and W2,5 = 2T *(1+rjs) for se19] 
Ji 
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The decision variables, xj, are defined over set {J1}, in our case investment 
strategies rather than securities or groups of securities, except when indicated oth- 
erwise. We define the investor’s initial wealth equal to w 1, to reference the start 
of the planning period t — [1 to 2]. For this initial model, we greatly simplify 
the investor's situation by ignoring transaction costs, by assuming a static, single 
period planning period, by assuming no cashflow considerations, by focusing on 
the assets without reference to liabilities or goals, defining a long-only perspective, 
and so on. In a discrete-time model such as [GO], there is an implicit assumption 
that the model will be executed over a short enough horizon so that the investor's 
wealth is not greatly affected by intermediate changes in returns. Next, we assume 
the standard growth-optimal utility function U (w2) = Ln(w»s) within the family of 
iso-elastic functions U (w) = (1/y) x w7 for a single risk aversion parameter gamma. 
Returns are designated over a finite set of stochastic scenarios (S). All policy and 
legal constraints are contained in set {X}. 

'This model is a straightforward nonlinear program, which is concave for any risk 
adverse decision maker, and thus can be readily solved with a standard package 
(such as the solver in Excel). For the optimization, we employ versions of the iso- 
elastic utility function. However, the model can be readily adapted for other convex 
reward and risk measures. 

To extend the model, we allow for a borrowing variable x, (with upper limit us) 
and a set of overlay variables — set {J2}. The first equation above is modified as: 


wita= >> a; and 0< £p <u 
jeJ1l 


The revised second equation, ending wealth we is: 


W2 s = a *(L+rj5) —y* (re) + 5 Lj * Tj 

j jeJ2 
The overlay variables do not require borrowing (implicit) and do not take direct 
capital. Rather, the core assets {J1} serve as the margin requirements for the 
overlay strategies (Mulvey et al., 2006, 2007). We assure that the total level of 
overlay risk capital is small enough to prevent margin calls and to maintain the 
investor's overall risk profile (limits in constraints X). The cost of borrow is defined 
by fb ,s- This valuc is unccrtain duc to two causcs: since the cost of borrowing may 
depend upon interest rates during the planning period, and secondly due to the 
variable amount of leverage that may occur during the planning period as specified 
by the investment strategies. 

The growth strategy [GO] (and variants) can be applied in a straightforward 
manner given a reliable and robust model for generating/forecasting the return 
scenarios embedded in set (5). Unfortunately, due to the changing structure of 
security markets (e.g., DeBondt and Thaler, 1985, 1987) and the limited number 
of time junctures for most investors, there is à need for a carefully designated risk 
management system. A typical theoretical approach would be to increase the risk 
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p(d,) 
d 
Dy D, 
Figure 3 A penalty function to protect downside losses 
aversion parameter y within the iso-elastic family — lowering risks throughout 


the entire planning period. Instead, we design an alternative approach in which the 
investor's risk aversion changes as a function of her wealth path. Accordingly, the 
utility function is modified to be conditional on the wealth path {w+} and especially 
drawdown values [d,) at time t. The following function reflects this concept: 


[GO — DRAW] U(we) = (1/7) * w3 — p(d2) * wr 


here the function p(-) penalizes investment decisions with outcomes falling below 
specified drawdown values. The drawdown variable is a function of the distance 
between the high water mark of the wealth path and the current wealth. 


high _ 


high 
d, —w,?'—w, where w, max {wr} 


7=1,...,t- 


For the empirical tests, we have found that two thresholds values, D; = 12%, 
and D» = 16%, in conjunction with a piecewise approximation have proven to be 
a robust approach. Thus, when drawdown exceeds these thresholds, additional 
risk aversion is obtained by assessing penalties over the iso-elastic function. This 
approach is similar in spirit to Ziemba (2005) and MacLean et al. (2004, 2009). Our 
approach allows for changing risk premium as a function of the investor’s wealth 
and market conditions. 

The downside penalty function is particularly important during periods of high 
turbulence for assets possessing returns with a positive function of risk premium, 
such as equities or corporate bonds. To this point, we refer to the fall 2008 months 
in which drawdown became quite excessive, and risk protection was important at 
that time. 

A critical aspect of any investment model involves the forecasting system for 
projecting the returns for the assets and investment strategies over the upcoming 
period. We advocate a projection system based on multiple regimes and an embed- 
ded Markov switching matrix (Guidolin and Timmermann, 2007; Hamilton, 1989; 
Kim, 1994; and Mulvey and Bilgili, 2009). In other words, at each time period, 
we estimate the current regime (for example, normal, bubble, crash) by means of 
current conditions such as volatility and correlation over the past 5 to 10 days. 
Several algorithms can be applied to this problem (Ding and He, 2004; Mulvey and 
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Figure 4 A time series of a triple regime analysis for U.S. equities (1/1/1983 to 1/1/2009) 


Table 1 Transition probability matrix for three regime analysis 


Regime 1 | Regime2 | Regime 3 | Number of Periods 
Regime 1 0.65 0.22 0.33 101 
Regime 2 0.21 0.35 0.44 81 
Regime 3 0.12 0.19 0.63 158 


Bilgili, 2009). Once a regime is estimated, we can calculate a subset of internally 
consistent scenarios for each regime. As an example, suppose that we estimate 3 
regimes for equities. Then, we form three sets of scenarios (51), (52), and (53], 
with probabilities, 71, 7», and 73, respectively. The number of scenarios in each set 
depicts the relative probabilities. And the probability of moving between scenarios 
is indicated by the Markov transition matrix. Mulvey and Bilgili (2009) provide 
further details. Figure 4 and Table 1 show the path and the transition probability 
matrix of a three regime analysis over the period 1983 to 2009 for the U.S. equity 
market, respectively. Note that the down cycles correspond mostly to the third 
regime, and the persistence of these regimes over time. Regimes one and three are 
the most stable (60-65% probability of remaining in that regime for the subsequent 
period), whereas regime two is the least stable (3596 probability). 


3 Investment Strategies and Empirical Results 


'This section takes up the proposed growth model with downside protection. The 
first step is to define several investment strategies that possess good long term 
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Table 2 Performance of equity strategies for emerging markets (Benchmark EEM, rebalanced = 
equal weighted, and 130/20 strategies) 


Period Length: 4.42 years EEM Rebalanced 130/20 Period Length: 14.42 years EEM Rebalanced 130/20 
‘Annual Geometric Return 10.80% 12.80% 13.90% Annual Geometric Return 6.61% 11.70% 13.61% 
Annual Standard Deviation 36.90% 33.30% 36.1096 Annual Standard Deviation 25.8696 25.1196 27.71% 
Best Month Return 30.60% 29.60% 32.30% Best Month Retum 30.60% 29.60% 32.30% 
Sharpe Ratio 0.18 0.27 0.27 Sharpe Ratio 0.10 0.31 0.35 

Worst Month Return -26.1096 -25.70% -28.20% Worst Month Return -26.10% -25.70% -28.20% 
Max Drawdown 63.80% 63.30% 67.10% Max Drawdown 63.80% 63.30% 67.10% 
Retum/MaxDrawdown 16.92% 20.28% 20.66% Return/MaxDrawdown 10.40% 18.50% 20.30% 
Correlation with EEM Strategy 1.00 0.98 0.96 Correlation with EEM Strategy 1.00 0.95 0.93 


characteristics. We illustrate the modeling principles by reference to several strate- 
gies that have proven effective over long time periods. 

The core investment strategy employs three equity markets: (a) countries in 
the emerging market segment (symbol = EEM), (b) sectors of the S&P 500 index 
(symbol = SPY), and (c) developed countries outside the U.S (symbol = EFA). 
In each case, there are cap-weighted indices that can be readily purchased (EEM, 
SPY, and ETA, respectively) via high volume exchange traded funds (ETFs). These 
instruments are among the most highly traded in the world (over $500 million per 
day in mid-2009). To develop an investment strategy, we form a portfolio of ten 
E'TFs that depict each of the three underlying markets. For example, the emerging 
markets can be largely covered by purchasing single country ETF (China, Brazil, 
India, Korea, etc.). The long portfolio will be set up by equally weighting the 10 
countries (1096 each), with rebalancing back to the equal weight target on a short- 
term basis (5 days in our tests). We can show that in many cases, a fixed-mix 
rebalanced portfolio will outperform a traditional buy-and-hold cap-weighted index 
(next subsection). Table 2 shows a comparison of the equal-weighted index versus 
the cap-weighted index from the emerging markets, over two periods 1/1/1995 to 
7/1/2009 and 1/1/2005 to 7/1/2009. Similar results occur for EAFE and S&P 
500 sectors. In fact, equal weighting is the optimal solution to a dynamic portfolio 
model if the returns of the individual components are equal and independent. 

This section illustrates the advantages of applying a multi-period policy rule. 
We begin with the well-known fixed-mix investment rule due to its simplicity and 
profitability. This policy rule serves as a benchmark both for other types of rules 
and for the recommendations of a stochastic programming model. 


3.1 Fixed mix policy rules 


First, we describe the performance advantages of the fixed-mix rule over a static, 
buy-and-hold perspective. This rule generates greater return than the static model 
by means of rebalancing. The topic of re-balancing gains (also called excess growth 
or volatility pumping) as derived from the fixed-mix decision rule is well understood 
for a theoretical perspective. The fundamental solutions were developed by Merton 
(1969) and Samuelson (1969) for long-term investors. Further work was done by 
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Fernholz and Shay (1982). Luenberger (1998) presents a clear discussion. We 
illustrate how rebalancing the portfolio to a fixed-mix creates excess growth (Mulvey 
and Kim, 2008). Suppose that a stock price process P; is lognormal so it can be 
represented by the equation 


d P, = a P,dt + e P,dz, (1) 


where a is the rate of return of P, and c? is its variance, z, is Brownian motion 
with mean 0 and variance t. 

The risk-free asset follows the same price process with rate of return equal to r 
and standard deviation equal to 0. We represent the price process of risk-free asset 
by By: 


dB, = r Bidt (2) 
When we integrate the equation (1), the resulting stock price process is 
B, = Pye /2)t+o21 (3) 


It is well documented that the growth rate y = «—6?/2 is the most relevant measure 
for long-run performance. For simplicity, we assume equality of growth rates across 
all assets. This assumption is not required for generating excess growth, but it 
makes the illustration easier to understand. 

Next, let's assume that the market consists of n stocks, each with stock price 
processes P,,,..., Py following the lognormal process. A fixed-mix portfolio has 
a wealth process W, that can be represented by the equation 


dWA/Wi = mdPi t/Pit + + 1ndPat/Pn,t (4) 


where 7,...,75 are the fixed weights given to each stock (proportion of capital 
allocated to each stock). In this case, the weights sum up to one 


Son =1 (5) 
i=l 


The fixed-mix strategy in continuous time always applies the same weights to stocks 
over time. The instantaneous rate of return of the fixed-mix portfolio at anytime 
is the weighted average of the instantaneous rates of returns of the stocks in the 
portfolio. 

In contrast, a buy-and-hold portfolio is one where there is no rebalancing and 
therefore the number of shares for each stock remains constant over time. This 
portfolio can be represented by the wealth process Wi: 


aw, = midP\ 4 Tee mad P, (6) 


where m,...,7m4 depicts the number of shares for each stock. 

Again for simplicity, let's assume that there is one stock and a risk-free instru- 
ment in the market. This case is sufficient to demonstrate the concept of excess 
growth in a fixed-mix portfolio as originally presented in Fernholz and Shay (1982). 
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Assume that we invest 7 portion of our wealth in the stock and the rest (1 — 7) in 
the risk-free asset. Then the wealth process W, with these constant weights over 
time can be expressed as 


dW,/W, = ndP,/ Pi + (1 — n)dB:/ Bi (7) 


where P, is the stock price process and B, is the risk-free asset. 
When we substitute the dynamic equations for P, and B+, we get 


dW,/W, = (r - n(o — r))dt + nodz (8) 


For simplicity, assume the growth rate of all assets in the ideal market should 
be the same over long-time periods so that the growth rate of the stock and the 
risk-free asset are equal. Hence 


a-o/2=r (9) 
From equation (8), we can see that the rate of return of the portfolio, o, is 
Quy ST + nla = r) (10) 
By using (9), this rate of return is equal to 
aw =r -- na? /2 (11) 
The variance of the resulting portfolio is 
=o" (12) 
Hence, the growth rate of the fixed-mix portfolio becomes 
^u = Ay —02/2 =r + (n — n)a? [2 (13) 
This quantity is greater than r for 0 < 7 « 1. As it is greater than r, which 
is the growth rate of individual assets, the portfolio growth rate has an excess 
component, which is (7—7?)o?/2. Excess growth is due to rebalancing the portfolio 
constantly to the target fixed-mix. The strategy moves capital out of stock when 
it performs well and moves capital into stock when it performs poorly. By moving 
capital between the two assets in the portfolio, a higher growth rate than each 
individual asset is achievable. It can be shown that the buy-and-hold investor with 
equal returning assets lacks the excess growth component. Therefore, buy-and-hold 
portfolios under-perform fixed-mix portfolios in various cases. We can easily see 
that the excess growth component is larger when o takes a higher value. 

Next, we design a simple long/short strategy based on the observation that an 
equal weighted portfolio will generally outperform a capitalized weighted portfolio 
such as the S&P 500 or the MSCI emerging market index EEM. A good compromise 
is 130/20 — wherein the equal weighted index is set at 13096 long, and the cap- 
weighted index is 20% short. The portfolio is slightly levered at 110% with a 
commensurate cost of borrowing the extra 1096. The statistics for this variant is 


listed in Table 2, along with the associated statistics for the benchmarks: capital 
weighted index, equal weighted/5 trading days, and the 130/20 index. 
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Table3 Performance of equity strategies with drawdown constraints (January 1999 to June 2009) 


(Benchmarks = EEM, EAFE, and S&P500; equal weighted; and Dynamic Portfolio Tactics DPT 

Emerging Markets EEM Equal Weight DPT Emerging Markets EEM ‘Equal Weight DPT 
Annual Geometric Return 10.80% 12.80% 31.00% Annual Geometric Return 6.61% 11.70% 20.38% 
Annual Standard Deviation 36.90% 33.30% 21.60% Annual Standard Deviation 25.86% 25.11% 19.02% 
Best Month Return 30.60% 29.60% 7.B0% Best Month Return 30.60% 29.60% 8.80% 
Sharpe Ratio 0.18 0.27 1.25 Sharpe Ratio 0.10 0.31 0.86 
Worst Month Return 26.10% -25.70% 8.70% Worst Month Return 26.10% -25.70% 10.60% 
Max Drawdown 63.80% 63.30% 30.80% Max Drawdown 63.80% 63.30% 33.90% 
Return/MaxDrawdown 16.92% 20.28% 100.80% Return/MaxDrawdown 10.40% 18.50% 60.10% 
Correlation with EEM Strategy 1.00 0.98 0.74 Correlation with EEM Strategy 1.00 0.95 0.74 
Europe, Australia, Far East EAFE Ew DPT Europe, Australia, Far East EAFE Ew DPT 
Annual Geometric Return 4.10% 7.00% 11.00% Annual Geometric Return 4.37% 9.04% 16.55% 
Annual Standard Deviation 23.50% 25.3096 12.10% Annual Standard Deviation 18.9596 20.0796 13.9496 
Best Month Return 17.30% 20.60% 6.50% Best Month Return 17.30% 20.60% 9.20% 
Sharpe Ratio 0.01 0.12 0.58 Sharpe Ratio 0.02 0.25 0.90 
Worst Month Return 12.80% 13.50% 8.50% Worst Month Return 12.8096 13.50% 8.50% 
Max Drawdown 57.70% 59.8096 24.1096 Max Drawdown 57.70% 59.8096 24.1096 
Return/MaxDrawdown 7.11% 11.71% 45.64% Return/MaxDrawdown 7.57% 15.12% 68.67% 
Correlation with EAFE Strategy 1.00 0.99 0.59 Correlation with EAFE Strategy 1.00 0.95 0.75 
S&P 500 S&P 500 Ew DPT S&P 500 S&P 500 EW DPT 
Annual Geometric Return 4.7096 -1.70% 9.40% Annual Geometric Return 6.29% 9.29% 16.50% 
Annual Standard Deviation 20.60% 21.80% 13.70% Annual Standard Deviation 18.75% 18.09% 14.43% 
Best Month Return 15.70% 16.30% 5.50% Best Month Return 15.70% 16.30% 9.60% 
Sharpe Ratio 0.42 0.26 0.40 Sharpe Ratio 0.12 0.29 0.87 
Worst Month Return 15.00% 15.80% 5.9096 Worst Month Return 15.00% -15.80% 6.00% 
Max Drawdown 55.20% 54.80% 24.0% Max Drawdown 55.20% 54.80% 24.00% 
Return/MaxDrawdown 8.51% 3.10% 39.17% Return/MaxDrawdown 11.39% 16.95% 68.75% 
Correlation with S&P 500 Strategy 1.00 0.99 0.69 Correlation with S&P 500 Strategy 0.97 0.76 


As mentioned in the previous section, capital growth theory requires a consistent 
relationship between the investor’s capital at time £ and their risks at the same time. 
Thus, as capital is lost and drawdown increases, the investor must lower risks — 
otherwise the size of the bets will be too large in most cases (roughly speaking). The 
DPT model takes this into account. As mentioned, we designate two breakpoints 
for approximating the nonlinear objective function, Dı and D» to reduce risks by 
reducing capital in à complementary fashion. 

'The results of applying the approximation are shown in Table 3. Here we see 
that the overall performance has been increased, especially with regard to the worst 
case losses. Especially, note the worst and best case time periods. In the case of the 
cap-weighted index, we see that the upside periods are much larger than the upside 
periods for DPT. Conversely, the worst downside periods are much better with 
DPT than with the cap-weighted indices. As mentioned, we are willing to give up a 
substantial portion of the best periods, if we are able to protect the investor's capital 
during drawdown periods. This condition is an important criterion for achieving 
the capital growth path. 

To improve upon the core equity results (core assets), we add two strategies. 
'These strategies are designated as overlays, i.e., they do not require any dedicated 
capital for their purchase (only as marginable assets). See Mulvey et al. (2004) 
for a well known example of a successful overlay strategy based on trend fol- 
lowing. Table 4 shows the historical performance of the two developed overlay 
strategies. 
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Table4 Performance of two overlay strategies (DEO and PUI) 


at 100 

Period: July 1999-June 2009 100% DEO 100% PUI 

Annual Geometric Return 8.69% 7.32% 
Annual Standard Deviation 16.39% 7.73% 
Max Drawdown 14.29% 8.13% 
Return/Volatility 0.53 0.95 
Return/MaxDrawdown 0.61 0.90 
Worst Month Return -8.58% -6.37% 
Best Month Return 35.59% 5.56% 


The first overlay strategy is called Duration Enhancing Overlay — DEO. See 
Mulvey et al. (2006) and Zhang (2006). This strategy increases the duration of 
a portfolio by entering into a swap agreement (or a long future purchase of long 
government bonds). DEO, in general, takes a long position on the long term bond 
and a short position on T-bill. This strategy has a particular benefit for a defined 
benefit pension plan to help them in matching duration of assets and liabilities 
(Mulvey et al., 2006). 

Dynamic DEO is a strategy that uses certain signals to improve performance 
vis a vis returns and reduced correlation with equity markets. The signals that 
the strategy uses are from (1) interest rates, (2) equity market returns, and (3) the 
strategy’s performance over recent time periods. The basic idea is to use the trend 
property of interest rates and the volatility level of the equity market returns. When 
equity market volatility increases, the strategy increases its capital commitment. In 
addition to these, past performance of the strategy itself is used; it is assumed that 
in the DEO strategy a bad day precedes a cluster of bad days. 

The second overlay strategy applies momentum and futures curve factors to 
a collection of futures markets — extending upon the traditional trend following 
rules developed by Dr. Frank Vannerson at Commodity Corp and later at Mt. Lucas 
Management, and many others (e.g., Bodie and Rosansky, 1980; Chan et al., 1996; 
Erb and Harvey, 2006; Mulvey et al., 2004; and Rouwenhorst, 1998). We call 
our strategy the Princeton University Index (PUI). Herein, the goal is to improve 
performance by capitalizing on patterns occurring in commodity markets. 

The index is based on two ideas: the expected return of commodities futures 
depends upon the degree of hedging by producers or consumers — as evidenced by 
the shape of the futures curve for each commodity (Brennan et al., 1997). Thus, a 
curve with sharp contango signifies that consumers are the primary hedgers, whereas 
backwardation signifies that the producers are the primary hedgers. We assume, and 
with much evidence to support the supposition, that hedgers will pay an expected 
costs for their bebavior (and conversely that speculators will possess a positive 
expected cost) over long time periods and on average. Accordingly, we take positions 
that correspond to positive expected values. A rebalanced portfolio of commodity 
futures is constructed to greatly reduce risks for the PUI strategy. 
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Table 5 Performance of portfolio of strategies — July 1, 1999 to June 30, 2009 


GS DPT 
100% Thond (30 commodity Combined 
Period: July 1999-June 2009 100% SP500 100% EAFE year) AIG commodity Index index Strategies 
Annual Geometric Retum -2.26% 1.88% 6.9526 4.02% 10.67% 25.2596 
Annual Standard Deviation 15.92% 18.08% 13.5996 17.46% 25.05% 14.9496 
Max Drawdown 50.80% 54.24% 23.77% 54.50% 610326 12.99% 
Retum/Volatility -0.14 0.10 0.51 023 043 169 
Retum/MaxDrawdown -0.04 0.08 029 007 0.17 194 
Worst Month Retum -1652% -19.05% -14.56% -2134% -27.77% -7.68% 
Best Month Retum 9.93% 13.19% 16.05% 12.99% 2110% 22.79% 


Figure 5 Time series performance for equity and bond benchmarks and combined DPT strategy 
(July 1, 1999 to June 30, 2009) 


As a second factor, we measure the momentum of return for each of the com- 
modities over the past six to twelve months. If momentum is positive, we increase 
our exposure to that commodity in the index. The overall commodity portfolio is 
rebalanced twice per month. 

We next combine the three strategies: core equities as discussed in the previous 
section, the commodity overlay via PUI, and the interest rate overlay via DEO in 
an integrated portfolio (called DPT). Note that we are employing the core equity 
strategy as traditional investable assets, along with the two overlay strategies DEO 
and PUI. The percentage of traditional assets assigned for the overlays is a small 
portion of capital — around 10 to 15%. This small percentage prevents margin 
calls and remains within the target level of risks. Also, from the standpoint of 
risk allocation, we designate the proper proportion of the overlays, in this case — 
100% for PUI, and 50% for DEO. These proportions are targets for a fixed-mix 
rebalancing rule on a monthly basis. 

Turning to Table 5 and Figure 5, we see that the overall performance is enhanced 
by combining the three strategies in a rebalanced portfolio. In particular, the Sharpe 
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ratio is improved since the strategies provide their best returns at different time 
conjunctures. In a similar fashion and more importantly, the return per drawdown 
ratios are superior for the integrated system. This performance is robust with 
respect to the modeling parameters since the individual strategies are designed to 
provide wide diversification. Also, the rebalancing gains can be significant and can 
be readily implemented due to the liquidity of ETFs and futures markets. 


4 Conclusions 


This paper extends capital growth models with downside protection. To achieve 
the twin goals of protecting the downside while generating good returns during 
upswings, we penalize drawdown values in a sequential fashion, first via a stochastic 
nonlinear program, and second by implementing relatively simple policy rules. We 
show that a long-term investor can perform well by avoiding large losses, as is well 
known. As a corollary, we demonstrate that a portion of the upside gains can be 
forsaken in our quest to improve the downside protection. This latter observation 
is somewhat controversial since some have argued that an investor must stay fully 
invested in order to improve on long term performance. Our objective function 
has an asymmetric relationship between the upside and the downside. But the 
changing market structure necessitates a more conservative strategy than is the 
case with traditional capital growth models. We focus on the drawdown values to 
implement this primary concept. 

A second feature involves combining a set of investment strategies via dynamic 
portfolio tactics. Since sharp market corrections take place within the context of ris- 
ing volatility and correlation, we cannot depend upon diversification benefits among 
asset securities. T'here is simply too much contagion during periods of high turbu- 
lence. Instead we design a portfolio of strategies in which the performance of one 
strategy is relatively uncorrelated with the performance of neighboring strategies. 
Several examples of these phenomena are presented. 

The primary damage of the 2008 market crash was done in a quick amount of 
time — three months September toNovember (along with earlier losses in February). 
It is evident, therefore, that investors must remain vigilant and nimble if they 
wish to avoid large losses. The traditional asset allocation procedure possessing 
relatively long periods between revisions of policy have not served the investor well; 
as we mentioned, many investor experienced large losses (25-30% or more). A 
more dynamic framework is needed, with much shorter time intervals, in order to 
achieve the goals of capital growth. The investor must be careful to not only grow 
their capital during positive markets, but they must protect their capital during 
turbulent periods if they are to achieve the goals of capital growth theory. 
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Abstract 


This paper presents an intertemporal portfolio selection model for pension funds or life in- 
surance funds that maximizes the intertemporal expected utility of the surplus of assets net of 
liabilities. Following Merton (Econometrica 41 (1973) 867), it is assumed that both the asset 
and the liability return follow Itó processes as functions of a state variable. The optimum occurs 
for investors holding four funds: the market portfolio, the hedge portfolio for the state variable, 
the hedge portfolio for the liabilities, and the riskless asset. In contrast to Merton's result in 
the assets only case, the liability hedge is independent of preferences and only depends on the 
funding ratio. With HARA utility the investments in the state variable hedge portfolio are also 
preference independent. Finally, with log utility the market portfolio investment depends only 
on the current funding ratio. 
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1. Introduction 


Intertemporal asset allocation models date to Merton (1969, 1971, 1973) (see also the 
summary in Merton, 1990) and Samuelson (1969). Merton presents a continuous time 
intertemporal model whereas Samuelson discusses the discrete time case. Both models 
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formulate a lifetime portfolio selection problem. Merton's (1973) intertemporal capital 
asset pricing model derives equilibrium asset premia. In contrast to Sharpe's (1964) 
CAPM, Merton's CAPM is based on a three-fund theorem. Each rational investor 
holds the riskless asset, the market portfolio, and a hedge portfolio for a so-called 
state variable in order to maximize his lifetime expected utility. The state variable 1s 
a stochastic term, which affects the asset price processes. The hedge portfolio provides 
maximum correlation to the state variable, i.e. it provides the best possible hedge 
against the state variable variance. Continuous time models can be applied to surplus 
optimization problems, since surplus optimizers, such as pension funds or life insurers, 
distribute the surplus to those who are insured. Pension funds or life insurance funds are 
frequently broadly internationally diversified. Exchange rate movements are considered 
as state variables. 

Merton, (1993, reprinted in Ziemba and Mulvey (1998), see also Constantinides' 
(1993) comments on Merton's paper) addresses a problem similar to surplus optimiza- 
tion. He advocates the view that University endowment funds can be managed by using 
Merton's (1969) intertemporal portfolio selection model. His objective function is to 
maximize the lifetime expected utility of University activities with specific costs. The 
University activity portfolio includes education, training, research, storage of knowl- 
edge, etc. Since the activities of a University have to be optimized with respect to their 
costs, the activity costs are "liabilities" for universities. In contrast to our approach, 
Merton (1993) specifies the University’s non-endowment cash flows as Itô processes 
dependent on the activity costs. This is the classical setting of Merton's (1973) in- 
tertemporal CAPM, where the state variables are given by the activity costs. The result 
is that each utility maximizing University holds three funds, the market portfolio, the 
riskless portfolio and a hedge portfolio against fluctuations of the activity costs. The 
composition across those three funds depends on the risk preferences towards market 
and activity cost volatility. 

This paper provides a synthesis of the surplus management and the continuous time 
finance literature. Both the assets and liabilities of a pension fund are modeled as 
Stochastic processes dependent on stochastic state variables. Only the asset mix can 
be influenced by the fund manager's decision. The paper provides a solution to his 
decision problem. In comparison to Merton's (1993) setting, the advance made by the 
paper is that one (or several) state variable(s) are allowed to be distinct from the 
liabilities. Le. in contrast to Merton (1993), this paper explicitly examines the impact 
of long-term liabilities. Three important results developed are: first, if only one state 
variable is considered, Merton's three-fund theorem is extended to a four-fund theorem. 
The four distinct mutual funds are: the market portfolio, the state variable hedge port- 
folio, the liability hedge portfolio, and the riskless asset. Secondly and most important, 
the investment in the liability hedge portfolio depends only on the current funding 
ratio of a pension or life insurance fund and is independent of the utility function. 
This result differs from Merton's (1993) findings. The preference independence of the 
liability hedge portfolio has major implications for monitoring pension funds and life 
insurance companies. Thirdly, the state variable hedging policy is preference indepen- 
dent if hyperbolic absolute risk aversion (HARA) utility functions are assumed. With 
log utility, even the market portfolio investment is independent of the risk aversion 
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coefficient which is caused by the fact that log utility is equivalent to HARA utility 
with a risk tolerance coefficient of « = 0. 

In Section 2, the intertemporal surplus management model and a four-fund theorem 
are derived. In Section 3 we show that with HARA utility, the risk tolerance coefficients 
of the model are related to the funding ratio and to the currency betas of the portfo- 
lio. A k state variable case is derived in Section 4. Section 5 describes a case study 
for a surplus optimizer who is diversified across the stock, the bond markets, and cash 
equivalents of four countries (EMU countries are treated as a single country). Section 6 
concludes the paper. 


2. An intertemporal surplus management model—a four-fund theorem 


Having surplus over time is what life insurance companies and pension funds try 
to achieve. In both life insurance companies and pension funds, parts of the surplus 
are distributed to the clients usually once every year. Hence, optimizing the investment 
strategy of a life insurance or a pension fund is equivalent to maximizing the expected 
lifetime utility of the surplus. 

For £ > 0 we have the stochastic processes A(t), L(t), Y(1), representing assets, liabil- 
ities, and a state variable Y. An extension of the setting to k state variables is possible 
without difficulties (see e.g. Richard (1979) and Adler and Dumas (1983)). Adler and 
Dumas (1983) take purchasing power parities as state variables. The surplus S(/) and 
the funding ratio F(t) are defined by 


S(t) := AU) — LG), F(t) :9 A(O/L(2). (1) 


According to Merton (1973), the state variable Y follows a geometric Brownian motion 
(i.e. log-normally distributed) where uy and cy are constants representing the drift and 
volatility, and Zy(/) is a standard Wiener process. The state variable as well as the 
assets and the liabilities are assumed to follow the stochastic processes: 


dY(t) = YCO[uy dt + oy dZy()], 

dA(t) = A(t) [eas Y (0) dt + oat, Y (0) dZaQ)], 

dL(t) = Lt )puc(o Y ()) dt + L(t, Y G0) dZ), (2) 
where the drift and volatility parameters for the assets and liabilities are allowed to 
depend on both time and state variable, and where Za(t),Z.(1) are Wiener processes 
correlated with Zy(/) and also with each other. Hence, we have cross variations 

dZ4(t) dZy(t) = pay dt, 

dZ, (t) dZy(t) = pry dt, 


dZ4(1) dZL(t) = par dt, 
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where pay represents the instantaneous correlation between the Wiener processes Z4(/) 
and Zy(t), zy is the instantaneous correlation between Z,(f) and Zy(t) and paz 
refers to the instantaneous correlation between Z4(£) and Z,(£). The return processes 
Ry(t), Ra(£), Rr (t) for the state variable, assets and liabilities are defined by 


Ry(t) := dY(O/Y((0, 
Ra(t) :— dAO/A(LO, 
R(t) := dL(Oy/ LG). 


Following Sharpe and Tint (1990) and according to Eq. (1), the surplus return process 
is defined as 


dS) RIO 
Rs(t):— AD = Rat) FO 
- | - A dt + o4 dZ4(1) — FU dZ,(t). (3) 


Since dS(t) = Rs(t)A(L), using Eq. (3) we have 


E[dS(t)] = A(t) (s - A5) di, 


5 2 c? OAL 
dS(t)dS(t) = A(t) (a + FA 2:0) i 


dS(t) d¥(t) = 4()YQ) (om g 70) d 


dY(t)dY(t) = Y (Do? dt, (4) 


where 64, :— PALGAOL, Cay :— Pay Oady,OLy :— pLyOLOy are the covariances of assets 
with liabilities and of the state variable with assets and liabilities, respectively. 

The objective is to maximize the expected lifetime utility of surplus, which implies 
identifying an optimum surplus strategy.! For an analysis of a situation with convex 
penalties for underfunding, see Carifio and Ziemba (1998). The expected utility is pos- 
itively related to the surplus in each period. This 1s because positive surpluses improve 
the wealth position of the insurants of a pension or a life insurance fund, even if the 
yearly retirement benefits are over-covered by the surplus. Hence, in this interpreta- 
tion, the insurants are like shareholders of the fund. Then the following equation is the 
fund's optimization problem, where U is an additively separable, twice differentiable, 


! To avoid underfunding in particular periods we assume a steady state implying that the dollar value of 
employees (insurance policies) entering equals those leaving the pension fund (insurance company) dollar 
value. 
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concave utility function and 7 is the end of the fund’s existence: 


r 
J(S, Y, t) := max E, (/ U(S, od) 
w Jt 


tdt T 
= max E, v" U(S, Y, vx) dv + / U(S, oe) . (5) 
w í Ji+di 


E, denotes expectation with respect to the information set at time ¢ and the J -function 
is the maximum of the life insurance or pension fund's expected lifetime utility. The 
maximum in (5) is taken with respect to w, the vector of portfolio fractions of the 
risky assets. Let n be the number of portfolio assets and œw; (1 <i € n) be the portfolio 
fraction of asset i, then w' :— (@1,...,@,). Applying the Bellman principle (see e.g. 
Dixit and Pindyck (1994, Chapter 4)), according to Merton (1969) and Merton (1990, 
p. 102) and according to the mean value theorem for integrals yields 


JCS, Y, t) = U(S, Y, t) dt + max E,[J(S + dS, Y + dY,t + di]. (6) 


Let Js be the first partial derivative of J with respect to S, Jss the second partial 
derivative of J with respect to S, Jy and Jyy the first and second partial derivatives 
with respect to Y, and Jsy the derivative of J with respect to S and Y. Applying ltó's 
lemma and Eq. (4) yields (see the appendix for proof), 


0— U(LS, Y) 


J, + JsAQ) (u - #5) 


] 2 2 of OAL 
+ max +JyY(t)uy + 5 Jss4 ofat . (7) 


5 dw Y Ok pA QOY (D) (car - T ) 
(t) 

Let m, be the vector of expected asset returns of the risky assets of dimension n,e be 
the n-dimensional vector of ones, V be the n x n matrix of covariances between the 7 
risky assets, va; and v4y be the vectors of covariances between the n assets and the 
liabilities, respectively, with the state variable. It is assumed that a riskless asset with 
return r exists. Then rearranging (7) yields the following equation (see the appendix 
for proof and definitions of the matrix and the vectors): 


0=U(1,S,Y) 
J, + JsA(t) (won —re)+r— Hs) 
1, ; o? W'vanN |. 
+ max TJyY(t)uy + 5 7554 co(m wVw+ P —2 =e) . (8) 


Tz TJn roo +JsrA(DY (t) (v Vay — zi) 
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Differentiating (8) with respect to w yields? 


Js -1 Y(Q)Jsy 4 Volvar 
= — V^ (m — re) — Vay + 
Mods (MIT GOI rt FO 
Js Y(t)Jsy 
=: — “wy — b -W —— Ww, 9 
aJs TI agus YTO © 
where 
V-'(m, — re) Vo! vay Volva 
Wm = -ona ae WY San LU -Ro 
e'V-!(m, — re) e'V-Ivay e'V-!vy 


a:-eV-^l(m,—-re) b:-eV^vay, cime V vy. 


The vectors Wy. Wy, and w; are of dimension n with elements that sum to 1; a, b, 
and c are real constants. The optimum portfolio consists of four single portfolios: the 
market portfolio wą, the hedge portfolio for the state variable wy, which is Merton's 
(1973) state variable hedge portfolio, the hedge portfolio for the liabilities w;, and the 
riskless asset. The state variable hedge portfolio wy reveals the maximum correlation 
with the state variable Y (see the appendix). A perfect hedge for the state variable 
could be achieved if the universe of n risky assets contains forward contracts on the 
state variable. Then the state variable hedge portfolio would consist of a single asset, 
which is the forward contract. The third portfolio w; is interesting. For the liabilities 
there exist no hedging opportunities at the financial markets (i.e. a portfolio which 
hedges wage increases or inflation rates). Eq. (9) shows how a liability hedge can be 
constructed. This is related to the problem addressed by Ezra (1991) and Black (1989) 
and solves it intertemporally. In the four-fund theorem, life insurance and pension funds 
invest in the following four funds, if they maximize their expected lifetime utility (5) 
based on the stochastic differential equations (2) and (3): 


1. The market portfolio wa, with level —a(Js/A(t)Jss ). 

2. The state variable hedge portfolio wy with level —b(Y(t)Jsy/A(t)Jss). 

3. The riskless asset with level 1 + a(Js/A(1)Jss) + b(Y(t)Jsy/A(t)Jsg) — c/F(t). 
4. Finally, the liability hedge portfolio w, with level c/F(f). 


Thus, the holdings of the liability hedge portfolio are independent of preferences. The 
most interesting result in the portfolio selection equation (9) is that the liability hedge 
portfolio holdings depend only on the current funding ratio and not on the form of the 
utility function. In order to maximize its lifetime expected utility, each life insurance 
or pension fund should hedge the liabilities according to the financial endowment. 
The percentages of each of the three other funds differ according to the risk pref- 
erences of the investors. For example, —a(Js/A(t)Jss) is the percentage invested in 


` The derivative of (8) with respect to w is: 0 — Js(m4 —re)--A()Jss Vw — [A/F (Oss vaL + Y (QU sy Yay. 
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the market portfolio. Since J is a "derived" utility function, this ratio is a times 
the Arrow/Pratt relative risk tolerance with respect to changes in the surplus. That 
is, the higher the risk tolerance towards market risk, the higher the fraction of the 
market portfolio holdings. The percentage of the state variable hedge portfolio is 
—b(Y(t)Jsv/A(t)Jss). Merton (1973) showed that this ratio is b times the Arrow/Pratt 
relative risk tolerance with respect to changes in the state variable. The percentage of 
the liability hedge portfolio is c/F(£). Surprisingly, and in contrast to Merton’s results, 
this portfolio does not depend on preferences nor on a specific utility function, but 
only on the funding ratio of the pension fund. The lower the funding ratio, the higher 
the percentage of the liability hedge portfolio. 

This allows for a simple technique to monitor life insurance and pension funds, 
which extends Merton's (1993) approach. In most funding systems, funds are legally 
obliged to invest subject to a deterministic threshold return. Since payments of life 
insurance or pension funds depend on the growth and the volatility of wage rates, 
this is not appropriate, For instance, if the threshold return is 4% p.a. and the wages 
grow by more than this, the liabilities cannot be covered by the assets. Our model 
suggests instead that a portfolio manager of a pension fund should invest in a portfolio 
which smoothes the fluctuation of the surplus returns caused by wage volatility, i.e. 
in a liability hedge portfolio. Since the liability hedge portfolio depends only on the 
funding ratio, preferences of the insurants have not to be specified. 


3. Risk preference, funding ratio, and currency betas 
Assume that the utility function is from the HARA class. Merton (1971 and 1990, 


p. 140) shows that this is equivalent to assuming that the J-function (6) belongs to 
the HARA class as well. Let « < 1 be the risk tolerance coefficient. Then 


U(S, Y,t) C HARA & J(S, Y,t) C HARA, 


— S a 
HS, Y,D = JIS), = 5 St) (10) 


where x and y > 0 are real constants and p is the utility deflator. Observe that (10) 
implies linear absolute risk tolerance since —Js/Jss = S/(1 — «) + /x. The HARA 
class of utility functions as defined in (10) is commonly used; it implies the negative 
exponential utility functions J = —e~*5 when « approaches —oo, y = l, and p =Q. 
Moreover, the isoelastic power utility (see Ingersoll (1987, p. 39)) J =S*/a is obtained 
for 4 0, p —0, and x «(1 — a)*-'/*, Log utility is that member of isoelastic power 
utility when « approaches 0 (5 = p = 0), since by applying l'Hópital's rule to the 
equivalent utility function, 


$* —] 
i = lim ——— = lim (S* — In S. T 
lim JS, Y,1) lim " lim (S In) ^ In (11) 


760 M. Rudolf and W. T. Ziemba 


982 M. Rudolf. W.T. Ziembal Journal of Economic Dynamics & Control 28 (2004) 975-990 
Under the HARA utility function assumption, 


K KS um K? KS x2 
^ (en) , Js=- i (So +n) . 


dJs dJs d$ dd | d4 
=y as da dày 79 (12) 


Jsy 


dY' 


The percentage holdings of the market portfolio may be re-expressed as 


Js S n l l n 
A z] = l-7 . 13 
AJss (xi -3)* x) ES" ( z) ur (13) 


The market portfolio holdings thus depend on the funding ratio and on the risk aversion. 
The higher F, the higher the investment in the market portfolio, and the higher «, the 
lower the market portfolio investment for funding ratios smaller than 1. If a sufficient 
funding is observed (i.e. F > 1), then there is a positive relationship between « and F. 
If « approaches 0 (log utility case), the coefficient for the market portfolio investment 
becomes 1 — i/F plus the constant 4/(Ax). Thus, for log utility pension or life insurance 
funds, risk aversion does not matter. Indeed only the funding ratio matters to determine 
the market portfolio investment. For either case of utility functions, if the funding 
ratio is 100%, there will be no investment in the risky market portfolio. Thus, the 
funding ratio of a fund does not only determine the capability to bear risk but also the 
willingness to take risk. 

We now consider the percentage holding of the state variable hedge portfolio. Sup- 
pose the state variable Y is an exchange rate fluctuation, which affects the surplus of 
a life insurance or pension fund. Given (12) it follows that 

YJsy d4/A | Ra 


AJs . dYlY Ry 


(14) 


Hence, the weight of the state variable hedge portfolio (14) is preference independent 
if HARA utility is assumed. Assume the regression model R4 = (R4, Ry )Ry (asset 
returns are linear functions of currency returns). Hence, —R4/Ry is the negative beta 
of the portfolio with respect to the state variable. Using (14) it follows that 


- St fs Ry). (15) 
SS ! 


If Y is an exchange rate, then —f equals the minimum variance hedge ratio for the 
foreign currency position. The holdings of the state variable hedge portfolio are in- 
dependent of preferences; they only depend on the foreign currency exposure of the 
portfolio. The higher the exchange rate risk in the portfolio, the higher the currency 
hedging. 

Hence, for the HARA utility case, only the investment in the market portfolio de- 
pends on the risk aversion g. The investments in all other funds are preference in- 
dependent. They depend only on the funding ratio and on the exposure of the asset 
portfolio to the state variable. 
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4. The multiple state variable case 


Let & be the number of foreign currencies contained in a life insurance or pension 


fund’s portfolio, Y1,..., Y; be the exchange rates in terms of the domestic currency, 
and Ry,,...,Ry, the exchange rate returns of the k currencies. Then 
Y, Jsy, YS sy 
——— = —P(Ry, Ry, ),..., -—— = BRA, Ry, ), 
AJss PCRa, Ry,) Ass BCGRa, Ry, ) 
which are the Arrow/Pratt relative risk tolerances with respect to changes in the ex- 
change rates for the HARA case. Let v4y,,...,V4y, be the covariance vectors of the 
returns of the asset portfolio with the k exchange rate returns, 
Vol vay, Vo Way, 
Wy, (= —.,... Wy, m ——*, 
n eV! Way, 5 eV Vay, 
which are the state variable hedge portfolios 1-k, and b, := e’V~'vay,,...,¢ :— 


e’V~'vuy, are the coefficients of the state variable hedge portfolios. A pension or a life 
insurance fund with HARA utility function facing & state variables has the following 
investment strategy (R; :— dAj/A,: return on risky asset 7, | — 1,...,n): 


"| a ( aa) +4 ane TTE + gb (16) 


l-g 
i=] 


where 


B(Ra, Ry) = 3 Lonff(Ri, Ry, ). 
i-i 
Since the right-hand side of Eq. (16) depends on the portfolio allocation w, it is not 
possible to solve (16) analytically for w. However, numerical solutions can be applied. 
For & staté variables a (k + 3)-fund theorem thus follows. 


5. Case study 


The following case study is based on a USD-based surplus optimizer investing in 
the stock and bond markets of the US, UK, Japan, the EMU countries, Canada, and 
Switzerland. Monthly MSCI data between January 1987 and July 2000 (163 observa- 
tions) are used for the stock markets. The monthly JP Morgan indices are used for the 
bond markets in this period (Salomon Brothers for Switzerland). The stochastic bench- 
mark for a surplus optimizer is the quarterly Thomson Financial Datastream index 
for US wages and salaries. Quarterly data are linearly interpolated in order to obtain 
monthly wages and salaries data. The average growth rate of wages and salaries in the 
U.S. between January 1987 and July 2000 was 5.7% p.a. (see Table 1) with annual- 
ized volatility of 4.0%. Table | also contains the stock and bond.market descriptive 
statistics in USD, and the currency betas of the indices. 

All foreign currencies except CAD, i.e. GBP, JPY, EUR, and CHF, have volatility of 
about 12% p.a., and all currencies except JPY depreciated against the USD by a little 
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Table ! 
Descriptive statistics 
Mean Volatility Beta Beta Beta Beta Beta 
retum GBP JPY EUR CAD CHF 
Stocks USA 13.47 14.74 0.18 0.05 0.35 —0.59 0.35 
UK 9.97 17.96 0.47 0.36 0.29 0.48 0.14 
Japan 3.42 25.99 0.61 Ell 0.45 0.44 0.43 
EMU countries 10.48 15.80 0.32 0.27 0.26 0.45 0.11 
Canada 5.52 18.07 0.05 —0.02 0.27 —1.44 0.34 
Switzerland 11.56 18.17 0.14 0.32 0.26 0.13 —0.32 
Bonds USA 5.04 4.50 —0.03 0.00 —0.06 —0.04 ~0.06 
UK 6.86 12.51 0.92 0.44 0.80 0.39 0.59 
Japan 3.77 14.46 —0.53 — 1.04 —0.75 0.09 —0.72 
EMU countries 7.78 10.57 —0.69 —0.42 —0.93 —0.06 —0.74 
Canada 5.16 8.44 —0.09 0.03 —0.02 —1.14 0.02 
Switzerland 3.56 12.09 0.67 0.53 1.03 0.19 —0.99 
Exchange GBP 0.11 11.13 l 0.37 0.86 0.42 0.64 
rates in JPY —2.75 12.54 0.48 ] 0.65 0.04 0.61 
USD EUR? 1.13 10.08 0.7 0.42 l 0.1 0.79 
CAD 0.79 4.73 0.08 0 0.02 l —0.02 
CHF 0.32 11.57 0.69 0.52 1.04 —0.13 | 
Wages and salaries 5.71 4.0 0 0.01 0 —0.0I 0 


The stock market data are based on MSC indices and the bond data on JP Morgan indices (Switzerland 
on Salomon Brothers data). The wage and salary growth rate is from datastream. Monthly data between 
January 1987 and July 2000 (163 observations) is used. All coefficients are in USD. The average returns 
and volatilities are in percent per annum. 

“ECU before January 1999. 


more than 0 to 1.13% per year. From a USD viewpoint, for GBP-beta is especially 
high (absolute value) for the UK bond market. The Japanese stock and bond market 
reveal a JPY-beta of —1.11 and —1.04, respectively, and the EMU bond market has a 
EUR-beta of —0.93. Furthermore, the CAD-beta of the Canadian bond market is —1.14, 
the CHF-beta of the Swiss bond market is —0.99. All other countries have substantially 
lower currency betas. Since the betas are close to zero, the wages and salaries in the 
U.S. obviously do not depend on currency movements. 

The investor faces an exposure against five foreign currencies (GBP, JPY, EUR, 
CAD, CHF), and has to invest into eight funds. Five of them are hedge portfolios 
for the state variables, which are assumed to be currency returns. The next step is to 
calculate the compositions of the eight funds. The results appear in Table 2. The major 
holdings in the market portfolio are investments in the US stock market and the EMU 
bond market. Substantial short positions for the tangency portfolio are obtained for the 
Canadian stock and the Swiss bond market. The US bond portfolio is a major part of 
a portfolio providing the best hedge against fluctuations in wages and salaries. More 
than 126% of the liability hedge portfolio are invested in the US bond market. The 
bond markets of the respective currencies dominate the currency hedge portfolios. 
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Table 2 
Optimum portfolios of an internationally diversified pension fund from a USD perspective 


Market Liability hedge Hedge Hedge Hedge Hedge Hedge 


portfolio portfolio portfolio portfolio — portfolio — portfolio portfolio 

GBP JPY EUR CAD CHF 

Stocks USA 83.9 —30.4 3.9 —4.5 —7.5 60.9 —5.l 
UK —14.8 60.6 —31.0 —1L.l —8.6 —85.3 1.9 

Japan —6.7 2.7 6.1 8.9 —24 —4.0 —0.7 

EMU —19.2 —68.3 35.3 12.8 23.5 138.3 —3.8 

Canada —39.2 5.1 14.6 13.6 5.5 —2.6 —0.9 
Switzerl. 21.6 1.5 —24.8 —14.6 —14.7 —100.7 1.0 

Bonds USA 14.8 126.0 —126.4 —28.6 —41.2 —627.3 —35.9 
UK —9.7 —56.8 189.7 7.9 —3.4 97.5 —8.5 

Japan 0.6 39.1 —31.2 133.9 —3.6 —30.1 6.4 

EMU 138.5 9.6 —164 —38.0 97.3 —170.1 34.0 

Canada 7.9 5.0 -118 —32.6 -7.9 679.6 0.9 
Switzzerl. | —77.7 5.9 91.9 42.4 62.9 143.8 110.7 


The portfolio holdings are based on Eq. (9). All portfolio fractions are percentages. A riskless rate of 
interest of 2% per annum is assumed. 


Table 3 

Weightings of the funds due to different funding ratios 

Funding ratio 0.9 l 1.1 1.2 1.3 1.5 
Market portfolio (%) —11.5 0.0 6.3 12.3 18.2 24.6 
Liability hedge portfolio (96) 14.8 13.3 12.1 11.1 10.2 8.9 
Hedge portfolio GBP (96) —0.6 —0.5 —0.5 —0.5 —0.4 —0.4 
Hedge portfolio JPY (96) 1.2 1.1 1.0 0.9 0.9 0.8 
Hedge portfolio EUR (%) 0.3 0.2 0.1 0.0 0.0 —0.1 
Hedge portfolio CAD (96) —0.1 —0.1 —0.1 —0.1 —0.1 -0.1 
Hedge portfolio CHF (%) ll 1.0 0.9 0.8 0.8 0.7 
Riskless assets (%) 94.9 85.1 80,2 75.3 70.4 65.6 
Portfolio beta against GBP —0.01 —0.01 —0.01 —0.01 —0.01 —0.01 
Portfolio beta against JPY 0.02 0.02 0.02 0.02 0.02 0.02 
Portfolio beta against EUR 0.01 0.00 0.00 0.00 0.00 0.00 
Portfolio beta against CAD —0.02 —0.02 —0.01 —0.01 —0.01 —0.01 
Portfolio beta against CHF 0.02 0.01 0.01 0.01 0.01 0.01 


The weightings of the portfolios according to Eq. (16), where « = 0, i.e. log utility, is assumed. 


As derived in Section 3, the holdings of the six funds depend only on the funding 
ratio and on the currency betas of the distinct markets. This is shown in Table 3. 
For a funding ratio of one, there is no investment in the market portfolio and only 
diminishing investments in the currency hedge portfolios. The portfolio betas against 
the five currencies are close to zero for all funding ratios. The higher the funding 
ratio, the higher the investment in the market portfolio, the lower the investments in 
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the liability and the state variable hedge portfolios, and the lower the investment in 
the riskless fund. Negative currency hedge portfolios imply an increase of the currency 
exposure instead of a hedge against it. The increase of the market portfolio holdings 
and the reduction of the hedge portfolio holdings and the riskless fund for increasing 
funding ratios shows that the funding ratio is directly related to the ability to bear risk. 
Rather than risk aversion coefficients, the funding ratio provides an objective measure 
to quantify attitudes towards risk. 


6. Conclusions 


This paper derives a four-fund theorem for intertemporal surplus optimizers such as 
life insurance and pension funds. In addition to the three funds identified by Merton 
(1973), the expected utility maximizing portfolio contains a liability hedge portfolio, 
which is preference independent. Its holdings depend only on the funding ratio of 
a pension fund. The higher the funding ratio, the lower the necessity for liabilities 
hedging. As a practical consequence, the hedging policy of pension or life insurance 
funds could very easily be monitored by authorities. This is due to the fact that in 
the optimum, only the funding ratio is decisive for the liabilities hedge, and not the 
utility function, which hardly can be determined by law. Today's pension fund laws do 
not contain any rules for the treatment of stochastic wage growths. Although Merton 
(1993) addresses a similar problem of the optimal investment strategy for University 
endowment funds, his setting is different. He describes the “liabilities” of universities, 
i.e. the costs for their activities, as state variables. He obtains a preference-dependent 
hedge portfolio for the activity costs. In contrast, here the liability returns are specified 
as a part of the surplus return. In contrast to Merton's results, the hedge portfolio for 
the liabilities is exclusively dependent on the funding ratio (and not on risk preferences) 
of a life insurance or pension fund. The funding ratio is an objective measure, whereas 
risk preferences are hard to determine. 

The model provides an intertemporal] portfolio selection approach for surplus opti- 
mizers. The intertemporal surplus management approach holds for investors who have 
to cover liabilities by their assets in each moment of time. Since the investment strategy 
of all investors is consumption orientated, it is reasonable to assume that all investors 
invest in order to cover their liabilities. If the growth rate of individual consumption 
is related to the growth of wages and salaries, the relevant benchmark for surplus op- 
timizers refers to the growth rate and the volatility of wages and salaries. Finally, this 
model suggests a new type of product for investment banks. Pension and life insurance 
funds need to protect themselves against unanticipated changes in the growth rates of 
wages and salaries. Investment banks could offer liability protection and hedge those 
positions by purchasing a liability hedge portfolio as is given by Eq. (9). 
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Appendix A 


A.l. Derivation of (7) 

The arguments of the J-function are dropped for simplicity. o(d£) summarizes all 
terms of higher order than 1 of dí. For o(dt) : lima, o 0(dt)/dt = 0. Applying Itós 
lemma yields 

J(S, Y, t)  U(S, Y, t)dt + max E,[J(S + dS, Y + dY,t -- dt)] 
=U dt + max E[J +J, dt + Js dS +Jy dY + t Jgs dS? + 1Jyy dY? 
+Jsy dS dY + o(dé)]. 
Transforming the expression above yields 
0 = U dt + max E,[J; dt + Js dS + Jy dY + 5 Jss dS? + 3 Jyy dY* 
+Jsy dS dY + o(dt)], 
0=U dt + max[J; dt + JsE,(dS) + JyE,(dY) + }Jss dS dS 
+ LJyy dY dY + Jsy dS Y], 


and substituting in the expressions in (4) yields (7). 


A.2. Derivation of (8) 


Denote the prices of the n risky assets by Aj(£), i=1,...,7, and suppose they follow 
the processes 


dA;(1) = A;(0) [ni dt + e; dZi(t)], i-l,..n, 
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which defines the vector of risky expected asset returns m4 :— (uj,..., Hn). The Wiener 
processes Zi(£), i — 1,...n, are correlated according to 


dZi(() dZj(1) = i dt, i £j, 


where p;; is the instantaneous correlation between Z,(f) and Z;(¢). If the covariance 
between assets i and j is denoted by ø;;, then we have cj; — c;0;p;; and the covariance 
matrix V has entries G; Le. 


Gir co Gln 


On) ttt Onn 
Suppose the fund holds x;(£) shares of asset i at time ¢, and that the cash in the asset 


portfolio at time ¢ is B(f). Then the asset portfolio value A(£) is given by (A;(1): price 
of asset i) 


AQ) = XO x(OAQ0) + B). 


i=l 


In the next infinitesimal time interval dz, this evolves according to 


n 


dA(t) = $ x(O) dA) + ABU) = $ x(t) dA) + rBO) di 


i=l i=l 


where r is the riskless interest rate. Define the vector of portfolio fractions w' = 
(Q1,..., 04) by 


Xxj;(t Ai t 
e(t) :— (ACD i=l,...,f. 
A(t) 
The sum of risky and riskless investments is assumed to be |. Hence, the fraction 
invested in the riskless asset is B(1)/A(t) = | — $; cx. Therefore, the expression 


above for dA(t) becomes 


H 


dA(1) = A(t) X oiu — 7) ridi + M w0; dZQ) 


is! i=l 
Comparing the above expression with Eq. (2), i.e. dA(t) 2 A(£)[u4 dt + a4 dZ4(1)], and 
assuming that e’ :— (1,..., 1) is the n-dimensional vector of ones, gives 


a4 dZ4(1) = Ñ wic; AZ(1). 
ist 
From this follows that 


o? =w Vw, 
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where V is the covariance matrix given earlier. Furthermore, define cj;(c;y) as the 
covariance between the ith risky asset and the liabilities (state variable). Denote the 
vector of these covariances by 


YA = (TiL, Onh) Viy = (Ciy, Ony 


Then the relationship ca, = w'v4; and Gay = W'Yay is obvious. From the last four 
representations of expected returns, variances, and covariances Eq. (8) follows. 


4.3. Discussing Eq. (9) 


Maximizing the covariance between the asset portfolio and the state variable, 


Ri 
dY 
Cov|w | : Uy. Way max st. w'Vw — 47%, 
R, 
where Ri,..., Rn refer to the n asset returns. Let A be a Lagrange multiplier: 


L — w'vay — A(w' Vw — o2) 
=>  OL/Ow — vay — 22- Vw — 0 
=> weVovyy/QA). 
aL/(Owy « 0 if and only if V is positive definite. 


Hence, wy (the state variable hedge portfolio) maximizes the correlation between the 
asset portfolio and the state variable. Furthermore, 


24- w'Vw 2 w'vay e 2402 = ayy 


GAY - 
=> 24 = -7 = Bay > V ‘vay = BayW. 
A 


Multiplying the fractions of the asset portfolio by the regression coefficient fy provides 
the fractions of the hedge portfolio. 
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THE SYMMETRIC DownsiDe-Risk SHARPE RATIO 


he Sharpe ratio is a very useful measure of 

investment performance. Because it is based on 

mean-variance theory, and thus is basically 

valid only for quadratic preferences or normal 
distributions, skewed investment returns can lead to mis- 
leading conclusions. This is especially true for superior 
investors with many high returns. Superior investors may 
use capital growth wagering ideas to implement their 
strategies, which produces higher growth rates but also 
higher variability of wealth. 

My simple modification of the Sharpe ratio to 
assume that the upside deviation is identical to the down- 
side risk provides a useful modification and gives more 
realistic results. 

Exhibit 1 plots wealth levels using monthly data 
from December 1985 through March 2000 for the Wind- 
sor Fund of George Neff, the Ford Foundation, the 
Tiger Fund of Julian Robertson, the Quantum Fund of 
George Soros, and Berkshire Hathaway, the fund run by 
Warren Buffett, as well as the S&P 500 total return index, 
U.S. Treasuries, and T-bills. Yearly data are shown in 
Exhibit 2. 

The means, standard deviations, and Sharpe [1966, 
1994] ratios of these six funds, based on monthly, quar- 
terly, and yearly net arithmetic and geometric total return 
data, are shown in Exhibit 3. Shown as well are data on 
the Harvard endowment (quarterly) plus U.S. Treasuries, 
T-bills, and U.S. inflation, and number of negative months 
and quarters. 

The first panel in Exhibit 3 shows the data behind 
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The Sharpe ratio, a most useful measure of investment per- 
formance, has the disadvantage that it is based on mean- 
variance theory and thus is valid basically only for quadratic 
preferences or normal distributions. Hence skewed invest- 
ment returns can engender misleading conclusions. This 
is especially true for superior investors with a number of 
high returns. Many of these superior investors use capital 
growth wagering ideas to implement their strategies, 
which means higher growth rates but ako higher variability 
of wealth. A simple modification of the Sharpe ratio to 
assure that the upside deviation is identical to the down- 
side risk gives more realistic results. 
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EXHIBIT 1 


Growth of Assets—Monthly Data December 1985~April 2000 


16 o 


1 [AID] 
2 


1000) B 


J E 


Exhibit 1, which illustrates the high mean returns of Berk- 
shire Hathaway and the Tiger Fund as well as that the Ford 
Foundation’s standard deviation was about a third of Berk- 
shire’s, The Ford Foundation actually trailed che S&P 500 
mean return. We can see the much lower monthly and 
quarterly Sharpe ratios compared to the annualized values 
based on monthly, quarterly, and yearly data. 

By the Sharpe ratio, the Harvard endowment and 
the Ford Foundation had the best performance, followed 
by the Tiger Fund, the S&P 500 total return index, Berk- 
shire Hathaway, Quantum, Windsor, and U.S. Treasuries. 
The basic conclusions remain the same according to 
monthly or quarterly data and arithmetic or geometric 
means. Because of data smoothing, the Sharpe ratios with 
yearly data usually exceed those with quarterly data, 
which in turn exceed the monthly calculations. 

The reason for this ranking is that the Ford Foun- 
dation and the Harvard endowment, with less growth, also 
had much less variability. These endowments have different 
purposes, different investors, different portfolio managers, 
and different fees, so such differences are not surprising. 

Clifford, Kroner, and Siegel [2001] give us similar 
calculations for a larger group of funds. They also show 
that over July 1977-March 2000, Berkshire Hathaway's 
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Date 


Sharpe ratio was 0.850, Ford’s 0.765, and the S&P 500's 
0.676, The geometric mean returns were: 32.07% (Buf- 
fett), 14.88% (Ford), and 16.71% (S&P 500). 

Exhibit 4 shows Buffett’s returns ona yearly basis in 
terms of increase in per share book value of Berkshire 
Hathaway versus the S&P 500 total return yearly index 
values for the 40 years 1965-2004. Buffett's geometric and 
arithmetic means were 22.02% and 22.84%, respectively, 
versus 10.47% and 11.83% for the S&P 500. This mea- 
sure does not fully reflect the trading prices of Berkshire 
Hathaway shares and thus yearly net returns, but it does 
indicate that Buffett has easily beaten the S&P 500 over 
these 40 years with a 286,841% increase versus the S&P 
500's 5,371% increase. 

Typically the Sharpe ratio is computed using arith- 
metic returns. This is because the basic static theories of 
portfolio investment management such as mean-variance 


“analysis and the capital asset pricing model are based on 


arithmetic means. These are static, one-period theories. 
For asset returns over time, however, the geometric mean 
is a more accurate measure of average performance because 
the arithmetic mean is biased upward. 

The geometric mean helps mitigate the autocorre- 
lated and time-varying mean and other statistical proper- 
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EXHIBIT 2 
Yearly Return Data (%) 


Berkshire S&P $00 
Date Windsor Hathaway Quantum Tiger Ford Harvard — Total 
Yearly data, 
14 years 

Neg 2 2 l 0 i l 1 

years 

Dec - 20.27 14.17 42.2 26.83 1809 2216 1847 
86 

Dec- 123 4.61 14.13 728 520 1246 523 
87 

Dec - 28.69 59.32 10.13 15.76 10.42 12.68 16.81 
88 

Dec - 15.02 84.57 35.21 2432 22.15 15.99 31.49 
89 

Dec-  -15.50 -23.05 23.80 557 196 -1.0) 347 
90 

Dec- 28.55 35.58 50.58 37.59 2292 15.73 3055 
91 

Dec 16.50 29.83 6.37 842 5.26 4.88 7.67 
92 

Dec - 19.37 38.94 33.03 2491 1307 21.73 9.99 
93 

Dec - -0.15 24.96 3.94 171 -1.96 3.71 L3 
94 

Dee - 30.15 57.35 38.98 3434 26.47 2499 37.43 
95 

Dec - 26.36 623 -1.50 8.03 1539 2647 23.07 
96 

Dec - 21.98 34.90 17.09 1879 19.11 2091 3336 
97 

Dec - 0.31 52.17 12.46 11.21 2139. 1244 28.58 
98 

Dec - 11.57 -19.86 34.68 2744 2759 2378 21.04 
99 


US 


Trea 


means are used rather than arithmetic 
means; the difference between these two 
measures is a function of return volatil- 


us us ity. The basic conclusions in this research, 
bas inf such as the relative ranking of the vari- 
ous funds, are the same for the arith- 
0 0 metic and geometric means. 
Exhibit 5 shows that the Harvard 
616 1D Investment Company, that great school's 
547 4.4 endowment, had essentially the same 
635 442 wealth record over time as the Ford 
837 465 Foundation. This conclusion is based on 
quarterly data, which are all I have on 
781 6 Harvard. Harvard beats Ford by the ordi- 
5.60 3.06 nary Sharpe ratio, but Ford is better by 
351 290 the symmetric downside risk measure | 
develop later. 
290 235 . " 
Before evaluating positive and neg- 
390 267 ative returns performances of these var- 
560 2.54 ious funds using the Sharpe ratio and my 
S233 modified version, it is useful to discuss 
how these funds got their outstanding but 
526 130 sometimes volatile records. 
486 161 
468 268 SOME GREAT INVESTORS 


ties of returns that are not independent and identically dis- 
tributed. If one has returns of +50% and —5096 in two peri- 
ods, for example, the arithmetic mean is zero, which 
does not reflect the fact that 100 became 150 and then 
75. The geometric mean, which is -13.7%, is the correct 
measure to use. 

For investment returns in the 1096-1596 range, the 
arithmetic returns are about 2 percentage points above the 
geonietric returns. But for higher returns, this approxi- 
mation is not accurate. Hence, I use geometric means as 
well as more typical arithmetic means in this article. 

Lo [2002] points out that we must take care in mak- 
ing Sharpe ratio estimations when tlie investment returns 
are not iid, which they are for the investors | discuss here. 
For dependent but stationary returns Lo derives a cor- 
rection of the Sharpe ratios that deflates artificially high 
values to correct values using an estimation of the corre- 
lation of serial returns. 

Miller and Gehr |1978] and Knight and Satchell 
[2005] derive exact statistical properties of the Sharpe 
ratio with normal and lognormal assets, respectively. The 
Sharpe ratios are almost always lower when geometric 
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Ideally, we would want to penalize 
only losses such as those shown in Exhibit 
6, while rewarding positive returns. The Sharpe ratio 
penalizes high-return but volatile records. 

In the theory of optimal investment over time, it is 
not quadratic (the utility function behind the Sharpe 
ratio) but log that yields the most long-term growth. But 
the elegant results in the Kelly [1956] criterion, as it is 
known in the gambling literature, and the capital growth 
theory, as it is known in the investments literature, as 
proven rigorously by Breiman [1961] and generalized by 


` Algoet and Cover [1988], are long-run asymptotic results 


(see Hakansson and Ziemba [1995], Ziemba [2003], and 
MacLean and Ziemba [2005]). 

The Arrow-Pratt absolute risk aversion of the log 
utility criterion: 


is essentially zero, where u is the utility function of wealth 
w, and the primes denote differentiation. Hence, in the 
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EXHIBIT 3 


Fund Return Data—December 1985-A pril 2000 


Windsor 


Monthly data, 172 months 


Neg 61 
months 
arith 1.17 
mean, 
mon 
81 dev, 4.70 
mon 
Shame, 0.157 
mon 
arith — 14.10 
mean 
si dev 16.27 
Sharpe, 0.543 
yr 
geomean, 1.06 
mon 
geo st 4.70 
dev,mon 
Sharpe, 0.133 
mon 
geo 12.76 
mean, yr 
Beo st 16.27 
dev, yr 
Sharpe, 0.460 
yr 


Quarterly data, 57 quarters 


Neg 14 

quarters 

mean, 3.55 
quy 

stdev, — 8.01 
atly 


mean, yr [4.20 

st dev, yr 16.03 

Sharpe, 0.556 
yr 


geomean, 3.23 
qtly 
gto st 8.02 
dev,qtly 


geo — 12.90 
mean, yr 
gcost 16.04 
dev, yr 
Sharpe, 0.475 
yr 


Berkshire 


Hathaway 


58 


0.186 


2238 
26.56 


0.644 


15 
6.70 


14.75 


26.81 
29.50 
0.729 


5.67 


14.79 


22.67 
29.58 


0.588 


Yearly Data, 14 years 


Neg 2 
years 
mean, 14.63 


stdev, 13.55 
Sharpe, 0.681 


yrly 
geom — 13.83 
mean 
stdev 13.58 
Sharpe 0.621 
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30.57 
0.641 


Quantum 


53 


25.72 


0.486 


25.38 


0.571 


16.20 
1.022 


Tiger 


56 


2.02 


6.24 


0.54 


24.27 


21.62 
0.879 


21.63 


0.770 


15.41 


0.713 


11.41 
1.064 


Ford 


Found 


2.69 


9.43 


0.954 


9.39 
0.962 


Harvard 


na 


na 


na 


na 


9.45 


1.029 


8.53 
1.146 


S&P500 
Toral 


56 


15.06 


0.764 


12.90 
0.981 
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4.58 


0.48 
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EXHIBIT 4 

Increase in Per Share Book Value of Berkshire 
Hathaway versus S&P 500 (dividends included) 
1965-2004 (%) 


Year BH S&P500 Diff Year BH S&P 500 Diff 
1965 23.8 10.0 13.8 1985 48.2 316 16.6 
1966 20.3 (1.7) 320 1986 26.1 18.6 75 
1967 1.0 30.9 (19.9) 1987 19.5 5.1 14.4 
1968 19.0 11.0 8.0 1988 20.1 16.6 3.5 
1969 16.2 (84) 246 1989 444 317 127 
1970 12.0 3.9 8.1 1990 7.4 (3.1) 10.5 
1971 16.4 14.6 1.8 199] 396 30.5 9.1 
1972 21.7 18.9 2.8 1992 203 7.6 12.7 
1973 4.7 (14.8) 19.5 1993 143 10.1 42 
1974 5.5 (26.4) 31.9 1994 13.9 L3 12.6 
1975 21.9 37.2 (15.3) 1995 43.1 37.6 5.5 
1976 59.3 23.6 35.7 1996 231.8 23.0 8.8 
1977 31.9 (7.4) 393 1997 34.1 33.4 7 
1978 24.0 6.4 17.6 1998 483 28.6 19.7 
1979 35.7 18.2 17.5 1999 5 21.0 (20.5) 
1980 19.3 323 (130) 2000 £65 (9.1) 156 
198] 31.4 (5.0) 364 2001 (62) (119) $7 
1982 40.0 21.4 186 2002 100 (22.1) 32.1 
1983 32.3 224 99 2003 210 287 (7.7) 
1984 13.6 6.1 75 2004 10.5 10.9 (0.04) 
Overall Gain 286,841 — 5,371 
Arithmetic Mean 22.84 11.83 11.01 


Geometric Mean 22.02 10.47 10.07 


Sources: Berkshire Hathaway 2004 annual report, Hagstrom [2004]. 


EXHIBIT 5 


Ford Foundation and Harvard Investment 
Corporation Retums—Quarterly Data 
June 1977-March 2000 


320. 
— Herea 
Ford 
| : 
8 
Hi 
3 10 : 
H 
E 
; ae 


s uu PP aa Pye a i 
short run, log can be an exceedingly risky utility func- 


tion with wide swings in wealth values because the opti- 
mal bets can-be so large. 

Long-run exponential growth is equivalent to maxi- 
mizing the one-period expected log of that period’ returns. 
To illustrate how large Kelly (expected log) bets are, con- 
sider the simplest case with Bernoulli trials where you win 
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with probability p and lose with probability q = 1 - p. 

Log utility is related to negative power utility, namely, 
for Qu? for @<0 since negative power converges to log 
when @ — 0. Kelly [1956] discovered that log utility 
investors had the best utility function, provided they 
were very long-run investors. The asymptotic rate of 
asset growth is 


3M 
G = lim Z EJ 
Now Wo 


where wy is period N's wealth and wy is initial wealth. 

Consider Bernoulli trials that win +t with proba- 
bility p and lose -1 with probability 1 — p. If we win M 
out of N of these independent trials, the wealth after 
period N is: 


w, =m + $y" -pr 


where fis the fraction of our wealth bet in each period. 
Then: 


N-M 


GU) = lim EZ E log( ~ f) 


which by the strong law of large numbers is 


G(f) = plog(1 + f) + glog(1 - f) = E(logw) 


Hence, the criterion of maximizing the long-run 
exponential rate of asset growth is equivalent to maxi- 
mizing the one-period expected logarithm of wealth. 
Hence, to maximize long-run (asymptotic) wealth, max- 
imizing expected log is the way to do it period by period. 

The optimal fractional bet, obtained by setting the 
derivative of G(f) to zero, is f^ = p — q, which is simply 
the investor edge or expected gain on the bet. If the bets 
are win O + 1 or lose t—that is, the odds are Oto 1 to 
win—the optimal Kelly bet is f* = A orthe Es Since 
edge is a mean concept and odds is a risk concept, you 
wager more with higher mean and less with higher risk.' 

In continuous time: 


HA-r edge 
risk(odds) 


with optimal growth rate 
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EXHIBIT 6 


Summary— Negative Observations and Arithmetic and Geometric Means 


Berkshire Ford 
Number of Windsor Hathaway Quantum Tiger Found Harvard Total Treas 
negative 
...months out 61 58 53 56 44 na 
of 172 
„quarters out 14 15 16 11 10 1 
of 57 
years out of 2 2 1 0 1 I 


14 


S&P 


56 


small-cap stocks under Democrats 
had 24.5 times as much wealth as a 
60-40 large-cap/bond investor. 
That’s the idea, more or less. 

Keynes and Buffett are essen- 
tially Kelly bettors. Kelly bettors will 
15 have bumpy investment paths, but 
most of the time, in the end, accu- 
mulate more money than other 
investors. 


Us 


54 


1 y 
z (Sharpe Ratio)’ + Risk-Free Asset 


where 4 is the mean portfolio return, r is the risk-free 
return, and O? is the portfolio return variance. So the ordi- 
nary Sharpe ratio determines the optimal growth rate. 

Kelly bets can be large. Consider Bernoulli trials 
where you win 1 or lose 1 with probabilities p and 1 — p, 
respectively. Then: 


p.5.51.6.8.9.99 
1-p.5.49 42.1.01 
f’ 0.02.2.6.8.98 


So, if the edge is 98%, the optimal bet is 98% of one’s 
fortune. With longer-odds bets, the wagers are lower. 

The Kelly bettor is sure to win in the end if the hori- 
zon is long enough. Breiman [1960, 1961] was the first 
to clean up the math in Kelly’s [1956] and Latané’s [1959] 
heuristic analyses. He proves that: 


Jnd Pas 
Nw w,(N) 


where wy, (N) and wp (N) are the wealth levels of the 
Kelly bettor and another essentially different bettor after 
N play. That is, the Kelly bettor wins infinitely more 
than bettor B and moves farther and farther ahead as the 
long time horizon becomes more distant.” 

Breiman also shows that the expected time to reach 
a preassigned goal is asymptotically the shortest with an 
expected log strategy. Moreover, the ratio of the expected 
log bettor’s fortune to that of any other essentially differ- 
ent investor goes to infinity. The log investor gets all the 
money in the end if one plays forever. 

Hensel and Ziemba [2000] calculate that over 1942 
through 1997 a 100% long investor solely in large-cap 
stocks under Republican administrations and solely in 
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Ziemba and Hausch [1986] 
perform a simulation to show medium-run properties of 
log utility and half-Kelly betting (using the -w utility 
function). Starting with an initial wealth of $1,000 and 
considering 700 independent wagers with probability of 
winning 0.19 to 0.57, all with expected values of $1.14 
per dollar wagered, they compute the final wealth pro- 
files over 1,000 simulations. These wagers correspond to 
odds of 1-1, 2-1, 3-1, 4-1, and 5-1. 

The results show that the log bettor has more than 
100 times initial wealth 16.6% of the time and more than 
50 times as much 30.2% of the time. This demonstrates 
the great power of log betting, as the half-Kelly strategy 
has very few such high outcomes. Yet this high return 
comes at a price. Despite making 700 independent bets, 
each with a 14% success rate, the investor could have lost 
more than 98% of the initial wealth of $1,000. 

Exhibit 7 provides the simulation results. The 
Ziemba-Hausch simulation used the data in Exhibit 8. The 
edge over odds gives f* equal to between 0.14 and 0.028 
for the optimal Kelly wagers for 1-1 versus 5-1 odds bets. 

"The 18 in the first column in Exhibit 7 shows it is 
possible for a Kelly bettor to make 700 independent 
wagers, all with a 14% edge, having 1996 to 57% chance 
of winning each wager, and still lose over 98% of one's 
wealth. Even with half-Kelly, the minimum starting with 
$1,000 was $145, or a 85.5% loss. This shows the effect 
of a sequence of very bad scenarios that may be unlikely 
but are certainly possible. 

The last column in Exhibit 7 shows that 16.696 of 
the time the Kelly bettor increases initial wealth more than 
100-fold. The half-Kelly strategy is much safer, as the 
chance of being ahead after the 700 wagers is 95.4% ver- 
sus 87.0% for full-Kelly. But the growth rate is much lower, 
since the 16.696 chance of making 100 times initial wealth 
is only 0.196 for half-Kelly wagerers. The Kelly bettor 
accumulates more wealth but with a much riskier time 
path of wealth accumulation. The Kelly bettor can take 
a long time to get ahead of another bettor. 
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Distributions of Final Wealth—Kelly and Half-Kelly Wagers 


Final Wealth 
Suategy Min Max Mean | Median 
Kelly 18 483,883 48,135 | 17,269 
Half-Kelly 145 111,770 13,069 8,043 
EXHIBIT 8 
Value of Odds on Wagers 
Probability of Odds Probability of Optimal Kelly 
Winning Being Chosen in Bets Fraction 
the Simulation at of Current 
Each Decision Wealth 
Point 
0.57 1-1 0.1 0.14 
0.38 2-1 0.3 0.07 
0.285 3-1 0.3 0.047 
0.228 4-1 0.2 0.035 
0.19 5-1 0.1 0.028 


EXHIBIT 9 
Place-and-Show Betting on Kentucky Derby 1934-1994 


Source: Bain, Hausch, and Ziemba [2005]. 


What one wants is to have those swings in the right 
direction. And who better, it seems, at predicting these 
swings than Warren Buffett. Chopra and Ziemba [1993] 
show that, in portfolio problems, errors in estimating 
means, variances, and covariances influence investment 
performance roughly in the ratio 20:2:1. When risk aver- 
sion is lower, as it is with log, the errors are even greater. 
In this case, the effect on portfolio performance of mean 
errors can be 100 times the errors in covariances. 

Who then would use a log utility function i£ it is so 
risky, or use a toned-down log utility function by mixing 
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Number of Times Final Wealth out of 1000 Trials Was 


>500 »)000 10,000 250,000  »100,000 
916 870 598 302 166 
990 954 480 30 1 


the log utility investment fraction of one’s wealth with cash? 
These so-called fractional Kelly strategies are actually 
mathematically equivalent to negative power utility when 
the assets are lognormally distributed and approximately 
equivalent otherwise; see MacLean, Ziemba, and Li [2005]. 

The difference in wealth paths between Kelly and 
half-Kelly strategies is illustrated in Exhibit 9, which shows 
the wealth level histories from place-and-show betting on 
the Kentucky Derby over 1934-1994 following the DrZ 
system. This system uses a 4.00 dosage index breeding fil- 
ter rule with full- and half-Kelly wagering and $200 flat bets 
on the favorite, with initial wealth of $2,500 (dosage is a mea- 
sure of a horse’s speed over stamina ratio). Starting with 
$2,500, full-Kelly yields a final wealth of $16,861, while 
half-Kelly, with a much smoother path, has final wealth of 
$6,945. These two strategies for the Kentucky Derby are 
winning ones compared to the losing strategy of betting on 
the favorite, which turns the $2,500 into $480. 

Some rather good investors, including four l! have 
worked with or consulted with, have used the Kelly and 
fractional Kelly approach to turn humble starts into for- 
tunes of hundreds of millions. One was the world’s most 
successful racetrack bettor; see Benter [1994]. 

Another was a trend-following futures trader in the 
Caribbean for whom I designed a Kelly betting system for 
the 90 liquid futures markets he traded. The Kelly system 
added $9 million extra profits per year to his already good 
betting system based on an ad hoc but sound probability- 
of-success approach. Another was an options trader eking 
out nickels and dimes with slightly mispriced options in 
Chicago. 

The fourth was the popularizer of the Kelly approach 
in sports betting, Edward O. Thorp. Thorp’s Princeton- 
Newport Fund had a net mean return of 15.1%, when 
the S&P 500’s was 10.2% and T-bills returned 8.1%. 
Interest rates were very high in 1968-1988 while Thorp 
was running his fund. Thorp had no losing quarters, only 
three losing montlis, and a most impressive yearly stan- 
dard deviation of 4%. 

To show the differences between the gamblers and 
other investors, Exhibit 10 shows Benter’s racetrack bet- 
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EXHIBIT 10 
Gamblers Like Smooth Wealth Paths 
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(3) Hong Kong racing syndicate 1989 to 1994, 
Sec Benter [1994]. 


EXHIBIT 11 


Princeton-Newport Partners—Cumulative Results 
November 1968-December 1988 


Price (U.S. dollars) 
15 . 
14 
13 
12 
n 
10 


5 a 


Ror Nnywse MW aunw 


18 10/73 10/77 10/81 10/85 12/88 
PNP ees S&P 500 US. T-BIN 


ting record over 1989-1994 and 1989-2001, and Exhibit 
11 shows Thorp’ record over 1968-1988. Compare these 
rather smooth graphs with the brilliant but quite volatile 
record in Exhibit 12 of the eminent economist, John May- 
nard Keynes, who ran the King’s College Chest Fund, the 
college’s endowment, from 1927 until his death in 1945. 

Keynes lost more than 50% of his fortune during the 
difficult years of the depression around the 1929 crash. This 
bad start was followed by many years of outperformance 
on relative and absolute terms, so that by 1945 Keynes’s 
geometric mean return was 9.12% versus the U.K. index 
of -0.89%. Keynes's Sharpe index was 0.385. 

The gamblers have several common characteristics: 
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(b) Hong Kong racing syndicate 1989 to 2001 
See Benter [2001]. 


* They carefully developed anomaly systems with 

positive means. 

* They carefully developed computerized betting 

systems that automated the betting process. 

* They constantly updated their research. 

* They were more focused on not losing than on 
winning in their careful risk control. 


Thorp [2006] shows that the Buffett trades are actu- 
ally very similar to those of a Kelly trader. In Ziemba 
[2003] I find Keynes is well approximated as a fractional 
Kelly bettor with 80% Kelly and 20% cash; this is equiv- 
alent to the negative power utility function 2? Recall 
log is when the power coefficient goes to zero, and that 
log is the most risky utility function that should ever be 
considered. Positive power utility has less growth and 
more risk, so is not an acceptable utility function. 


SYMMETRIC DOWNSIDE SHARPE 
RATIO PERFORMANCE MEASURE 


Now back to the records of the funds in Exhibit 1. 
How do we compare these various investors? And can we 
determine if Warren Buffett really is a. better investor 
than the rather good funds, especially the Ford Founda- 
tion and the Harvard endowment? 

Exhibit 13 plots the Berkshire Hathaway and Ford 
Foundation monthly returns as a histogram and shows the 
losing months and the winning months in a smooth 
curve. We want to penalize Buffett for losing but not for 
winning. We define the downside risk as: 


THe JOURNAL OF PORTFOLIO MaNacEMENT 115 


778 W. T. Ziemba 


EXHIBIT 12 
King's College Chest Fund 1927-1945 
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EXHIBIT 13 
Berkshire Hathaway versus Ford Foundation—Monthly Returns Distribution January 1977-April 2000 


20.00 


6.00 


em E Suid 


50 57 64 7i 78 95 92 99 106 113120 127 124 141 148 1$5 162 169 


-10.00 


, — Berkshire Hathaway 


; —— Ford Foundation : 
-30.00 E pa =: dale ae T ee aa 


116 Tut SYMMETRIC DowNsiDE-Rusk SHARPE RATIO Fatt 2005 


The Symmetric Downside-Risk Sharpe Ratio 


EXHIBIT 14 
Comparison of Ordinary and Symmetric Downside 
Sharpe Yearly Performance Measures 


Ordinary Downside 


Ford 0.970 0.920 
Foundation 
Tiger Fund 0.879 0.865 
S&P500 0.797 0.696 
Berkshire 0.773 0.917 
Hathaway 
Quantum 0.622 0.458 
Windsor 0.543 0.495 
a 2 
gi Die, -3) 
j n-i 


where the benchmark x is zero, i is the index on the n 
months in the sample, and the x, taken are those below x, 
namely, those m of the n months with losses. This is the 
downside variance measured from zero, not the mean, so 
itis more precisely the downside risk. To get the total vari- 
ance, we use twice the downside variance (202), so that 
Buffett gets the symmetric gains added, not his actual gains. 
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Berkshire Hathaway versus the much less volatile Ford 
Foundation returns. When Berkshire Hathaway had a los- 
ing month, it averaged —5.36% versus +2.15% for all months. 
Meanwhile, Ford lost 2.44% and won, on average, 1.19%. 

Exhibit 15 shows the histogram of quarterly returns 
for all funds including Harvard (for which monthly data 
are not available). The plots show that the distributions 
of all the funds lie between those of Berkshire Hathaway, 
Harvard, and Ford. 

By the quarterly data, the Harvard endowment has 
a record almost as good as the Ford Foundation’s; see 
Exhibits 16 and 17. Berkshire Hathaway made the most 
money but also took more risk, and by either the Sharpe 
or the downside Sharpe measure the Ford Foundation and 
the Harvard endowment had superior rewards. 

I first used this measure in Ziemba and Schwartz 
[1991] to compare the results of superior investment in 
Japanese small-cap stocks during the late 1980s. The 
choice of x = 0 is convenient and has a good interpre- 
tation. But other x are possible and might be useful in other 
applications. This measure is closely related to the Sortino 
ratio (see Sortino and van der Meer [1991] and Sortino 


Using 207, the usual Sharpe ratio 
with monthly data and arithmetic 
returns is: 


EXHIBIT 15 


Quarterly Returns Distributions December 1985-March 2000 


R-R 0.500 


0.400 


Exhibit 14 provides the results 
for the ordinary and symmetric 0.300 | 
downside Sharpe ratios using monthly 
data and arithrnetic means. 

My measure moves Warren 
Buffett higher, to 0.917, but not up 
to the Ford Foundation and not 0.100 
higher because of his high monthly 
losses. He does gain in the switch 


0.200 | 


from ordinary Sharpe to downside 
symmetric Sharpe, while all the other 
funds drop. Ford is 0.920, and Tiger (109) 


0.865. The Berkshire Hathaway 
monthly losses when annualized are 


over 64% versus under 27% for the (0:00) 


Ford Foundation. 


Exhibit 14 shows these rather fat (0.300) 
tails on the up side and down side of 
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EXHIBIT 16 


Yearly Sharpe and Symmetric Downside Sharpe Ratios December 1985— 


April 2000 
Berkshire Ford S&P 
Windsor Hathaway Quantum Tiger Found Harvard Total 
qtly data, 57 
quarters 
neg qts 14 15 16 ll 10 I 10 
mean, neg -6.69 -10.50 -7.77 -6.26 -359 -281 -6.92 
mean, qtly 3.55 6.70 5.70 4.35 3.68 3.86 448 
ds st dev, qtly 10.52 14.00 12.09 8.18 4.44 5.12 9.89 
mean, yr 14.20 26.81 2239 1742 14,71 15.44 17.91 
ds st dev, yr 21.04 28.00 24.17 1635 889 1024 19.78 
ds Sharpe 0.424 0.269 0.724 0.742 1.060 0.991 0.638 
geomean, qtly 3.23 5.67 494 407 3.57 3.75 4.20 
ds geo stdevqtly — 10.20 13.49 11.80 771 3.91 4.99 9.34 
geo mean, yr 12.90 22.67 1978 16.28 1429 [5.01 16.80 
ds geo st dev, yr — 20.41 26.97 23.60 15.43 7.83 9.97 18.68 
ds Sharpe 0.373 0.644 0.614 0.712 1150 0.975 0.616 


and Price [1994]), which considers downside risk only. 
That measure does not have the two-sided interpretation 
of my measure, and the v2 does not appear. 

The notion of focusing on downside risk is popular 
these days as it represents real risk better. I started using it 


in asset-liability models in the 1970s; 
see Kallberg, White, and Ziemba 
[1982] and Kusy and Ziemba [1986] 
for early applications. In those mod- 
els we measure risk as the downside 
non-attainment of invesunent target 
goals that can be deterministic such as 
wealth growth over time, or stochas- 
tic such as the non-attainment of a 
portfolio of weighted benchmark 
returns. See Geyer et al. [2003] for an 
application of this to the Siemens Aus- 
tria pension fund? 

Calculating the Sharpe ratio and 
the downside-symmetric Sharpe ratio 
using quarterly or yearly data does not 
change the results much, even though 
it smooths the data because individual 
monthly losses are combined with gains 
for lower volatility. The yearly data 
move us closer to normally distributed 
returns so the symmetric downside and 
ordinary Sharpe measures will yield 
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-1.35 

1.93 

1.35 


7.73 
2.70 
0.903 


1.90 
1.28 


7.59 
2.56 
0.900 


similar rankings. 

In Exhibit 2, I showed the yearly 
returns for the various funds and their 
Sharpe ratios computed using arith- 
metic and geometric means and the 
yearly data. There are insufficient data 
to compute the downside Sharpe 
ratios based on yearly data. The Ford 
Foundation had only one losing year, 
and that loss was only 1.96%. Berkshire 
Hathaway had two losing years with 
losses of 23.1% and 19.9%. The Ford 
Foundation had a higher Sharpe ratio 
than Berkshire Hathaway, but that was 
exceeded by the Tiger and Quantum 
funds and the S&P 500. 

Exhibits 17 and 18 summarize 
the annualized results using monthly, 
quarterly, and yearly data. The Ford 
Foundation had the highest Sharpe 
ratio, followed by the Harvard 


endowment, and both of these 


exceeded Berkshire Hathaway. The Ford Foundation had 
the highest symmetric-downside Sharpe ratio, followed by 
Harvard, and both exceeded Berkshire Hathaway and the 
other funds. 


EXHIBIT 17 


Summary of Means (%) and Sharpe and Symmetric Downside 
Sharpe Ratios Annualized 


mean (arith, mon) 
Sharpe (arith, mon) 


ds Sharpe (arith,mon) 


mean (geom, mon) 


Sharpe (geom, mon) 
ds Sharpe (geom, mon) 


mean (arith, qtl y) 
Sharpe (arith, qtly) 


ds Sharpe (arith, qtly) 


mean (geom, qtly) 
Sharpe (gcom, qlty) 


ds Sharpe (geom, qtly) 


mean (arith, yrly) 
Sharpe (arith, yrly) 


mean (geom, yrly) 
Sharpe (geom, yrly) 
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Berkshire Ford S&P 
Windsor Hathaway Quantum Tiger Found Harvard Total Trea 
14.10 25.77 2125 2427 1429 na 1744 7.57 
0.543 0.773 0.622 0.879 0.970 na 0.797 0.504 
0.495 0.917 0.458 0.865 0.920 ba 0.696 0.631 
12.76 22.38 1776 2192 13.86 na 1625 747 
0.460 0.644 0.486 0.770 0.924 na 0.719 0.482 
0.420 0.765 0.358 | 0.758 0.876 na 0.628 1,053 
14.20 26.81 229  Á 1742 1471 15.44 1791 773 
0.556 0.729 0.691 0.788 | 0.999 1.074 0.839 0.456 
0.424 0.769 0.724 0.42 1.060 0.991 0.638 0.903 
12.90 22.67 19.78 16.28 14.29 15.00 16.80 7.59 
0.475 0.588 0.571 0.713 0.954 1.029 0.764 0431 
0.373 0.644 0.6644 0.712 1.150 0.975 0.616 0.900 
14.63 28.55 2293 18.04 14.79 15.47 18.70 2797 
0.681 0.763 1.084 1.109 1.001 1.181 1.033 0.390 
13,83 24.99 21.94 17.54 14.43 15.1? 1804 7.8 
0.621 0.641 1.022 1.064 0.962 1146 0.981! 0362 
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EXHIBIT 18 


Summary of Means and Sharpe and Downside Sharpe—Monthly, Quarterly, and Yearly Data Annualized 


December 1985-March 2000 
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EXHIBIT 19 
Asset Allocation of Harvard Endowment—June 2004 


Harvard's Holdings Annual Returns Weight In 
By Sector l-Year, % 10-Year, % Endowment, 96 
Domestic Equities 22.8 17.8 15 
Foreign Equities 36.1 8.5 10 
Emerging Markets 6.6 9.7 $ 
Private Equity 20.8 31.5 13 
Hedge Funds 15.7 na 12 
High Yield 12.4 9.7 $ 
Commodities 19.7 10.9 13 
Real Estate 16.0 15.0 10 
Domestic Bonds 9.2 14.9 ll 
Foreign Bonds 17.4 16.9 5 
Inflation-Indexed Bonds 4.2 na. 6 
Total Endowment 211 15.9 105%* 


*includes slight leverage 


Source: Barons , February 1, 2005. 


As Lawrence Siegel of the Ford Foundation privately 
acknowledges, some of the Ford Foundation’s high Sharpe 
ratio results from dividing by an artificially smoothed 
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standard deviation, due to that institution’s high private 
equity allocation whose market prices do not retlect actual 
volatility. This is also true of Harvard. 

Exhibit 19 shows the Harvard endowment’ asset 
allocation. Its ten-year 15.9% performance on a $22.6 
billion portfolio was 3.1 percentage points better than the 
median of the 25 largest university endowments through 
June 2004. Harvard has continued its good performance, 
with extensive use of private equity and other non-tradi- 
tional investments. Ford’s performance in 2001-2002 was 
poor, and Berkshire Hathaway has doubled in price since 
the 2000 lows, so the rankings based on more data may 
well have changed. 

Exhibits 6, 16-17, and 20 give a short overview of 
key data. The numbers in italics indicate the worst out~ 
comes and those in bold the best. Windsor had rhe most 
negative months and negative years (tied with Berkshire 
Hathaway) and the lowest means. Berkshire Hathaway had 
the highest annual mean returns. 
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EXHIBIT 20 


Yearly Sharpe and Symmetric Downside Sharpe Ratios December 1985-A pril 2000 (%) 


Berkshire 
Windsor Hathaway Quantum Tiger 
Neg months 61 58 53 56 
arith mean, -3.54 -5.36 -5.65 -4.41 
neg 
arith mean, 1.17 2.15 1.77 2.02 
mon ` 
ds st dev, 5.15 6.46 10.08 6.34 
mon 
arith mean, 14.10 25.77 21.25 24.27 
yr 
ds st dev, yr — 17.85 22.36 34.91 2197 
ds Sharpe 0.495 0.917 0.458 0.865 
geomean, -3.62 -5.48 -5.95 — -453 
neg 
Beomean, 1.06 1.87 1.48 1.83 
mon 
ds geo st 5.15 6.46 10.09 — 6.34 
dev,mon 
geo mean, yr 12.76 22.38 17.76 2192 
ds geo st 17.86 22.37 3494 21.98 
dev, yr 
ds Sharpe 0.420 0.765 0.358 0.758 


Ford S&P US T- US 
Found Harvard Total Trea bils nfl 
44 na 56 54 0 13 
-2.24 na -3.31 -0.87 na -0.14 
1.19 na 145 0.63 044 026 
2.83 na 5.05 1.06 0.00 0.19 
14.29 na 1744 7.57 527 344 
9.81 na 17.49 3.66 0.00 0.64 
0.920 na 0.696 0.631 na -3.320 

-2.26 na -3.38 

1.16 na 1.35 0.62 0.44 026 
2.83 na 5.05 0.60 0.00 0.00 
13.86 na 1625 747 527 3.14 
9.81 na 17.49 2.09 0.00 6.00 
0.876 na 0.628 1.053 


CONCLUSIONS AND CAVEATS 


My downside-risk Sharpe ratio measure is ad hoc, 
as all performance measures are, and adds to but does not 
close the debate on this subject. 

Hodges [1998] proposes a generalized Sharpe measure 
that eliminates some of the paradoxes the Sharpe measure 
leads to. It uses a constant, absolute risk-return exponential 
utility function and general return distributions. When the 
returns are normally distributed, this is the usual Sharpe ratio, 
as that portfolio problem is equivalent to a mean-variance 
model. A better utility function is the constant relative risk 
aversion negative power. 

Leland [1999] shows how to modify Bs when there 
are fat tails into more correct fs in a CAPM framework. 

Goetzmann et al. (2002] and Spurgin (2000] show 
how the Sharpe ratio may be manipulated using option 
strategies to obtain what looks like a superior record to 
obtain more funds to manage. Managers sell calls to cut off 
upside variance, and use the proceeds to buy puts to cut 
off downside variance, leading to higher Sharpe ratios 
because of the reduced portfolio variance. These options 
transactions may actually lead to poorer investment per- 


120 


THE SYMMETRIC DOWNSIDE-Rusk SHARPE RATIO 


formance in final wealth terms even with their higher 
Sharpe ratios. Tompkins, Ziemba, and Hodges [2003], for 
example, show how on average the calls sold and the puts 
purchased on the S&P 500 both had negative expected val- 
ues over 1985-2002. 

I have not tried to establish when the symmetric down- 
side-risk Sharpe ratio might give misleading results in real 
invesunent situations or to establish its mathematical and sta- 
tistical properties. I note only that it is consistent with an 
investor whose utility is based on the negative of the disutil- 
ity of losses. The technique does seem to provide a simple 
way to avoid penalizing superior performance in order to 
more fairly evaluate performance. 

We will have to find another way to measure and 
establish the superiority of Warren Buffett. One likely 
candidate is related to the Kelly approach to evaluate 
investments, which looks at compounded wealth over a 
long period of time, which we know at the limit is attained 
by the log bettor (which Buffet seems to be). After 40 years, 
most of us believe Buffett is in the skill, not luck, category. 
Afterall, $15 a share in 1965 became $87,000 in June 2005, 
but since he is a log bettor only more time will tell. 
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ENDNOTES 


‘If there are two independent wagers, and the size of the 
bets does not influence the odds, an analytic expression can be 
derived; see Thorp [1997, pp. 19-20]. In general, to solve for the 
optimal wagers when the bets influence the odds, there is depen- 
dence. In the case of three or more wagers, one must solve a non- 
convex nonlinear program; see Ziemha and Hausch [1984, 1987] 
for tecbnique. This gives the optimal wager, taking into account 
the effect of our bets on the odds (prices). 

?Bettor B must use an essentially different strategy from our 
Kelly bettor for this to be true. This means that the strategies dif- 
fer infinitely often. For example, they are the same for the first 
ten years, and then every second trial is different. This is a tech- 
nical point to get proofs correct, but nothing much to worry about 
in practice since non-log strategies will differ infinitely often. 

Roy [1952], Markowitz [1959], Mao [1970], Bawa [1975, 
1978], Bawa and Lindenberg [1977], Fishburn [1977], Harlow 
and Rao [1989], and Harlow |1991] have used downside-risk mea- 
sures in portfolio theories other than those based on mean-vari- 
ance and related analyses. 
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POSTSCRIPT: THE RENAISSANCE MEDALLION FUND 


The Medallion Fund uses mathematical ideas such as the Kelly criterion to run a superior 
hedge fund.! The staff of technical researchers and traders, working under mathematician 
James Simons, is constantly devising edges that they use to generate successful trades of 
various durations including many short term trades that enter and exit in seconds. The fund, 
whose size is in the $5 billion area, has very large fees (596 management and 4446 incentive). 
Despite these fees and the large size of the fund, the net returns have been consistently 
outstanding, with a few small monthly losses and high positive monthly returns; see the 
histogram in Figure Al. Table Al shows the monthly net returns from January 1993 to 
April 2005. There were only 17 monthly losses in 148 months and 3 losses in 46 quarters 
and no yearly losses in these 12+ years of trading in our data sample. The mean monthly, 
quarterly and yearly net returns, Sharpe and Symmetric Downside Sharpe ratios are shown 
in Table A2? 

We calculated the quarterly standard deviation for the DSSR by multiplying the monthly 
standard deviation by sqrt(3) and multiplied it by sqrt(12) for the annual standard deviation. 
All calculations use arithmetic means. We know from Ziemba (2005) that the results using 
geometric means will have essentially the same conclusions. 

In Figure A3 we assumed that the fund had initial wealth of 100 dollars on Dec 31, 
1992. Figures A2 and A3 show the rates of return over time and the wealth graph over time 
assuming an initial wealth of 100 on December 31, 2002. 

Medallion's outstanding yearly DSSR of 26.4 is the best we have seen even higher than 
Princeton Newport's 13.8 during 1969-1988. Jim Simon's Medallion fund is near or at 
the top of the worlds hedge funds. Indeed Simons' $1.4 billion in 2005 was the highest in 


-5 0 5 10 15 20 25 


Figure Al Histogram of monthly returns of the Medallion Fund, January 1993 to April 2005. 


! WTZ is pleased to have had a minor role in teaching Simons about the Kelly criterion in 1992. 
? Thanks to Ilkay Boduroglu for making these calculations. 


Table A1 Net returns in percent of the Medallion Fund, January 1993 to April 2005, Yearly, Quarterly and Monthly 


1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004 2005 
Annual 

39.06 | 70.60 38.33 31.49 21.21 41.50 2454 9853 3112 229.4 25.28 27.77 
Quarterly 
QI 7.81 14.60 22.06 7.88 3.51 7.30 (0.25) 2544 12.62 5.90 4.29 9.03 8.30 
Q2 25.06 35.48 4.84 1.40 6.60 7.60 6.70 20.51 5.64 7.20 6.59 3.88 
Q3 4.04 11.19 3.62 10.82 8.37 9.69 6.88 8.58 7.60 8.91 8.77 5.71 
Q4 (0.86) (1.20) 431 8.44 1.41 11.73 9.48 20.93 2.42 4.44 3.62 6.72 
Monthly 
January 1.27 4.68 74 3.25 1.16 5.02 3.79 105 4.67 1.65 2.07 3.76 2.26 
February ' 3.08 5.16 7.54 1.67 2.03 196 ~2.44 9.37 2.13 3.03 2.53 1.97 2.86 
March 3.28 4.19 5.68 2.77 0.29 0.21 —1.49 3.8 5.36 1.12 —035 3.05 2.96 
April 6.89 2.42 4.1 0.44 1.01 0.61 322 9.78 2.97 3.81 1.78 0.86 0.95 
May 3.74 5.66 5.53 0.22 4.08 4.56 1.64 7.24 2.44 1.11 3.44 2.61 
June 12.78 25.19 -4.57 0.73 1.36 2.28 1.71 2.37 0.15 2.13 1.24 0.37 
July 3.15 6.59 —128 4.24 5.45 —-1.1 4.39 5.97 1 5.92 1.98 2.2 
August —0.67 7.96 5.91 2.97 1.9 4.31 1.22 3.52 3.05 1.68 2.38 2.08 
September 1.54 —338 —0.89 3.25 0.85 6.33 1.15 —1.02 3.38 1.13 4.18 1.33 
October 1.88 —2.05 0.3 637 —1.11 5.33 2.76 6.71 1.89 1.15 0.35 2.39 
November —1.51  —0.74 2.45 5.93 —022 2.26 5.42 8.66 0.17 1.42 1.42 3.03 
December | —12 1.62 1.52 2.77 3.73 1.06 43 0.35 1.81 1.81 1.16 
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Figure A2 Rates of return over time, Medallion Fund, January 1993 to April 2005 
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Figure A3 Wealth over time, Medallion Fund, January 1993 to April 2005 
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Table A2 Sharpe and Downside Symmetric Sharpe Ratios for the 
Medallion Fund, January 1993 to April 2005 


Yearly Quarterly Monthly 
SR 1.68 1.09 0.76 
DSSR 26.4 11.6 2.20 


Mean Risk Free Rate 


Table A3 T-bill interest rates in percent, January 1993 to April 2005, Yearly, Quarterly and Monthly 


1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004 2005 


Annual 

3.33 4.98 5.69 5.23 5.36 4.85 4.76 5.86 3.36 1.68 1.05 1.56 3.39 
Quarterly 
QI 2.99 327 5.77 4.94 5.06 5.07 441 5.53 485 1.71 114 0.90 2.56 
Q2 2.98 4.05 5.61 504 5.08 5.00 443 5.75 3.70 171 1.05 1.06 
Q3 3.00 4.52 5.38 5.13 5.06 4.89 4.67 6.00 3.25 1.63 0.92 1.48 
Q4 3.08 5.34 5.28 4,97 5.08 2.30 5.05 6.03 1.93 1.36 0.91 2.01 


Monthly 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004 2005 


January 2.99 3.14 5.46 5.16 5.00 5.07 4.33 521 5.63 1.85 128 O91 2.18 
February 2.99 3.21 5.62 5.05 5.03 5.07 4337 5.37 524 1.78 1.21 0.91 2.36 
March 2.99 327 5.77 494 5.06 5.07 441 5.53 485 1.71 1.14 0.90 2.54 


April 2.99 353 5.72 497 5.07 5.05 441 561 445 171 1.11 095 2.65 
May 2.98 3.79 5.67 5.01 5.07 5.02 442 5.68 4.05 1.71 1.08 1.01 
June 2.98 4.05 5.61 5.04 5.08 5.00 443 5.75 3.66 1.71 1.05 1.06 
July 2.99 421 5.54 5.07 5.07 496 4.51 5.83 3.52 1.68 1.00 1.20 


August 3.00 4.36 5.46 5.10 5.07 4.92 4.59 5.92 3.39 1.66 0.96 1.34 
September 3.02 4.52 5.38 5.13 5.06 4.89 4.67 6.00 3.25 1.63 0.92 1.48 
October 3.00 4.78 5.34 5.08 5.07 4.69 4.80 601 2.81 1.54 0.92 1.66 
November 3.06 5.05 5.3] 5.03 5.07 449 4.93 602 2.37 1.45 O91 1.83 
December 3.08 5.31 5.28. 4.97 5.08 4.30 5.05 6.03 1.93 1.36 0.91 201 


the world for hedge fund managers and his $1.6 billion in 2006 was second best. Since the 
fund is closed to all but about six outside investors plus employees we watch with envy but 
Renaissance's new $100billion fund accepts qualified investors. 
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Abstract 


The central problem for gamblers is to find positive expectation bets. But the gam- 
bler also needs to know how to manage his money, i.e., how much to bet. In the stock 
market (more inclusively, the securities markets) the problem is similar but more com- 
plex. The gambler, who is now an “investor”, looks for “excess risk adjusted return". 
In both these settings, we explore the use of the Kelly criterion, which is to maximize 
the expected value of the logarithm of wealth ("maximize expected logarithmic util- 
ity"). The criterion is known to economists and financial theorists by names such as 
the "geometric mean maximizing portfolio strategy", maximizing logarithmic utility, 
the growth-optimal strategy, the capital growth criterion, etc. The author initiated the 
practical application of the Kelly criterion by using it for card counting in blackjack. 
We will present some useful formulas and methods to answer various natural questions 
about it that arise in blackjack and other gambling games. Then we illustrate its recent 
use in a successful casino sports betting system. Finally, we discuss its application to 
the securities markets where it has helped the author to make a thirty year total of 80 
billion dollars worth of "bets". 


Keywords 


Kelly criterion, betting, long run investing, portfolio allocation, logarithmic utility, 
capital growth 


JEL classification: C61, D81, Gl 
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1. Introduction 


The fundamental problem in gambling is to find positive expectation betting op- 
portunities. The analogous problem in investing is to find investments with excess 
risk-adjusted expected rates of return. Once these favorable opportunities have been 
identified, the gambler or investor must decide how much of his capital to bet. This 
is the problem which we consider here. It has been of interest at least since the 
eighteenth century discussion of the St. Petersburg Paradox (Feller, 1966) by Daniel 
Bernoulli. 

One approach is to choose a goal, such as to minimize the probability of total loss 
within a specified number of trials, N. Another example would be to maximize the 
probability of reaching a fixed goal on or before N trials (Browne, 1996). 

A different approach, much studied by economists and others, is to value money 
using a utility function. These are typically defined for all non-negative real numbers, 
have extended real number values, and are non-decreasing (more money is at least as 
good as less money). Some examples are U(x) = x^, 0 < a < oo, and U(x) = logx, 
where log means log,, and log 0 = —oo. Once a utility function is specified, the object 
is to maximize the expected value of the utility of wealth. 

Daniel Bernoulli used the utility function log x to "solve" the St. Petersburg Para- 
dox. (But his solution does not eliminate the paradox because every utility function 
which is unbounded above, including log, has a modified version of the St. Petersburg 
Paradox.) The utility function log x was revisited by Kelly (1956) where he showed 
that it had some remarkable properties. These were elaborated and generalized in an 
important paper by Breiman (1961). Markowitz (1959) illustrates the application to se- 
curities. For a discussion of the Kelly criterion (the "geometric mean criterion") from 
a finance point of view, see McEnally (1986). He also includes additional history and 
references. 

I was introduced to the Kelly paper by Claude Shannon at M.LT. in 1960, shortly 
after I had created the mathematical theory of card counting at casino blackjack. Kelly's 
criterion was a bet on each trial so as to maximize Elog X, the expected value of 
the logarithm of the (random variable) capital X. I used it in actual play and intro- 
duced it to the gambling community in the first edition of Beat the Dealer (Thorp, 
1962). If all blackjack bets paid even money, had positive expectation and were in- 
dependent, the resulting Kelly betting recipe when playing one hand at a time would 
be extremely simple: bet a fraction of your current capital equal to your expectation. 
This is modified somewhat in practice (generally down) to allow for having to make 
some negative expectation "waiting bets", for the higher variance due to the occur- 
rence of payoffs greater than one to one, and when more than one hand is played at a 
time. 

Here are the properties that made the Kelly criterion so appealing. For ease of un- 
derstanding, we illustrate using the simplest case, coin tossing, but the concepts and 
conclusions generalize greatly. 
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2. Coin tossing 


Imagine that we are faced with an infinitely wealthy opponent who will wager even 
money bets made on repeated independent trials of a biased coin. Further, suppose that 
on each trial our win probability is p > 1/2 and the probability of losing is g = 1 — p. 
Our initial capital is Xo. Suppose we choose the goal of maximizing the expected value 
E (X,) after n trias. How much should we bet, Bg, on the kth trial? Letting 7, = 1 if the 
kth trial is a win and 7, = —| if itis a loss, then X, = X,;_, +T; B; fork = 1,2,3,..., 
and X,, = Xo + 5. Tk By. Then 


E(Xn) = Xo + 9 , E(BTi) = Xo + 9 (p — 4) EGO. 
kzl k=l 


Since the game has a positive expectation, i.e., p — q > O in this even payoff situation, 
then in order to maximize E(X,) we would want to maximize E(B;) at each trial. 
Thus, to maximize expected gain we should bet all of our resources at each trial. Thus 
B, = Xo and if we win the first bet, By = 2Xo, etc. However, the probability of ruin is 
given by | — p" and with p < 1, lim,—oo[1 — p^] = 1 so ruin is almost sure. Thus the 
"bold" criterion of betting to maximize expected gain is usually undesirable. 

Likewise, if we play to minimize the probability of eventual ruin (i.e., "ruin" occurs 
if X, = 0 on the kth outcome) the well-known gambler's ruin formula in Feller (1966) 
shows that we minimize ruin by making a minimum bet on each trial, but this unfortu- 
nately also minimizes the expected gain. Thus “timid” betting is also unattractive. 

This suggests -an intermediate strategy which is somewhere between maximizing 
E(X,) (and assuring ruin) and minimizing the probability of ruin (and minimizing 
E(X,,)). An asymptotically optimal strategy was first proposed by Kelly (1956). 

In the coin-tossing game just described, since the probabilities and payoffs for each 
bet are the same, it seems plausible that an "optimal" strategy will involve always wa- 
gering the same fraction f of your bankroll. To make this possible we shall assume 
from here on that capital is infinitely divisible. This assumption usually does not matter 
much in the interesting practical applications. 

If we bet according to B; = f Xj 4, whereO < f < 1, this is sometimes called “fixed 
fraction" betting. Where S and F are the number of successes and failures, respectively, 
in z trials, then our capital after 7 trials is X,, = Xo(1 + fC — hl. where $+ F =n. 
With f in the interval O < f < 1, Pr(X,, = 0) = 0. Thus "ruin" in the technical sense 
of the gambler's ruin problem cannot occur. "Ruin" shall henceforth be reinterpreted to 
mean that for arbitrarily small positive e, lim, , o;[Pr(X,, < €)] = 1. Even in this sense, 
as we shall see, ruin can occur under certain circumstances. 

We note that since 


Xo ; 
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the quantity 


" 1/n F 
Gif) = teg | = 4 leg + f) + — log — f) 


measures the exponential rate of increase per trial. Kelly chose to maximize the expected 
value of the growth rate coefficient, g( f), where 
x, |" S F 
g(f) = E|1log| — = E}—log(1+ f) + — lo 
Xo n n > 
= plog(1 + f) * qlog(1 — f). 


Note that g(f) = (1/n)E (log X,) — (1/n) log Xo so for n fixed, maximizing g(f) is 
the same as maximizing E log X,,. We usually will talk about maximizing g(f) in the 
discussion below. Note that 
; p d p-q-f 
sP =I TF ücpü-p 
when f = f* — p — q. 
Now 


£g" (f) = —p/U+ f - qJ(0 — f? «0 


so that g'( f) is monotone strictly decreasing on (0, 1). Also g'(0) = p — q > O0 and 
lim y. ,;- g’(f) = —oo. Therefore by the continuity of g’(f), g(f) has a unique maxi- 
mum at f = f*, where g(f*) = plog p+q logg+log2 > 0. Moreover, g(0) = 0 and 
lim fg- g(f) = —oo so there is a unique number fe > 0, whereO < f* < fe < 1, 
such that g( f.) = 0. The nature of the function g( f) is now apparent and a graph of 
g(f) versus f appears as shown in Figure 1. 

The following theorem recounts the important advantages of maximizing g(f). The 
details are omitted here but proofs of (i)-(iil), and (vi) for the simple binomial case 
can be found in Thorp (1969); more general proofs of these and of (iv) and (v) are in 
Breiman (1961). 


ea- n 


Theorem 1. (i) /f g(f) > 0, then lim; oo Xn = œ almost surely, i.e., for each M, 
Pr[lim inf, o5 X, > M] = l; 

(1) If g(f) < 0, then lim; soo Xn = 0 almost surely; i.e, for each € > Q, 
Pr[limsup, , 45 Xn < e] = 1; 

(iii) If e Cf) = 0, then lim sup, , 4, X, = œ a.s. and liminf, +o X, = 0 a.s. 

(iv) Given a strategy &* which maximizes Elog X, and any other "essentially 
different” strategy $ (not necessarily a fixed fractional betting strategy), then 
lim; oo X, (6*)/ X, ($) = ooa.s. 

(v) The expected time for the current capital X, to reach any fixed preassigned goal 
C is, asymptotically, least with a strategy which maximizes E log X,,. 

(vi) Suppose the return on one unit bet on the ith trial is the binomial random vari- 
able U;; further, suppose that the probability of success is pj, where 1/2 < pj < 1l. 
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Fig. 1. 


Then E log X, is maximized by choosing on each trial the fraction f* = pi — qi which 
maximizes E log(l + f;U;). 


Part (i) shows that, except for a finite number of terms, the player's fortune X,, will 
exceed any fixed bound M when f is chosen in the interval (0, fe). But, if f > fe, 
part (ii) shows that ruin is almost sure. Part (111) demonstrates that if f = fe, Xn will 
(almost surely) oscillate randomly between 0 and +00. Thus, one author's statement 
that X, > Xo as n — oo, when f = fe, is clearly contradicted. Parts (iv) and (v) 
show that the Kelly strategy of maximizing £ log X,, is asymptotically optimal by two 
important criteria. An "essentially different" strategy is one such that the difference 
Ein X7 — E ln X, between the Kelly strategy and the other strategy grows faster than 
the standard deviation of In X5 — In X,,, ensuring P(In X7 — In X, > 0) — 1. Part (vi) 
establishes the validity of utilizing the Kelly method of choosing f* on each trial (even 
if the probabilities change from one trial to the next) in order to maximize £ log X,.. 


Example 2.1. Player A plays against an infinitely wealthy adversary. Player A wins 
even money on successive independent flips of a biased coin with a win probability of 
p = .53 (no ties). Player A has an initial capital of Xo and capital is infinitely divisible. 
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Applying Theorem l(vi), f* = p — q = .53 — .47 = .06. Thus 6% of current capital 
should be wagered on each play in order to cause X,, to grow at the fastest rate possible 
consistent with zero probability of ever going broke. If Player A continually bets a 
fraction smaller than 6%, X,, will also grow to infinity but the rate will be slower. 

If Player A repeatedly bets a fraction larger than 6%, up to the value f,, the same 
thing applies. Solving the equation g(f) = .53log(1 + f) + .47log(1 — f) = 0 
numerically on a computer yields fe = .11973~. So, if the fraction wagered is more 
than about 12%, then even though Player A may temporarily experience the pleasure of 
a faster win rate, eventual downward fluctuations will inexorably drive the values of X, 
toward zero. Calculation yields a growth coefficient of g(f*) = f(.06) = .001801 so 
that after n successive bets the log of Player A's average bankroll will tend to .001801n 
times as much money as he started with. Setting .001801n — log2 gives an expected 
time of about n — 385 to double the bankroll. 

The Kelly criterion can easily be extended to uneven payoff games. Suppose Player A 
wins b units for every unit wager. Further, suppose that on each trial the win probability 
is p > O0 and pb — q > Oso the game is advantageous to Player A. Methods similar to 
those already described can be used to maximize 


e(f) = Elog(X,/ Xo) = plog + bf) - qlog(1— f). 


Arguments using calculus yield f* = (bp —q)/b, the optimal fraction of current capital 
which should be wagered on each play in order to maximize the growth coefficient g( f). 

This formula for f* appeared in Thorp (1984) and was the subject of an April 1997 
discussion on the Internet at Stanford Wong’s website, http://bj21.com (miscellaneous 
free pages section). One claim was that one can only lose the amount bet so there was 
no reason to consider the (simple) generalization of this formula to the situation where 
a unit wager wins b with probability p > O and loses a with probability q. Then if 
the expectation m = bp — aq > 0, f* > 0 and f* = m/ab. The generalization 
does stand up to the objection. One can buy on credit in the financial markets and lose 
much more than the amount bet. Consider buying commodity futures or selling short a 
security (where the loss is potentially unlimited). See, e.g., Thorp and Kassouf (1967) 
for an account of the E.L. Bruce short squeeze. 

For purists who insist that these payoffs are not binary, consider selling short a binary 
digital option. These options are described in Browne (1996). 

A criticism sometimes applied to the Kelly strategy is that capital is not, in fact, 
infinitely divisible. In the real world, bets are multiples of a minimum unit, such as 
$1 or $.01 (penny "slots". In the securities markets, with computerized records, the 
minimum unit can be as small as desired. With a minimum allowed bet, "ruin" in the 
standard sense is always possible. It is not difficult to show, however (see Thorp and 
Walden, 1966) that if the minimum bet allowed is small relative to the gambler's initial 
capital, then the probability of ruin in the standard sense is "negligible" and also that 
the theory herein described is a useful approximation. This section follows Rotando and 
Thorp (1992). 
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3. Optimal growth: Kelly criterion formulas for practitioners 


Since the Kelly criterion asymptotically maximizes the expected growth rate of wealth, 
it is often called the optimal growth strategy. It is interesting to compare it with the other 
fixed fraction strategies. I will present some results that I have found useful in practice. 
My object is to do so in a way that 1s simple and easily understood. These results have 
come mostly from sitting and thinking about "interesting questions". I have not made 
a thorough literature search but I know that some of these results have been previously 
published and in greater mathematical generality. See, e.g., Browne (1996, 1997) and 
the references therein. 


3.1. The probability of reaching a fixed goal on or before n trials 


We first assume coin tossing. We begin by noting a related result for standard Brownian 
motion. Howard Tucker showed me this in 1974 and it is probably the most useful single 
fact I know for dealing with diverse problems in gambling and in the theory of financial 
derivatives. 
For standard Brownian motion X (t), we have 
P(sup|X() — (at -b] 20, 0€ 1& T) 
= N(—a — B) + e 7^ N (y — B) (3.1) 
where œ = a4/T and B = b/A/T. See Figure 2. See Appendix B for Tucker’s derivation 
of (3.1). 
In our applicationa < 0, b > Oso we expect limp. P(X (t) 2at--b, Oc t «& T) 
=1. 
Let f be the fraction bet. Assume independent identically distributed (i.d.d.) trials Y;, 
i=1,...,n, with P(Y; = 1) = p > 1/2, P(Y; = —1) =q < 1/2; also assume p < | 
to avoid the trivial case p = 1. 


slope a«0 


Fig. 2. 
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Bet a fixed fraction f, 0 < f < l, at each trial. Let V, be the value of the gambler 
or investor’s bankroll after k trials, with initial value Vo. Choose initial stake Vo = | 
(without loss of generality); number of trials n; goal C > 1. 

What is the probability that V; 2 C for some k, 1 < k < n? This is the same as the 
probability that log V, > log C for some k, | < k < n. Letting In = log, we have: 


k 
V2 p[ü-Y;£ and 


i-l 


k 
In Vy = Soin + Y; f), 


i=l 


k 
E In V, =$ Em +Y; f), 
i=l 
k 


Var(In Vj) = Y ^ Var(In(1 + Y; f)), 


Ein 3.) k pna 4- f) - qln(1 — f) 2m 5 gf). 
Var[In(1 + y; f] = pfa + Of + [na — Of =m? 
= (p — p?)[na + AF + (« - a?) [ma - of 
—2pq ln(1 + f) In(1— f) 
= pa{{ind + Df -2ma + m — 0 + [na - op] 
= pafin[(l + 0/0 = PF = s. 
Drift in n trials: mn. 


Variance in n trials: s?n. 


Inve >InC, | ek &n, iff 


k 
imüerf2mnCgkkzn iff 
i=! - 
k 
S, = > [Ind + Y; f) m] > InC — mk, l<k<n, 
i=l 


E(Sj) 20,  Var(Si) = 57k. 


We want Prob(S; 2 InC —mk, 1 <k <n). 
Now we use our Brownian motion formula to approximate S, by Prob(X (f) 2 InC — 
mt /s?, l<te< s?n) where each term of S, is approximated by an X (ż), drift 0 and 
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variance s? OSIS, s? L t < 2s?, ..., (n — Ds? <I <S ns?). Note: the 
approximation is only “good” for "large" n. 
Then in the original formula (3.1): 


T — s?n, 
b — nC, 
a = —mj/s?, 


a =aVT =—mJn/s, 
B=b/VT =InC/sJn. 


Example 3.1. 
C =2, 
n= 10, 
pl 
q = .49, 

f = 0117, 


m — .000165561, 
s? = 000136848. 
Then 


P() = 9142. 


Example 3.2. Repeat with 
f 2.0, 
then 


m = .000200013, s? = .000399947 and P()z.9214. 
3.2. The probability of ever being reduced to a fraction x of this initial bankroll 


This is a question that is of great concern to gamblers and investors. It is readily an- 
swered, approximately, by our previous methods. 

Using the notation of the previous section, we want P(V; < x for some k, 1x k < 
oo). Similar methods yield the (much simpler) continuous approximation formula: 


Prob) 2 e?^ where a = —m/s? and b = — lnx 
which can be rewritten as 


Prob(-) = x^(Qm/s?) where ^ means exponentiation. (3.2) 
P 
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Example 3.3. 

p-5, f= f* = 02, 

2m/s = 1.0002, 

Prob(-) = x. 


We will see in Section 7 that for the limiting continuous approximation and the Kelly 
optimal fraction f*, P(V;(f*) < x for some k > 1) = x. 

My experience has been that most cautious gamblers or investors who use Kelly 
find the frequency of substantial bankroll reduction to be uncomfortably large. We can 
see why now. To reduce this, they tend to prefer somewhat less than the full betting 
fraction f*. This also offers a margin of safety in case the betting situations are less 
favorable than believed. The penalty in reduced growth rate is not severe for moderate 
underbetting. We discuss this further in Section 7. 


3.3. The probability of being at or above a specified value at the end of a specified 
number of trials 


Hecht (1995) suggested setting this probability as the goal and used a computerized 
search method to determine optimal (by this criterion) fixed fractions for p — q = .02 
and various c, n and specified success probabilities. 

This is a much easier problem than the similar sounding in Section 3.1. We have for 
the probability that X (7) at the end exceeds the goal: 


P(X(T) 2aT +b) = exp{—x?/2T} dx 


l oo 
2n T Jar+b 
1 oo 
i A 2x T Jar 24pr-M2 


where u = x//T sox — aT +b gives u/ T — aT +b and U =aT!/*+bT~'/?. The 
integral equals 


exp{—u*/2} du 


= 1—N(a+ B) = N(-a — f). (3.3) 


For example (3.1) f = .0117 and P = .7947. For example (3.2) P = .7433. Example 
(3.1) is for the Hecht optimal fraction and example (3.2) is for the Kelly optimal fraction. 
Note the difference in P values. 

Our numerical results are consistent with Hecht's simulations in the instances we 
have checked. 

Browne (1996) has given an elegant continuous approximation solution to the prob- 
lem: What is the strategy which maximizes the probability of reaching a fixed goal C on 
or before a specified time n and what is the corresponding probability of success? Note 
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that the optimal strategy will in general involve betting varying fractions, depending on 
the time remaining and the distance to the goal. 

As an extreme example, just to make the point, suppose n = | and C = 2. If Xo < | 
then no strategy works and the probability of success is 0. But if 1 < Xo < 2 one should 
bet at least 2 — Xo, thus any fraction f Z (2 — Xo)/ Xo, for a success probability of p. 
Another extreme example: n = 10, C = 2!° = 1024, Xo = 1. Then the only strategy 
which can succeed is to bet f = | on every trial. The probability of success is p!? for 
this strategy and O for all others Gif p < 1), including Kelly. 


3.4. Continuous approximation of expected time to reach a goal 


According to Theorem 1(v), the optimal growth strategy asymptotically minimizes the 
expected time to reach a goal. Here is what this means. Suppose for goal C that m(C) 
is the greatest lower bound over all strategies for the expected time to reach C. Suppose 
t* (C) is the expected time using the Kelly strategy. Then limc+o9(t*(c)/m(c)) = 1. 

The continuous approximation to the expected number of trials to reach the goal 
C > Xo = lis 


n(C, f) = (In C)/gCf) 


where f is any fixed fraction strategy. Appendix C has the derivation. Now g(f) has a 
unique maximum at g(f*) so n(C, f) has a unique minimum at f = f*. Moreover, 
we can see how much longer it takes, on average, to reach C if one deviates from f*. 


3.5. Comparing fixed fraction strategies: the probability that one strategy leads 
another after n trials 


Theorem I(iv) says that wealth using the Kelly strategy will tend, in the long run, to 

an infinitely large multiple of wealth using any "essentially different" strategy. It can 

be shown that any fixed f # f" is an "essentially different" strategy. This leads to the 

question of how fast the Kelly strategy gets ahead of another fixed fraction strategy, and 

more generally, how fast one fixed fraction strategy gets ahead of (or behind) another. 
If W, is the number of wins in n trials and n — W, is the number of losses, 


G(f) = (QV, /n)n( + f) + (0) — W,/n)ln(1 — f) 


is the actual (random variable) growth coefficient. 
As we Saw, its expectation 1s 


gU) = E(G(f)) = plog + f) +4 log(l — f) 4) 
and the variance of G( f) is 
Var G(f) = ((pq)/n){In(l + /a- )l (3.5) 


and it follows that G( f), which has the form G(f) = a(»; %)/n +b, is approximately 
normally distributed with mean g(f) and variance Var G( f). This enables us to give 
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the distribution of X, and once again answer the question of Section 3.3. We illustrate 
this with an example. 


Example 3.4. 


p=5l, q-—.9, f*—.02, N —10,000 and 
s — standard deviation of G( f) 


gis f 8 s Pr(G(f) < 0) 
1.5 .01 .000 150004 .0001 .067 
1.0 .02 .000200013 .0002 .159 

5 .03 .000149977 .0003 .309 


Continuing, we find the distribution of G( f2) — G( f1). We consider two cases. 


Case |. The same game 

Here we assume both players are betting on the same trials, e.g., betting on the same 
coin tosses, or on the same series of hands at blackjack, or on the same games with the 
same odds at the same sports book. In the stock market, both players could invest in the 


same “security” at the same time, e.g., a no-load S&P 500 index mutual fund. 
We find 


E(G(f2) — GD) = plog((1 + £2/0 + f) + alog(( — f2)/U — fd) 


and 


_ 2 
Var(G( fo) — GCfi)) = (palo eg] C + alt ^y 


\-fpJN\I+Sh 


where G(f2) — G(f)) is approximately normally distributed with this mean and vari- 
ance. 


Case 2. Identically distributed independent games 

This corresponds to betting on two different series of tosses with the same coin. 
E(G(f2) — GCf1)) is as before. But now Var(G( fo) — G(fi)) = Var(G(f2)) + 
Var(G ( fi)) because G (f2) and G( fi) are now independent. Thus 


2 2 
Var(GCf2) ~ G(fi)) = (vain) {foe (5 - 4) + oe( = +A) i 
Let 
7 1+ fi _ | * fo 
emet) emel) 


Then in Case 1, Vj = (pq/n)(a — by and in Case 2, V; — (pq /n)(a2 + p?) and since 
a,b > 0, Vi. « V2 as expected. We can now compare the Kelly strategy with other 
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fixed fractions to determine the probability that Kelly leads after n trials. Note that this 
probability is always greater than 1/2 (within the accuracy limits of the continuous 
approximation, which is the approximation of the binomial distribution by the normal, 
with its well known and thoroughly studied properties) because g(f*) — g(f) > 0 
where f* = p —q and f # f* is some alternative. This can fail to be true for small n, 
where the approximation is poor. As an extreme example to make the point, if n = 1, 
any f > f* beats Kelly with probability p > 1/2. If instead n = 2, f > f* wins 
with probability p? and p? > 1/2if p > 1/42 = .7071. Also, f < f* wins with 
probability | — p? and 1 — p? > 1/2 if p? < 1/2, ie, p < 1/V2 = .7071. So when 
n — 2, Kelly always loses more than half the time to some other f unless p — 1/ V2. 

We now have the formulas we need to explore many practical applications of the 
Kelly criterion. 


4. The long run: when will the Kelly strategy “dominate”? 


The late John Leib wrote several articles for Blackjack Forum which were critical of the 
Kelly criterion. He was much bemused by “the long run". What is it and when, if ever, 
does it happen? 

We begin with an example. 


Example 4.1. 
p= 5i, n= 10,000, 


V; and s;,i = J, 2, are the variance and standard deviation, respectively, for Section 3.5 
Cases | and 2, and R = V2/V, = (a? + b?)/(a — b)? so s2 = sı VR. Table 1 sum- 
marizes some results. We can also approximate VR with a power series estimate using 


only the first term of a and of b: a = 2f}, b = 2f» so VR = Jf? + FRU — hl. 
The approximate results, which agree extremely well, are 2.236, 3.606 and 1.581, re- 
spectively. 

The first two rows show how nearly symmetric the behavior is on each side of the 
optimal f* = .02. The column (g2 — 21)/s; shows us that f* = .02 only has a .5 
standard deviation advantage over its neighbors f = .01 and f = .03 after n = 10,000 


Table | 
Comparing strategies 


J P 42 — 81 5| (go — g1)/5| VR 
Ol 02 .00005001 .00010000 50 2.236 
03 02 .00005004 .00010004 50 3.604 


.03 .01 ` 00000003 .00020005 .00013346 1.581! 
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Table 2 
The long run: (g2 — 9;)/s after n trials 


fi h n= 104 n=4x 104 n = 16 x 104 n — 10° 
.01 .02 is) 1.0 2.0 5.0 
.03 .02 5 1.0 2.0 5.0 
.03 .01 .000133 .000267 .000534 .001335 


trials. Since this advantage is proportional to y/n, the column (g2 — g1)/51 from Table 1 
gives the results of Table 2. 

The factor /R in Table 1 shows how much more slowly f» dominates f; in Case 2 
versus Case 1. The ratio (g2 — g1)/s2 is VR times as large so the same level of domi- 
nance takes R times as long. When the real world comparisons of strategies for practical 
reasons often use Case 2 comparisons rather than the more appropriate Case 1 compar- 
isons, the dominance of f* is further obscured. An example is players with different 
betting fractions at blackjack. Case 1 corresponds to both betting on the same sequence 
of hands. Case 2 corresponds to them playing at different tables (not the same table, 
because Case 2 assumes independence). (Because of the positive correlation between 
payoffs on hands played at the same table, this is intermediate between Cases 1 and 2.) 

It is important to understand that "the long run", i.e., the time it takes for f* to 
dominate a specified neighbor by a specified probability, can vary without limit. Each 
application requires a separate analysis. In cases such as Example 4.1, where dominance 
is "slow", one might argue that using f* is not important. As an argument against this, 
consider two coin-tossing games. In game | your edge is 1.0%. In game 2 your edge 
is 1.1%. With one unit bets, after n trials the difference in expected gain is E? — E, = 
.001n with standard deviation s of about /2n hence (E> — E\)/s = 001. /n/V2 which 
is 1 when n = 2 x 106. So it takes two million trials to have an 84% chance of the 
game 2 results being better than the game 1 results. Does that mean it's unimportant to 
select the higher expectation game? 


5. Blackjack 


For a general discussion of blackjack, see Thorp (1962, 1966), Wong (1994) and Griffin 
(1979). The Kelly criterion was introducéd for blackjack by Thorp (1962). The analysis 
is more complicated than that of coin tossing because the payoffs are not simply one 
to one. In particular the variance is generally more than 1 and the Kelly fraction tends 
to be less than for coin tossing with.the same expectation. Moreover, the distribution 
of various payoffs depends on the player advantage. For instance the frequency of pair 
splitting, doubling down, and blackjacks all vary as the advantage changes. By binning 
the probability of payoff types according to ex ante expectation, and solving the Kelly 
equations on a computer, a strategy can be found which is as close to optimal as desired. 
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There are some conceptual subtleties which are noteworthy. To illustrate them we'll 
simplify to the coin toss model. 

At each trial, we have with probability .5 a "favorable situation" with gain or loss X 
per unit bet such that P(X = 1) = .51, P(X = —1) = .49 and with probability .5 an 
unfavorable situation with gain or loss Y per unit bet such that P(Y — 1) — .49 and 
P(Y = —1) = .51. We know before we bet whether X or Y applies. 

Suppose the player must make small "waiting" bets on the unfavorable situations in 
order to be able to exploit the favorable situations. On these he will place “large” bets. 
We consider two cases. 

Case |. Bet fo on unfavorable situations and find the optimal f* for favorable situa- 
tions. We have 


g(f) = .5(.51log(l + f) + 49 log(l — f)) 
+ .5(.49 log(1 + fo) + .51log(1 — fo)). (5.1) 


Since the second expression in (5.1) is constant, f maximizes g( f) if it maximizes the 
first expression, so f" = p — q = .02, as usual. It is easy to verify that when there is 
a spectrum of favorable situations the same recipe, f* = p; — q; for the ith situation, 
holds. Again, in actual blackjack f; would be adjusted down somewhat for the greater 
variance. With an additional constraint such as f; < kfo, where k is typically some 
integral multiple of fo, representing the betting spread adopted by a prudent player, 
then the solution is just f; < min(f*, kf). 

Curiously, a seemingly similar formulation of the betting problem leads to rather 
different results. 

Case 2. Bet f in favorable situations and af in unfavorable situations, 0 <a « I. 

Now the bet sizes in the two situations are linked and both the analysis and results 
are more complex. We have a Kelly growth rate of 


g(f) = .5(5llog(l + f) + 49log(1 — f)) 
+ .5(49log(1 +af) + .51log(1 — af)). (52) 


If we choose a = 0 (no bet in unfavorable situations) then the maximum value for g( f) 
is at f* = .02, the usual Kelly fraction. 

If we make “waiting bets", corresponding to some value of a > 0, this will shift the 
value of f* down, perhaps even to 0. The expected gain divided by the expected bet is 
.02(1 — a)/(1 + a), a 2 0. If a = 0 we get .02, as expected. If a = 1, we get 0, as 
expected: this is a fair game and the Kelly fraction is f* = 0. As a increases from 0 
to | the (optimal) Kelly fraction f* decreases from .02 to 0. Thus the Kelly fraction for 
favorable situations is less in this case when bets on unfavorable situations reduce the 
overall advantage of the game. 

Arnold Snyder called to my attention the fact that Winston Yamashita had (also) made 
this point (March 18, 1997) on the "free" pages, miscellaneous section, of Stanford 
Wong's web site. 
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Table 3 
f* versus a 
a f* a f* a f* 
0 .0200 1/3 .0120 7 .0040 
A .0178 E .0103 8 .0024 
2 0154 5 .0080 9 .0011 
3 .0128 6 .0059 1.0 .0000 


For this example, we find the new f* for a given value of a, 0 < a < 1, by solving 
g'Cf) 2 0. A value of a = 1/3, for instance, corresponds to a bet of 1/3 unit on Y and 
l unit on X, a betting range of 3 to 1. The overall expectation is .01. Calculation shows 
f* = .012001. Table 3 shows how f* varies with a. 

To understand why Cases 1 and 2 have different f*, look first at Equation (5.1). 
The part of g( f) corresponding to the unfavorable situations is fixed when fo is fixed. 
Only the part of g( f) corresponding to the favorable situations is affected by varying f. 
Thus we maximize g( f) by maximizing it over just the favorable situations. Whatever 
the result, it is then reduced by a fixed quantity, the part of g containing fo. On the 
other hand, in Equation (5.2) both parts of g(f) are affected when f varies, because 
the fraction af used for unfavorable situations bears the constant ratio a to the fraction 
f used in favorable situations. Now the first term, for the favorable situations, has a 
maximum at f = .02, and is approximately “flat” nearby. But the second term, for 
the unfavorable situations, is negative and decreasing moderately rapidly at f = .02. 
Therefore, it we reduce f somewhat, this term increases somewhat, while the first term 
decreases only very slightly. There is a net gain so we find f* < .02. The greater a 
is, the more important is the effect of this term so the more we have to reduce f to 
get f*, as Table 3 clearly shows. When there is a spectrum of favorable situations the 
solution is more complex and can be found through standard multivariable optimization 
techniques. 

The more complex Case 2 corresponds to what the serious blackjack player is likely 
to need to do in practice. He will have to limit his current maximum bet to some multiple 
of his current minimum bet. As his bankroll increases or decreases, the corresponding 
bet sizes will increase or decrease proportionately. 


6. Sports betting 


In 1993 an outstanding young computer science Ph.D. told me about a successful sports 
betting system that he had developed. Upon review I was convinced. I made suggestions 
for minor simplifications and improvements. Then we agreed on a field test. We found 
a person who was extremely likely to always be regarded by the other sports bettors as 
a novice. I put up a test bankroll of $50,000 and we used the Kelly.system to estimate 
our bet size. l 
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Fig. 4. Betting log Type 1 sports. 


We bet on |01 days in the first four and a half months of 1994. The system works for 
various sports. The results appear in Figures 3 and 4. After 101 days of bets, our $50,000 
bankroll had a profit of $123,000, about $68,000 from Type | sports and about $55,000 
from Type 2 sports. The expected returns are shown as about $62,000 for Type 1 and 
about $27,000 for Type 2. One might assign the additional $34,000 actually won to luck. 
But this is likely to be at most partly true because our expectation estimates from the 
model were deliberately chosen to be conservative. The reason is that using too large 
an f* and overbetting is much more severely penalized than using too small an f* and 
underbetting. i 
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Though $123,000 is a modest sum for some, and insignificant by Wall Street stan- 
dards, the system performed as predicted and passed its test. We were never more than 
a few thousand behind. The farthest we had to invade our bankroll to place bets was 
about $10,000. 

Our typical expectation was about 696 so our total bets ("action") were about 
$2,000,000 or about $20,000 per day. We typically placed from five to fifteen bets a 
day and bets ranged from a few hundred dollars to several thousand each, increasing as 
our bankroll grew. 

Though we had a net win, the net results by casino varied by chance from a substantial 
loss to a large win. Particularly hard hit was the “sawdust joint" Little Caesar's. It “died” 
towards the end of our test and I suspect that sports book losses to us may have expedited 
its departure. 

One feature of sports betting which is of interest to Kelly users is the prospect of 
betting on several games at once. This also arises in blackjack when (a) a player bets 
on multiple hands or (b) two or more players share a common bankroll. The standard 
techniques readily solve such problems. We illustrate with: 


Example 6.1. Suppose we bet simultaneously on two independent favorable coins with 
betting fractions f| and fz and with success probabilities py and p2, respectively. Then 
the expected growth rate is given by 
g(fi, f2) = pin MO + fi + fo) + piga + fi — fo) 
+ qipain(l — fi + f2) + q1q2]n(1 — fi — f2). 


To find the optimal ff and f7 we solve the simultaneous equations dg/df; = 0 and 
dg/df2 = 0. The result is 


pip? — 41492 


fith = — =c, 
pipa c 4142 
192 — 41p2 
fi-f= PRTA q. 
piq2 + q1p2 
ft = (c + d)/2, fF = (c — d)/2. (6.1) 


These equations pass the symmetry check: interchanging 1 and 2 throughout maps 
the equation set into itself. 

An alternate form is instructive. Let m; = pi — qi, i = 1,2 so p; = (1 + m;)/2 and 
qi = (1 — mj)/2. Substituting in (6.1) and simplifying leads to: 


u m; + m2 d= mı — m? 
E L+mym2’ i 1— mm’ 
mj(1— m2) m»(l — m?) 
fH. ff=—— > (6.2) 
| - mim5 1 — mimi 


which shows clearly the factors by which the f* are each reduced from m*. Since the 
m; are typically a few percent, the reduction factors are typically very close to 1. 
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In the special case p; = p2 = p, d = O and f* = ff = fj =c/2 = (p — qM 
(2(p? + q?)). Letting m = p — q this may be written f* = m/(1 4- m2) as the optimal 
fraction to bet on each coin simultaneously, compared to f* — m to bet on each coin 
sequentially. 

Our simultaneous sports bets were generally on different games and typically not 
numerous so they were approximately independent and the appropriate fractions were 
only moderately less than the corresponding single bet fractions. Question: Is this al- 
ways true for independent simultaneous bets? Simultaneous bets on blackjack hands at 
different tables are independent but at the same table they have a pairwise correlation 
that has been estimated at .5 (Griffin, 1979, p. 142). This should substantially reduce 
the Kelly fraction per hand. The blackjack literature discusses approximations to these 
problems. On the other hand, correlations between the returns on securities can range 
from nearly —1 to nearly 1. An extreme correlation often can be exploited to great 
advantage through the techniques of “hedging”. The risk averse investor may be able 
to acquire combinations of securities where the expectations add and the risks tend to 
cancel. The optimal betting fraction may be very large. 


The next example is a simple illustration of the important effect of covariance on the 
optimal betting fraction. 


Example 6.2. We have two favorable coins as in the previous example but now their 

outcomes need not be independent. For simplicity assume the special case where the 

two bets have the same payoff distributions, but with a joint distribution as in Table 4. 
Now c4- m 4- b = (1+m)/2s0 6 = (1— m)/2 — c and therefore 0 € c < (1—m)/2. 


Calculation shows Var(X;) = 1 — m?, Cor(X,, X3) = 4c — (1 — m)? and 
Cor(X,, X2) = [4c — (1 — my — m?). The symmetry of the distribution shows 
that g( fi, f2) will have its maximum at fj = f2 = f so we simply need to maximize 
gCf) = (c +m) Indl 4-2/) t cln(1 — 2f). The result is f* = m/(2(2c + m)). We 
see that for m fixed, as c decreases from (1 — m)/2 and cor(X;, X2) = 1, to 0 and 
cor(X,, X2) = —(1 — m)/(1 + m), f* for each bet increases from m/2 to 1/2, as in 
Table 5. 


Table 4 Table 5 
Joint distribution of two "identical" fa- — - [* increases as Cor(X |. X5) decreases 
vorable coins with correlated outcomes 


Cor(X 1. X4) c f* 


X1: X2:1 -1 


l od — m)/2 m/2 
1 cm b 0 (üu- m2)/A m/(14- m?) 
-l b € —(1 — m)/Q +n) 0 1/2 
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It is important to note that for an exact solution or an arbitrarily accurate numerical 
approximation to the simultaneous bet problem, covariance or correlation information 
is not enough. We need to use the entire joint distribution to construct the g function. 

We stopped sports betting after our successful test for reasons including: 

(1) It required a person on site in Nevada. 

(2) Large amounts of cash and winning tickets had to be transported between casinos. 
We believed this was very risky. To the sorrow of others, subsequent events con- 
firmed this. 

(3) It was not economically competitive with our other operations. 

If it becomes possible to place bets telephonically from out of state and to transfer 
the corresponding funds electronically, we may be back. 


7. Wall street: the biggest game 


To illustrate both the Kelly criterion and the size of the securities markets, we return to 
the study of the effects of correlation as in Example 6.2. Consider the more symmetric 
and esthetically pleasing pair of bets U; and U2, with joint distribution given in Table 6. 

Clearly 0 € a < 1/2 and Cor(U,, U2) = Cor(U,, Ux) = 4a — | increases from 
—] to 1 as a increases from 0 to 1/2. Finding a general solution for (f, f) appears 
algebraically complicated (but specific solutions are easy to find numerically), which 
is why we chose Example 6.2 instead. Even with reduction to the special case m; = 
m, = m and the use of symmetry to reduce the problem to finding f* = fft = fy, 
a general solution is still much less simple. But consider the instance when a = 0 so 
Cor(U,, U2) = —1. Then g(f) = In(1 + 2mf) which increases without limit as f 
increases. This pair of bets 1s a "sure thing" and one should bet as much as possible. 

This is a simplified version of the classic arbitrage of securities markets: find a pair 
of securities which are identical or "equivalent" and trade at disparate prices. Buy the 
relatively underpriced security and sell short the relatively overpriced security, achiev- 
ing a correlation of —1 and "locking in" a riskless profit, An example occurred in 1983. 
My investment partnership bought $ 330 million worth of “old” AT&T and sold short 
$332.5 million worth of when-issued “new” AT&T plus the new “seven sisters" regional 
telephone companies. Much of this was done in a single trade as part of what was then 
the largest dollar value block trade ever done on the New York Stock Exchange (De- 
cember 1, 1983). 

In applying the Kelly criterion to the securities markets, we meet new analytic prob- 
lems. A bet on a security typically has many outcomes rather than just a few, as in 


Table 6 
Joint distribution of U, and U5 


Ui: U5 m» +! mnj;-—1l 


mil a 1/2—a 
my — 1 1/2- a a 
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most gambling situations. This leads to the use of continuous instead of discrete prob- 
ability distributions. We are led to find f to maximize g(f) = Eln(l + fX) = 
f ln(1 + fx)dP(x) where P (x) is a probability measure describing the outcomes. Fre- 
quently the problem is to find an optimum portfolio from among 7 securities, where 
n may be a "large" number. In this case x and f are n-dimension vectors and fx is 
their scalar product. We also have constraints. We always need 1 + fx > 0so In(-) is 
defined, and X` f; = 1 (or some c > 0) to normalize to a unit (or to a c > 0) invest- 
ment. The maximization problem is generally solvable because g( f) is concave. There 
may be other constraints as well for some or all i such as f; > 0 (no short selling), or 
fi S Mi or f; 2 m; (limits amount invested in ith security), or $^ |f;| € M (limits 
total leverage to meet margin regulations or capital requirements). Note that in some 
instances there is not enough of a good bet or investment to allow betting the full f*, so 
one is forced to underbet, reducing somewhat both the overall growth rate and the risk. 
This is more a problem in the gaming world than in the much larger securities markets. 
More on these problems and techniques may be found in the literature. 


7.1. Continuous approximation 


There is one technique which leads rapidly to interesting results. Let X be a random 
variable with P(X = m +s) = P(X =m — s) =.5. Then E(X) = m, Var(X) = s?. 
With initial capital Vo, betting fraction f, and return per unit of X, the result is 

VCf) = Vo(1 +0 — fr + fX) = Vo(13 +r + f(X — 7), 
where r is the rate of return on the remaining capital, invested in, e.g., Treasury bills. 
Then 

g(f) = E(G(f)) = E(In(V(f)/ Vo)) = Eln(1 +r + f(X — 7) 

= SIn(l+r+ f(m—rt+s)) + .5in(l+r+ f(n — r — 3)). 

Now subdivide the time interval into n equal independent steps, keeping the same drift 
and the same total variance. Thus m, s? and r are replaced by m/n, s?/n and r/n, 
respectively. We have n independent X;,i = 1,...,n, with 

P(X; =m/n sn 2) = P(X; 2 m/n— gne) —.5. 
Then 


n 
Vin f)/ Vo =] [0 +0- for fx). 
i=l 
Taking £ (log(-)) of both sides gives g( f). Expanding the result in a power series leads 
to 
gU) =r + fim =r) —s* f?/2+0(n7') (7.1) 


where O(n- !/?) has the property n 1/?0(n- !/?) is bounded as n — oo. Letting n — oo 
in (7.1) we have 


golf) =r + f(m —r) — s? f? pa. (7.2) 
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The limit V = Voo(f) of V (f) as n — oo corresponds to a log normal diffusion 
process, which is a well-known model for securities prices. The "security" here has 
instantaneous drift rate m, variance rate s2, and the riskless investment of “cash” earns 
at an instantaneous rate r. Then goo(f) in (7.2) is the (instantaneous) growth rate of 
capital with investment or betting fraction f. There is nothing special about our choice 
of the random variable X. Any bounded random variable with mean E(X) — m and 
variance Var(X) — s? will lead to the same result. Note that f no longer needs to 
be less than or equal to 1. The usual problems, with log(-) being undefined for negative 
arguments, have disappeared. Also, f < O causes no problems. This simply corresponds 
to selling the security short. If m « r this could be advantageous. Note further that the 
investor who follows the policy f must now adjust his investment “instantaneously”. 
In practice this means adjusting in tiny increments whenever there is a small change 
in V. This idealization appears in option theory. It is well known and does not prevent 
the practical application of the theory (Black and Scholes, 1973). Our previous growth 
functions for finite sized betting steps were approximately parabolic in a neighborhood 
of f* and often in a range up to 0 € f < 2f*, where also often 2f* = fe. Now with 
the limiting case (7.2), g55 (f) is exactly parabolic and very easy to study. 

Lognormality of V ( f)/ Vo means log(V ( f)/ Vo) is N(M, S?) distributed, with mean 
M = 8o0(f)t and variance S? = Var(Go.(f))t for any time f. From this we can de- 
termine, for instance, the expected capital growth and the time t, required for V( f) 
to be at least k standard deviations above Vo. First, we can show by our previous 
methods that Var(Ga(f)) = sf hence Sdev(Ga(f)) = sf. Solving fígoo = 
kt, 12 Sdev(Goo(f)) gives th ge hence the expected capital growth t; goo, from which 
we find tj. The results are summarized in Equations (7.3). 


f'-m-rnls, — ge(f) ort fim-r -s?f?p2, 

8o0( f*) = (m — r} as? +r, 

Var(Goo(f)) = s? f?, Sdev(Goo(f)) = sf, 

tk8ol f) = K°s? f^ 1 gos, 

tk = s? f? / g. , (7.3) 


Examination of the expressions for t; goo ( f) and tg show that each one increases as f 
increases, for 0 < f < f, where f, is the positive root of s? f?/2— (n —r) f -r «0 
and f, > 2f*. 

Comment: The capital asset pricing model (CAPM) says that the market portfolio 
lies on the Markowitz efficient frontier E in the (s, m) plane at a (generally) unique 
point P = (so, mo) such that the line determined by P and (s = 0, m = r) is tangent 
to E (at P). The slope of this line is the Sharpe ratio $ = (mo — ro)/so and from (7.3) 
goo Cf") = S?/2 +r so the maximum growth rate g55 (f *) depends, for fixed r, only 
on the Sharpe ratio. (See Quaife (1995).) Again from (7.3), f* = 1 when m =r + s? 
in which case the Kelly investor will select the market portfolio without borrowing or 
lending. If m > r + s? the Kelly investor will use leverage and if m < r + s? he will 
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invest partly in T-bills and partly in the market portfolio. Thus the Kelly investor will 
dynamically reallocate as f" changes over time because of fluctuations in the forecast 
m, r and s?, as well as in the prices of the portfolio securities. 

From (7.3), Zoo(1) = m — s?/2 so the portfolios in the (s,m) plane satisfying 
m — s?/2 = C, where C is a constant, all have the same growth rate. In the contin- 
uous approximation, the Kelly investor appears to have the utility function U (s, m) = 
m — s*/2. Thus, for any (closed, bounded) set of portfolios, the best portfolios are ex- 
actly those in the subset that maximizes the one parameter family m — s*/2 = C. See 
Kritzman (1998), for an elementary introduction to related ideas. 


Example 7.1. The long run revisited. For this example let r = 0. Then the basic equa- 
tions (7.3) simplify to 


r=0: ft =m/s*, golf) =mf —s° f?/2, 

Bool f*) = m? 257, 

Var(Goo(f)) = s? f?, Sdev(Goo(f)) = sf. (7.4) 
How long will it take for V(f*) 2 Vo with a specified probability? How about 


V(f*/2)? To find the time t needed for V(f) 2 Vo at the k standard deviations level of 
significance (k = 1, P = 84%; k = 2, P = 98%, etc.) we solve for t = tj: 


tgo (f) = kt? Sdev(Gool P). (7.5) 


We get more insight by normalizing all f with f*. Setting f = cf* throughout, we 
find when r = 0 


r=0: f*t=m/s*, f =cm/s*, 

&oo (cf) = m° (e — e? /2)/s*, 

Sdev(Goo(cf*)) = cm/s, 

tgoo(cf ) = k?e/( — c/2), 

t(k, cf *) = Ks? / (mA — ¢/2)*). (7.6) 


Equations (7.6) contain a remarkable result: V( f) 2 Vo at the k standard deviation 
` level of significance occurs when expected capital growth tgo; = K?^c/(1 — c/2) and 
this result is independent of m and s. For f = f* (c = | in (7.6)), this happens for 
k = 1 at tg% = 2 corresponding to V = Voe? and at k = 2 for tgoo = 8 corresponding 
to V = Voe®. Now e? = 2981 and at a 10% annual (instantaneous) growth rate, it takes 
80 years to have a probability of 98% for V > Vo. At a 20% annual instantaneous rate it 
takes 40 years. However, for f = f*/2, the number for k = 1 and 2 are tgo; = 2/3 and 
8/3, respectively, just 1/3 as large. So the waiting times for Prob(V 2 Vo) to exceed 
84% and 98% become 6.7 years and 26.7 years, respectively, and the expected growth 
rate is reduced to 3/4 of that for f*. 
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Comment: Fractional Kelly versus Kelly when r = 0 

From Equations (7.6) we see that goo(cf*)/e@0(f*) = c2— c) 0 € c « oo, 
showing how the growth rate relative to the maximum varies with c. The relative risk 
Sdev(Gos (cf*)) /Sdev(Goo(f*)) = c and the relative time to achieve the same ex- 
pected total growth is 1/c(2 — c), 0 « c « 2. Thus the relative "spread" for the same 
expected total growth is 1/(2 — c), 0 « c « 2. Thus, even by choosing c very small, the 
spread around a given expected growth cannot be reduced by 1/2. The corresponding 
results are not quite as simple when r > 0. 


7.2. The (almost) real world 


Assume that prices change “continuously” (no jumps"), that portfolios may be revised 
"continuously", and that there are no transactions costs (market impact, commissions, 
"overhead"), or taxes (Federal, State, city, exchange, etc.). Then our previous model 
applies. 


Example 7.2. The S&P 500 Index. Using historical data we make the rough estimates 
m= 11,5 = .15,r = .06. The equations we need for r # 0 are the generalizations 
of (7.6) tor #0 and f = cf*, which follow from (7.3): 

cf* 2 c(m — r)Js?, 

Boolef*) = ((m = r} (e = 7/2)? +r, 

Sdev(Goo(cf*)) = c(m — r)/s, 

tgoo (cf) = K* c / (c — c2 rs*/(m — ry), 

t(k,cf*) = Kc (m — r*/s?)/ (((m — ry /s?)(c — c?) + py. (7.7) 


If we define m = m — r, Goo = G% — r, Boo = foo — r, then substitution into 
Equations (7.7) give Equations (7.6), showing the relation between the two sets. It also 
shows that examples and conclusions about P (V, > Vo) in ther = 0 case are equivalent 
to those about P (In(V (t)/ Vo) > rt) in the r # 0 case. Thus we can compare various 
strategies versus an investment compounding at a constant riskless rate r such as zero 
coupon U.S. Treasury bonds. 

From Equations (7.7) and c = 1, we find 


f'—222, — ge(f')-.15, . Sdev(Goo( f*)) = 33, 
tgl f*) = .96K2, t = 8.32? years. 


Thus, with f* = 2.22, after 8.32 years the probability is 84% that V, > Vo and 
the expected value of log(V,,/ Vo) = .96 so the median value of V, / Vo will be about 
e. 96 = 2.61. 

With the usual unlevered f — 1, and c — .45, we find using (7.3) 

£oo(1) = m — s?/2 = .09875, Sdev(Goo(1)) = .15, 


tgoo(1) = .23k?, t(k, 45 f*) = 2.31k years. 
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Writing 1299 = h(c) in (7.7) as 


h(c) = K*/(M/e +rs?/((m — r?c?) ~ 1/2) 
we see that the measure of riskiness, A(c), increases as c increases, at least up to the 


point c = 2, corresponding to 2 f* (and actually beyond, up to 1 +,/1 + "e ). 
Writing t (k, cf *) — t(c) as 


t(c) =k (in — 0/5?) / (Qm — r?/)Q — e/2 + r/c? 


shows that t (c) also increases as c increases, at least up to the point c = 2. Thus for 
smaller (more conservative) f = cf*, c x 2, specified levels of P(V, > Vo) are 
reached earlier. For c < 1, this comes with a reduction in growth rate, which reduction 
is relatively small for f near f*. 

Note: During the period 1975-1997 the short term T-bill total return for the year, 
a proxy for r if the investor lends (i.e, f < 1), varied from a low of 2.90% (1993) 
to a high of 14.7196 (1981). For details, see Ibbotson Associates, 1998 (or any later) 
Yearbook. 

A large well connected investor might be able to borrow at broker’s call plus about 
1%, which might be approximated by T-bills plus 1%. This might be a reasonable esti- 
mate for the investor who borrows (f > 1). For others the rates are likely to be higher. 
For instance the prime rate from 1975-1997 varied from a low of 6% (1993) to a high 
of 1996 (1981), according to Associates First Capital Corporation (1998). 

As r fluctuates, we expect m to tend to fluctuate inversely (high interest rates tend 
to depress stock prices for well known reasons). Accordingly, f* and goo will also 
fluctuate so the'long term S&P index fund investor needs a procedure for periodically 
re-estimating and revising f* and his desired level of leverage or cash. 

To illustrate the impact of rp > r, where ry is the investor's borrowing rate, suppose 
rp in example (7.2) is r + 2% or .08, a choice based on the above cited historical values 
for r, which is intermediate between “good” rp = r + 1%, and "poor" rp = the prime 
rate = r + 3%. We replace r by rp in Equations (7.7) and, if f* > 1, f* = 1.33, 
Zol f*) = .100, Sdev(Goo( f*)) = 20, tgo (f) = 4k*, t = AK? years. Note how 
greatly f* is reduced. 


Comment. Taxes 

Suppose for simplicity that al] gains are subject to a constant continuous tax rate 7 
and that all losses are subject to a constant continuous tax refund at the same rate T. 
Think of the taxing entities, collectively, as a partner that shares a fraction T of all gains 
and losses. Then Equations (7.7) become: 


cf* — c(m — r)/s* (1 — T), 
£oo (cf) = ((m — r} (e - ?7/2)/s +r ~T), 
Sdev(Goo(cf*)) =c(m —r)/s, 
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fgoo(cf *) = kei (c — e /2+r(l1— T)s?/(m — yo 


t(k, cf^) = Kc? ((m = ry? /s?) [ (((m = ry^[s?) (c — c?2) Lr(l— S 
(7.7T) 


It is interesting to see that cf* increases by the factor 1/(1 — T). For a high income 
California resident, the combined state and federal marginal tax rate is 45% so this 
factor is 1/.55 = 1.82. The amplification of cf * leads to the same growth rate as before 
except for a reduction by rT. The Sdev is unchanged and t (k, cf *) is increased slightly. 
However, as a practical matter, the much higher leverage needed with a high tax rate 
is typically not allowed under the margin regulation or is not advisable because the 
inability to continuously adjust in the real world creates dangers that increase rapidly 
with the degree of leverage. 


7.3. The case for "fractional Kelly" 


Figure 5 shows three g curves for the true m : m, = .5m,, 1.0m, and 1.5m,, where 
n, is the estimated value of m. The vertical lines and the slanting arrows illustrate the 
reduction in g for the three choices of: f = .5/7, fy and 1.5 f7. For example with 
f = Sf or "half Kelly", we have no loss and achieve the maximum g = .25, in 
case m, = .5m,. But if m, = m, then g = .75, a loss of .25 and if m, = 1.5m, then 
g = 1.25, a loss of 1.0, where all g are in units of m? /2s?. This is indicated both by 
LOSS; and LOSS; on the vertical line above f/f" = .5, and by the two corresponding 
arrows which lead upward, and in this case to the right, from this line. A disaster occurs 
when m; = .5m, but we choose f = 1.5/7. This combines overbetting f* by 50% 
with the overestimate of m, — 2m,. Then g — —.75 and we will be ruined. It is still 
bad to choose f = f* when m, = .5m, for then g = 0 and we suffer increasingly wild 
oscillations, both up and down, around our initial capital. During a large downward 
oscillation experience shows that bettors will generally either quit or be eliminated by a 
minimum bet size requirement. 
Some lessons here are: 
(1) To the extent m, is an uncertain estimate of m,, it is wise to assume m, « m, and 
to choose f < f7 by enough to prevent g < 0. 

Estimates of m, in the stock market have many uncertainties and, in cases of 
forecast excess return, are more likely to be too high than too low. The tendency 
is to regress towards the mean. Securities prices follow a “non-stationary process" 
where m and s vary somewhat unpredictably over time. The economic situation can 
change for companies, industries, or the economy as a whole. Systems that worked 
may be partly or entirely based on data mining so m, may be substantially less 
than me. Changes in the "rules" such as commissions, tax laws, margin regulations, 
insider trading laws, etc., can also affect m,. Systems that do work attract capital, 
which tends to push exceptional m, down towards average values. The drift down 
means Me > m, is likely. 
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g(f) unit = m?/2s? 
PENALTY FOR EACH m, IF YOU CHOOSE THIS f: 
o5f, of, 15f 
H d I 


pf. 


Fig. 5. Penalties for choosing f = fe z f* = fi. 


Sports betting has much the same caveats as the securities markets, with its own 
differences in detail. Rules changes, for instance, might include: adding expansion 
teams; the three point rule in basketball; playing overtime sessions to break a tie; 
changing types of bats, balls, gloves, racquets or surfaces. 

Blackjack differs from the securities and sports betting markets in that the prob- 
abilities of outcomes can in principle generally be either calculated or simulated to 
any desired degree of accuracy. But even here m, is likely to be at least somewhat 
less than me. Consider player fatigue and errors, calculational errors and mistakes 
in applying either blackjack theory or Kelly theory (e.g., calculating f* correctly, 
for which some of the issues have been discussed above), effects of a fixed shuffle 
point, non-random shuffling, preferential shuffling, cheating, etc. 

Subject to (1), choosing f in the range .5 7 < f < f offers protection against 
g < 0 with a reduction of g that is likely to be no more than 25%. 
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Example 7.3. The great compounder. In 1964 a young hedge fund manager acquired a 
substantial interest in a small New England textile company called Berkshire Hathaway. 
The stock traded then at 20. In 1998 it traded at 70,000, a multiple of 3500, and an annu- 
alized compound growth rate of about 27%, or an instantaneous rate of 24%. The once 
young hedge fund manager Warren Buffett is now acknowledged as the greatest investor 
of our time, and the world's second richest man. You may read about Buffett in Buffett 
and Clark (1997), Hagstrom (1994, 2004), Kilpatrick (1994) and Lowenstein (1995). If, 
as | was, you were fortunate enough to meet Buffett and identify the Berkshire opportu- 
nity, what strategy does our method suggest? Assume (the somewhat smaller drift rate) 
m = .20, s = .15, r = .06. Note: Plausible arguments for a smaller future drift rate 
include regression towards the mean, the increasing size of Berkshire, and risk from the 
aging of management. A counter-argument is that Berkshire’s compounding rate has 
been as high in its later years as in its earlier years. However, the S&P 500 Index has 
performed much better in recent years so the spread between the growth rates of the In- 
dex and of Berkshire has been somewhat less. So, if we expect the Index growth rate to 
revert towards the historical mean, then we expect Berkshire to do so even more. From 
Iquations (7.3) or (7.7), 


f* = 6.22, goo f*) = .495, Sdev(Goo( f*)) = .93, 
t8o0(f*) = 1.76k?, t = 3.54k* years. 


Compare this to the unlevered portfolio, where f = 1 and c = 1/6.22 = .1607. We 
find: 


f=, goo (f) = .189, Sdev(Goo(f)) = .15, 
th8oo(f) = 119k", ty = .63k* years. 


Leverage to the level 6.22 would be inadvisable here in the real world because secu- 
rities prices may change suddenly and discontinuously. In the crash of October, 1987, 
the S&P 500 index dropped 23% in a single day. If this happened at leverage of 2.0, 
the new leverage would suddenly be 77/27 — 2.85 before readjustment by selling part 
of the portfolio. In the case of Berkshire, which is a large well-diversified portfolio, 
suppose we chose the conservative f — 2.0. Note that this is the maximum initial lever- 
age allowed “customers” under current regulations. Then go9(2) = .295. The values 
in 30 years for median Væ/ Vo are approximately: f = 1, Voo/Vo = 288; f = 2, 
Voo/Vo = 6,974; f = 6.22, Voo/Vo = 2.86 x 10°. So the differences in results of 
leveraging are enormous in a generation. (Note: Art Quaife reports s = .24 for 1980- 
1997. The reader is invited to explore the example with this change.) 

The results of Section 3 apply directly to this continuous approximation model of 
a (possibly) leveraged securities portfolio. The reason is that both involve the same 
"dynamics", namely log G, ( f) is approximated as (scaled) Brownian motion with drift. 
So we can answer the same questions here for our portfolio that were answered in 
Section 3 for casino betting. For instance (3.2) becomes 


Prob(V (1)/ Vo € x for some t) = x^(2855/ Var(Goo)) (7.8) 
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where ^ means exponentiation and 0 < x < l. Using (7.4), for r = 0 and f = f*, 
22oo/ Var(Goo) = | so this simplifies to 


Prob(-) = x. (7.9) 


Compare with Example 3.3 For 0. < r < m and f = f* the exponent of x in (7.9) 
becomes | 4- 2rs? /(m — ry? and has a positive first derivative so, as r increases, P(-) 
decreases since 0 < x < 1, tending to 0 as r tends to m, which is what we expect. 


7.4. A remarkable formula 


In earlier versions of this chapter the exponent in Equations (3.2), (7.8) and (7.9) 
were off by a factor of 2, which I had inadvertently dropped during my derivation. 
Subsequently Don Schlesinger posted (without details) two more general continuous 
approximation formulas for the r = O case on the Internet at www.bjmath.com dated 
June 19, 1997. 

If Vo is the initial investment and y > 1 > x > O then for f* the probability that 
V (t) reaches y Vo before x Vo is 


Prob(V (t, f*) reaches y Vo before x Vo) = (1 — x)/(1 — (x/y)) (7.10) 
and more generally, for f = cf*, 0 < c < 2, 


Prob(V (t, cf*) reaches y Vo before x Vo) 


= [1 = x%Q2/e - I] /[1 — 6/»^Q/e = 0] (7.11) 
where A means exponentiation. 
Clearly (7.10) follows from (7.11) by choosing c = I. The r = O case of our 


Equation (7.8) follows from (7.11) and the r = O case of our Equation (7.9) follows 
from (7.10). We can derive a generalization of (7.11) by using the classical gambler's 
ruin formula (Cox and Miller, 1965, p. 31, Equation (2.0)) and passing to the limit 
as step size tends to zero (Cox and Miller, 1965, pp. 205-206), where we think of 
log(V (t, f)/ Vo) as following a diffusion process with mean goo and variance u(Gogo), 
initial value 0, and absorbing barriers at log y and log x. The result is 


Prob(V (t, cf *) reaches y Vo before x V) = [I — x^al/[1 — (x/y)^a] (7.12) 


where a = 28%/ V (Gæ). = 2M/V where M and V are the drift and variance, respec- 
tively, of the diffusion process per unit time. Alternatively, (7.12) is a simple restatement 
of the known solution for the Wiener process with two absorbing barriers (Cox and 
Miller, 1965, Example 5.5). 

As Schlesinger notes, choosing x = 1/2 and y = 2 in (7.10) gives Prob(V (t, f*) 
doubles before halving) — 2/3. Now consider a gambler or investor who focuses only 
on values V, = 2"Vy,n = 0, x1, £2...., multiples of his initial capital. In log space, 
log(V, / Vo) = n log 2 so we have a random walk on the integer multiples of log 2, where 
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the probability of an increase is p = 2/3 and of a decrease, g = 1/3. This gives us a 
convenient compact visualization of the Kelly strategy's level of risk. 

If instead we choose c = 1/2 (“half Kelly”), Equation (7.11) gives Prob(V (t, f*/2) 
doubles before halving) = 8/9 yet the growth rate go5 (f£ * /2) = .75gos (f *) so “half 
kelly" has 3/4 the growth rate but much less chance of a big loss. 

^ second useful visualization of comparative risk comes from Equation (7.8) which 
vives 


Prob(V (t, cf *)/ Vo € x for some t) —x^QJ/c — 1). (7.13) 


lor c = | we had Prob(.) = x and for c = 1/2 we get Prob() = x3. Thus “half 
Kelly” has a much lessened likelihood of severe capital loss. The chance of ever losing 
half the starting capital is 1/2 for f = f* but only 1/8 for f = f*/2. My gambling 
and investment experience, as well as reports from numerous blackjack players and 
teams, suggests that most people strongly prefer the increased safety and psychological 
comfort of "half Kelly" (or some nearby value), in exchange for giving up 1/4 of their 
growth rate. 


8. A case study 


In the summer of 1997 the XYZ Corporation (pseudonym) received a substantial 
amount of cash. This prompted a review of its portfolio, which is shown in Table 7 
in the column 8/17/97. The portfolio was 54% in Biotime, ticker BTIM, a NASDAQ 
biotechnology company. This was due to existing and historical relationships between 
people in XYZ Corp. and in BTIM. XYZ's officers and directors were very knowledge- 


Table 7 
Statistics for logs of monthly wealth relatives, 3/31/92 through 6/30/97 


Berkshire BioTime SP500 T-bills 
Monthly Mean .0264 .0186 .0146 .0035 
Standard deviation .0582 2237 .0268 .0008 
Annual Mean -3167 -2227 .}753 .0426 
Standard deviation .2016 .7748 .0929 .0028 
Monthly Covariance .0034 —.0021 .0005 1.2E—06 
.0500 —.0001 3.2E—05 
.0007 5.7E—06 
6.7E—07 
Monthly Correlation 1.0000 —.1581 .2954 .0257 
1.0000 —.0237 773 
1.0000 .2610 


1.0000 
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able about BTIM and felt they were especially qualified to evaluate it as an investment. 
They wished to retain a substantial position in BTIM. 
The portfolio held Berkshire Hathaway, ticker BRK, having first purchased it in 1991. 


6.1. The constraints 


Dr. Quaife determined the Kelly optimal portfolio for XYZ Corp. subject to certain con- 
straints. The list of allowable securities was limited to BTIM, BRK, the Vanguard 500 
(S&P 500) Index Fund, and T-bills. Being short T-bills was used as a proxy for mar- 
gin debt. The XYZ broker actually charges about 2% more, which has been ignored in 
this analysis. The simple CAPM (capital asset pricing model) suggests that the investor 
only need consider the market portfolio (for which the S&P 500 is being substituted 
here, with well known caveats) and borrowing or lending. Both Quaife and the author 
were convinced that BRK was and is a superior alternative and their knowledge about 
and long experience with BRK supported this. 

XYZ Corp. was subject to margin requirements of 5096 initially and 3096 mainte- 
nance, meaning for a portfolio of securities purchased that initial margin debt (money 
lent by the broker) was limited to 5096 of the value of the securities, and that whenever 
the value of the account net of margin debt was less than 3096 of the value of the secu- 
rities owned, then securities would have to be sold until the 3096 figure was restored. 

In addition XYZ Corp. wished to continue with a "significant" part of its portfolio in 
BTIM. 


8.2. The analysis and results 


Using monthly data from 3/31/92 through 6/30/97, a total of 63 months, Quaife finds 
the means, covariances, etc. given in Table 7. 

Note from Table 7 that BRK has a higher mean and a lower standard deviation than 
BTIM, hence we expect it to be favored by the analysis. But note also the negative 
correlation with BTIM, which suggests that adding some BTIM to BRK may prove 
advantageous. 

Using the statistics from Table 7, Quaife finds the following optimal portfolios, under 
various assumptions about borrowing. 

As expected, BRK is important and favored over BTIM but some BTIM added to the 
BRK is better than none. 

If unrestricted borrowing were allowed it would be foolish to choose the correspond- 
ing portfolio in Table 8. The various underlying assumptions are only approximations 
with varying degrees of validity: Stock prices do not change continuously; portfo- 
lios can't be adjusted continuously; transactions are not costless; the borrowing rate 
is greater than the T-bill rate; the after tax return, if different, needs to be used; the 
process which generates securities returns is not stationary and our point estimates of 
the statistics in Table 7 are uncertain. We have also noted earlier that because “over- 
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Table 8 
Optimal portfolio allocations with various assumptions about borrowing 


Security fraction 


Security No borrowing 50% margin Unrestricted borrowing 
Berkshire .63 1.50 6.26 
BioTime 37 50 1.18 
S&P 500 .00 .00 12.61 
T-bills .00 —1.00 —19.04 


Portfolio growth rate 


Mean .36 .62 2.10 
Standard deviation .29 45 2.03 


betting” is much more harmful than underbetting, “fractional Kelly” is prudent to the 
extent the results of the Kelly calculations reflect uncertainties. 

In fact, the data used comes from part of the period 1982-1997, the greatest bull 
market in history. We would expect returns in the future to regress towards the mean so 
the means in Table 7 are likely to be overestimates of the near future. The data set is 
necessarily short, which introduces more uncertainty, because it 1s limited by the amount 
of BTIM data. As a sensitivity test, Quaife used conservative (mean, std. dev.) values for 
the price relatives (not their logs) for BRK of (1.15, .20), BTIM of (1.15, 1.0) and the 
S&P 500 from 1926-1995 from Ibbotson (1998) of (1.125, .204) and the correlations 
Irom Table 7. The result was fractions of 1.65, .17, .18 and — 1.00 respectively for BRK, 
BTIM, S&P 500 and T-bills. The mean growth rate was .19 and its standard deviation 
was .30. 


5.3. The recommendation and the result 


The 50% margin portfolio reallocations of Table 8 were recommended to XYZ Corp.’s 
board on 8/17/97 and could have been implemented at once. The board.elected to do 
nothing. On 10/9/97. (in hindsight, a good sale at a good price) it sold some BTIM 
and left the proceeds in cash (not good). Finally on 2/9/98 after a discussion with 
both Quaife and the author, it purchased 10 BRK (thereby gaining almost $140,000 
by 3/31/98, as it happened). The actual policy, led to an increase of 73.5%. What would 
have happened with the recommended policy with no rebalance and with one rebal- 
ance on 10/6/97? The gains would have been 117.6% and 199.4%, respectively. The 
gains over the suboptimal board policy were an additional $475,935 and $1,359,826, 
respéctively. 

The optimal policy displays three important features in this example: the use of lever- 
age, the initial allocation of the portfolio, and possible rebalancing (reallocation) of the 
portfolio over time. Each of these was potentially important in,determining the final 
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result. The potential impact of continuously rebalancing to maintain maximum margin 
is illustrated in Thorp and Kassouf (1967), Appendix A, The Avalanche Effort. 

The large loss from the suboptimal policy was much more than what would have 
been expected because BRK and BTIM appreciated remarkably. In .62 years, BRK was 
up 60.4% and BTIM was up 62.9%. This tells us that—atypically—in the absence of 
rebalancing, the relative initial proportions of BRK and BTIM did not matter much 
over the actual time period. However, rebalancing to adjust the relative proportions of 
BRK and BTIM was important, as the actual policy's sale of some BTIM on 10/9/97 
illustrated. Also, rebalancing was important for adjusting the margin level as prices, in 
this instance, rose rapidly. 

Table 8 illustrates what we might have normally expected to gain by using 5096 mar- 
gin, rather than no margin. We expect the difference in the medians of the portfolio 
distributions to be $1,080,736[exp(.62 x .62) — exp(.36 x .62)] = $236,316 or 21.9% 
which is still large. 


8.4. The theory for a portfolio of securities 


Consider first the unconstrained case with a riskless security (T-bills) with portfolio 
fraction fo and n securities with portfolio fractions fi,..., fa. Suppose the rate of 
return on the riskless security is r and, to simplify the discussion, that this is also the 
rate for borrowing, lending, and the rate paid on short sale proceeds. Let C = (s;;) be 
the matrix such that s;;, i, j = 1,...,n, is the covariance of the ith and jth securities 
and M = (mj,mo,..., my)! be the row vector such that m;, i = 1,...,”, is the drift 
rate of the ith security. Then the portfolio satisfies 


foto tf =), 
m — for + fim, +-+ fum, =r + fiim —r) god film — rn) 
=r+F'(M — R), 

s? = F'CF (8.1) 
where F! = (fo -> fa) and T means “transpose”, and R is the column vector 
Gr, ry... r)" of length n. 

Then our previous formulas and results for one security plus a riskless security apply 
to goo C fis... fi) = m — s?/2. This is a standard quadratic maximization problem. 
Using (8.1) and solving the simultaneous equations 9go5/8f; — 0, i = 1,..., n, we get 


F* = C^'[M — R], 
Bol fE,- --, fZ) mr (P CF*/2 (8.2) 


where for a unique solution we require C^! to exist, i.e., detC Æ 0. When all the 
securities are uncorrelated, C is diagonal and we have f* = (m; — r)/s;ij or f? = 
(mj; — r)/s?, which agrees with Equation (7.3) when n =, 1. 
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Note: BRK issued a new class of common, ticker symbol BRK.B, with the old com- 
mon changing its symbol to BRK.A. One share of BRK.A can be converted to 30 shares 
of BRK.B at any time, but not the reverse. BRK.B has lesser voting rights and no right 
to assign a portion of the annual quota of charitable contributions. Both we and the mar- 
ket consider these differences insignificant and the A has consistently traded at about 
30 times the price of the B. 

If the price ratio were always exactly 30 to 1 and both these securities were included 
in an analysis, they would each have the same covariances with other securities, so 
det C = 0 and C^! does not exist. 

If there is an initial margin constraint of q, 0 < q x 1, then we have the additional 
restriction 


Lib o fal S 1/4. (8.3) 


The n-dimensional subset in (8.3) is closed and bounded. 

If the rate for borrowing to finance the portfolio is rp = r + ep, ej È O, and the rate 
paid on the short sale proceeds is r; = f — €s, €s > 0, then the m in Equation (8.1) is 
altered. Let x = max(x, 0) and x7 = max(0, —x) sox = x* — x7 for all x. Define 
f*- ft +--+ f; , the fraction of the portfolio held long. Let £^ = f, tooth, 
the fraction of the portfolio held short. 

Casel. ft &] 


m=r+ fimi >r) +--+ famir) esf. (8.4.1) 
Case2. f ^ >1 
m=rt film —r) +--+ filmy, =r) elft —N esf. (8.4.2) 


9. My experience with the Kelly approach 


How does the Kelly-optimal approach do in practice in the securities markets? In a 
little-known paper (Thorp, 1971) I discussed the use of the Kelly criterion for portfolio 
management. Page 220 mentions that “On November 3, 1969, a private institutional 
investor decided to ...use the Kelly criterion to allocate its assets”. This was actually 
a private limited partnership, specializing in convertible hedging, which I managed. 
A notable competitor at the time (see Institutional Investor (1998)) was future Nobel 
prize winner Harry Markowitz. After 20 months, our record as cited was a gain of 39.9% 
versus a gain for the Dow Jones Industrial Average of +4.2%. Markowitz dropped out 
after a couple of years, but we liked our results and persisted. What would the future 
bring? 

Up to May 1998, twenty eight and a half years since the investment program began. 
The partnership and its continuations have compounded at approximately 20% annu- 
ally with a standard deviation of about 6% and approximately zero correlation with the 
market (“market neutral”). Ten thousand dollars would, tax exempt, now be worth 18 
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million dollars. To help persuade you that this may not be luck, 1 estimate that dur- 
ing this period I have made about $80 billion worth of purchases and sales ("action", 
in casino language) for my investors. This breaks down into something like one and a 
quarter million individual "bets" averaging about $65.000 each, with on average hun- 
dreds of “positions” in place at any one time. Over all, it would seem to be a moderately 
“Jong run” with a high probability that the excess performance is more than chance. 


10. Conclusion 


Those individuals or institutions who are long term compounders should consider the 
possibility of using the Kelly criterion to asymptotically maximize the expected com- 
pound growth rate of their wealth. Investors with less tolerance for intermediate term 
risk may prefer to use a lesser function. Long term compounders ought to avoid using 
a greater fraction ("overbetting"). Therefore, to the extent that future probabilities are 
uncertain, long term compounders should further limit their investment fraction enough 
to prevent a significant risk of overbetting. 
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Appendix A. Integrals for deriving moments of E, 


oo 
lla, b) = Í exp| - (a^x? + b?/x?°)] dx, 
0 
9 ud 9 2 
L,(a?, 5?) = Í x" exp[ - (a?x? + /3?)] dx. 
0 
Given lo find h 


oo 
hla, b^) = Í exp|-(a°x? + b^ [x?)] dx 
0 


0 
-f exp| - (a? /u? + 5?u?)](—du/u?) 


oo 
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where x = l/u and dx = —du/u? so 


oo 
lo(a?, b?) = f x? exp| - (bx? +a? /x?)] = I_2(b’, a’), 
0 


hence 
2 ,2Y . 2 2) _ J/n —2|ab| 
I-2(a^ b) = ob) = se , 
oo oo 
Io -f exp[ - (ax? + b?/x?)| dx = U - vie -f V dU (A.1) 
0 0 


where U = exp[-], dV = dx, dU = (exp[- ])(—2a2x + 2D?x ^?) and V = x so 

Ip = exp[—(a?x? + b?/x?)] - x19° 

- [ ^ (22d + 2b*/x?) exp[- (a2? + b*/x?)] dx 
= 2a? n (a?, b’) — 2b? 1.» (a?, b’). 

Hence: 

Io(a^, 5?) = 2a? h(a?, b?) — 28? 1.» (a?, b°) 
and I_2(a*, b?) = Io(b*, a?) so substituting and solving for I» gives 

h(a’, 5?) = IU b?) + 2b? Io(b?, a?)). 


Comments. 
(1) We can solve for all even n by using lo, 7—2 and h, and integration by parts. 
(2) We can use the indefinite integral Jo corresponding to 7o, and the previous methods, 
to solve for J_2, J2, and then for all even n. Since 


Io(a?, b?) = VE Lab 


la| 
then 
l »(a?, b’) = E ete and 
L [vz vE) Ze - 
I 2p — + 2b? 2lab| — 2|ab| 1 Wb 
er r)- st [u^ oN am (Mal 2b. 


Appendix B. Derivation of formula (3.1) 


This is based on a note from Howard Tucker. Any errors are mine. 
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From the paper by Paranjape and Park, if x (1) is standard Brownian motion, if a # 0, 
b 0, 


P(X(t) <Sat+b, Ox t€ T | X(T) — s) 
2b 
l-ap- Ter es- 5| ifs X aT +b, 
if s — aT +B. 


Write this as: 
P(X(t) <at+b, Oxt«T | X(T)) 
=j exp| -2b(a7 +b-— xn) 7} if X(T) <aT +b. 
Taking expectations of both sides of the above, we get 
P(X() €at - b, O<1 <T) 


aT 4b . . 1 M 
=f (1 — e rere) em ds 


-oo ~V 2n T 
l aT +b » e 24^ at b "AN 
— e 5 /2T ds — e 6720/21 ds 
V2nT J- V2nT J- 


Hence 


P(X going above line at + b during [0, T]) = 1 — previous probability 


—s? /2T —2ub . 


ds +e e T du, 


l oo 
= — e 
JV2nT i 


where 4 = s — 2b. 


] aT —b 
V2nT I. 
, (B.1) 


Now, when a = 0, b > 0, 


2 (9 5 
P| sup xo 2b|- 5] e "777 dy, 
Oct T zT Jp 


which agrees with a known formula (see, e.g., p. 261 of Tucker (1967)). In the case a > 
0, when 7 — oo, since VT /T — O and VT = s.d. of X (T), the first integral > 0, 
the second integral — 1, and P(X ever rises above line at + b) = e-?^^. Similarly, in 
the case a < 0, P (ever rises above line at + b) = 1. 

The theorem it comes from is due to Sten Malmquist, On certain confidence contours 
for distribution functions, Ann. Math. Stat. 25 (1954), pp. 523—533. This theorem is 
stated in S.R. Paranjape and C. Park, Distribution of the supremum of the two-parameter 
Yeh-Wiener process on the boundary, J. Appl. Prob. 10 (1973). 

Letting & = aA/T, B = b/VT, formula (B.1) becomes 


P(-) = N(—a — B) + e?" N(w — B) wherea, B >0 or 
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P(X(t) €«at--b, O&t « T) 21— P() = N(e + B) - e 7? N(a — p) 


for the probability the line is never surpassed. This follows from: 


l °° SOT l °° 2/2 
—— e™ 4 ds = —— e" /“dy = N(—a — B) and 
A 2x T JaT +b ~V 2x aA T 4-bj T 
l aT —b 5 
— e 7T du = N(a — B) 
J/2x T f. 


where s — aT + b, x =s/J/T =aVT +6/VT, œ — a T and B = b/VT. 
The formula becomes: 
P (sup| X (0) — (at + b)] 20:0<1<T) 
= N(-a — B) ce "^ N (æ — f) 
= N(-a — B) ce"? N (& — B), o, B > 0. 
Observe that 
P()«N(-a-B)t Na-B) = [1 -Nla + Bj Ne-p). 
a —f [oe] 
-f awd f a(x)dx < I 
—oo a tf 


as it should be. 


Appendix C. Expected time to reach goal 


Reference: Handbook of Mathematical Functions, Abramowitz and Stegun, Editors, 
N.B.S. Applied Math. Series 55, June 1964. 
P. 304, 7.4.33 gives with erf z = Y. Io e^" dr the integral: 


J exp{—(a?x? + 2? /x?)] dx 


H VP Go?hertias + b/x) ke "P erf(ax — b/x)]- C, a #0. (cD 


Now the left side is >0 so for real a, we require a > 0 otherwise the right side is «0, 
a contradiction. 
We also note that p. 302, 7.4.3. gives 


oo . 
| exp{—(ar? + b/t’)} dt — idein c2) 
0 H 


with Ra > 0, Rb > 0. 
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To check (C.2) v. (C.1), suppose in (C.1) a > 0, b > O and find lim,_,9 and lim; soo 
of erf(ax + b/x) and erf(ax — b/x), 


li ix +b = ; li —b = 00, 
Tim (ax /x) = +00 Jim (ax /x) oO 

lim (ax + b/x) = +00, lim (ax — b/x) = +œ. 
x> X00 


Equation (C.1) becomes 


YF eom Jn 
4a 


Ne 29 erf(oo) = 
4a 


erf(oo) — erf(—00)] = Y QS 


since we know erf(oo) = 1. 
In (C.2) replace a by a’, b by b? to get 


CO 
l(a’, p?) = Í exp[ - (at? + b?/1?)} di = ezn 
0 a 


which is the same. 
Note: if we choose the lower limit of integration to be 0 in (C.1), then we can find C: 


ot 
0- i exp{—(a?x* + b?/x”)} dx = V [2herf(00) te? erf(—00)] + C 
0 a 


— Aret _ e ^] +C. 
a 


Whence 
F(x) = | exp[— (a?x? + b?/x)} dx 
0 


= Y? (et erttax + b/x) — 1] - e?" erf(ax — b/x) + 1]]. (C.3) 


To see how (C.3) might have been discovered, differentiate: 
F'(x) = exp{—(a*x? + 6? /x?)] 
_ vm 
^. 4a 
t e 1 (a + b/x?) erf (ax — b/x)]. 


(e?^ (a — b/x?) erf (ax + b/x) 


Now erf'(z) = -= exp(—z2) so 
Jn 


erf (ax + b/x) = E: exp|- (ax + b/x)*| = E exp[ - (^x? +b? /x? + 2ab)} 


_— 2 rab (22 1 p2yy2 
= ge exp{—(a*x* + b?/x?)} 
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and, setting b «— —b, 
2 2ab 2.2 27,2 
erf (ax — b/x) = ve exp{—(a°x* + b*/x )) 
P4 
whence 


PO = iG 


rd Ei — b/x?) + (a epe) | exp[—(@2x? + bx?) 
= 5; Da] exp{—(a?x? + b?/x*)} = exp{—(a?x? + 2? x?)]. 
Case of interest: a < 0, b > 0. 
Expect: 
b»0,ax0 > F(T)flasT > oo, 
b>0,a>0 > F(T)fc«lasT > oo. 
Ifb>0,a=0: 


F(T) = N(-B) + N(-f) = 2N(-b/VT) f 2NO) 21. as T 1 oc. 


Also, as expected F(T) 4 | as b |, 0. 
If b > 0,a < 0: See below. 
Ifb>0,a> 0: 


N(-aVT —b/VT) +N (aVT — b/VT ) 
— N(—oo) +e ?"^N (oo) 
— e ^b 


F(T) 


<l ast? Too. 
This is correct. 

Ifb 0 F(T)— N(-aVT ) + N(aVT ) = 1. This is correct. 

Let F(T) = P(X(t) 2 at +h for some t, 0 < t < T) which equals N(—a — B) + 
e 7^ N (y — B) where a = av T and B = b/VT so ab = of; we assume b > 0 and 
a < Qin which case 0 € F(T) Landlimp.,o59 F(T) = 1, imzo F(T) —0; F isa 
probability distribution function: 


lim F(T) = N(—oo) +e ""N(—oo) = 0, 
T30 
lim F(T) = N(+oo) +e ""N(—oo) = 1. 
T—oo 


The density function is 


ð a 
(N= F(T) = — BN’ =b — (a — B)N'(a — 
{(T) (T) = gr ca PN B)t+e ap B)N'(a — B) 
where 


au J 1/2 ap | 
uM —— = " 
aT L aT 2 
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N'(-a — B) = 


MED | ol TE Een) 
J 20 J 20 p 2 : 


l 2 l 2T +b?/T — 2ab 
N'(œ — p) = goce exp GU s jJ 
J 2 2 


= ee: NR (a?T +b*/T) 
= ees 1/2 S 3/2 ab = 
rer) =7( aT + 5 OF jae e| e 


20 2 


2 2 
4 Te Mh ( Lapi tae) T qe ex | (a^T +b*/T) 
2 2 J2n 2 


—ab 2 2 
e | car? + T= exp — * T+b 2 


H 24/27 2 
?T +b?/T 
+ (ert? bT exp| “PLO 
bea 1/2 cu T b^ rT) 
= ———T*exp4———— — — f. 
A/ 2r 2 


The expected time to the goal is 


e «^ ce 


a b 
E =| Tf(T)dT = 
i 0 V2n Jo 
T! =x 


Ex. Pola?) 
TG) 


T 
Io(a?, p?) 2 VZ erzh 


2 2 
(5) ; (=) ) = NE lao whence 
v2 v2 V2la| 
Del E b 
Eos = el g <0, b> O. 
? SIr val lal 


2 12 T 
pag] TAIT a 


so 


Note: 
; Diu. —(a?T +b?/T 
f(T) = FT) = T exp EI: 


>0 
2 


for all a,e.g., a < 0, so F(T) is monotone increasing. Hence, since limz., o5 F(T) = 1 
fora < O and limo F(T) < | fora > 0,0 € F(T) < | forall T so we have more 
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confidence in using the formula for a « 0 too. 


Check: Ego(a,b) | Oas | —co yes, 


Ela, b) T asb 1 yes, 
Eoola, b) ^ as|a| | yes, 

note lim Egs(a, b) = +00 as suspected. 
alor 


This leads us to believe that in a fair coin toss (fair means no drift) and a gambler 
with finite capital, the expected time to ruin is infinite. 
This is correct. Feller gives D = z(a — z) as the duration of the game, where z is 
initial capital, ruin is at 0, and a is the goal. Then limyo D(a) = +00. 
Note: Ego = b/|a| means the expected time is the same as the point where aT + b 
crosses X(t) = 0. See Figure 2. 
Eo =b/la|, a-—-m[s. b=lIna, 
à = C/ Xo = normalized goal, 
m — pln(l E f) t qln(l — f) = eC), 
s? = pa{in((l+ 0/0 - If. 
Kelly fraction f* = p —q.  g(f*) = pln2p —- q1n2q, 
Form > 0, Eo — (In Xs / gt f). 
Now this is the expected time in variance units. However s? variance units — 1 trial 
so 
E In A In A 
na, f= -5 = = 
s gf) m 


is the expected number of trials. 


Check: n(A,f)T]T asAT, 
HA, f)— oo asà > oo, 


aA, f) asm) 0, 


n(A. f)— oo asm — 0. 


Now et has unique maximum at g(f*) where f* = p — q, the "Kelly fraction", 


therefore nt, f/ has a unique minimum for f = f*. Hence f* reaches a fixed goal in 
least ex pected time in this, the continuous case, so we must be asymptotically close to 
least expected time in the discrete case, which this approximates increasing by well in 
the sense ot the CEF (Central Limit Theorem) and its special case, the normal approx- 
imation to the binomial distribution. The difference here is the trials are asymmetric. 


The positise and negative step sizes are unequal. 
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KELLY CAPITAL GROWTH INVESTMENT CRITERION 
THEORY ana PRACTICE 


"Utility functions and marginal utility, a gift from mathematics to economics, helped to decisively shape 
economic thinking over the past centuries. Logarithmic utility played a central role from the beginning. 
Now often named after the seminal work of John Kelly Jr. in 1956, it maximizes the geometric growth 
rate of a portfolio. 


This handbook presents the classical papers and highlights from modern research, along with limitations 
on the use of the Kelly criterion." 
Walter Schachermayer 


Fokultat für Mathematik, Universitat Wien 


"The tragically short-lived genius John Kelly Jr. is best remembered for one of the most original and 
far-reaching ideas in modern finance. The Kelly criterion can be described as a gambling “system” that 
really works, in that it achieves the maximum long-term return from a favorable speculation. Kelly's 
idea has long had a cult following among people ranging from hedge fund managers to blackjack players. 


For those who have heard of the Kelly mythos and want to explore the science behind it, this book will 

be an instant classic. The editors have collected all the pivotal original papers, spanning centuries and 

the rarely bridged gulf between theory and practice. This book is indispensable for anyone interested in 
Kelly's legacy." 

William Poundstone 

The author of "Fortune's Formula: The Untold Story of the Scientific Betting System 

That Beat the Casinos and Wall Street" 


"Edward O. Thorp and the Kelly criterion have been a lighthouse for risk management for me and PIMCO 
for over 45 years. First at the blackjack tables, and then in portfolio management, the Kelly system has 
helped to minimize risk and maximize return for thousands of PIMCO clients." 


William H. Gross, Managing Director, PIMCO 


About the Book 

This volume provides the definitive treatment of fortune's formula or the Kelly capital growth criterion 
as it is often called. The strategy is to maximize long run wealth of the investor by maximizing the 
period by period expected utility of wealth with a logarithmic utility function. Mathematical theorems 
show that only the log utility function maximizes asymptotic long run wealth and minimizes the 
expected time to arbitrary large goals. In general, the strategy is risky in the short term but as the 
number of bets increase, the Kelly bettor's wealth tends to be much larger than those with essentially 
different strategies. So most of the time, the Kelly bettor will have much more wealth than these other 
bettors but the Kelly strategy can lead to considerable losses a small percent of the time. There are 
ways to reduce this risk at the cost of lower expected final wealth using fractional Kelly strategies 
that blend the Kelly suggested wager with cash. The various classic reprinted papers and the new ones 
written specifically for this volume cover various aspects of the theory and practice of dynamic investing. 
Good and bad properties are discussed, as are fixed-mix and volatility induced growth strategies. The 
relationships with utility theory and the use of these ideas by great investors are featured. 


